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THE 3x + 1 PROBLEM AND ITS GENERALIZATIONS 


JEFFREY C. LAGARIAS 
AT & T Bell Laboratories, Murray Hill, NJ 07974 


1. Introduction. The 3x + 1 problem, also known as the Collatz problem, the Syracuse prob- 
lem, Kakutani’s problem, Hasse’s algorithm, and Ulam’s problem, concerns the behavior of the 
iterates of the function which takes odd integers n to 3n + 1 and even integers n to n/2. The 
3x + 1 Conjecture asserts that, starting from any positive integer n, repeated iteration of this 
function eventually produces the value 1. 

The 3x + 1 Conjecture is simple to state and apparently intractably hard to solve. It shares 
these properties with other iteration problems, for example that of aliquot sequences (see Guy 
[36], Problem B6) and with celebrated Diophantine equations such as Fermat’s last theorem. Paul 
Erdés commented concerning the intractability of the 3x + 1 problem: “Mathematics is not yet 
ready for such problems.” Despite this doleful pronouncement, study of the 3x + 1 problem has 
not been without reward. It has interesting connections with the Diophantine approximation of 
log, 3 and the distribution (mod1) of the sequence {(3/2)*: k = 1,2,...}, with questions of 
ergodic theory on the 2-adic integers Z,, and with computability theory—a generalization of the 
3x + 1 problem has been shown to be a computationally unsolvable problem. In this paper I 
describe the history of the 3x + 1 problem and survey all the literature I am aware of about this 
problem and its generalizations. 

The exact origin of the 3x + 1 problem is obscure. It has circulated by word of mouth in the 
mathematical community for many years. The problem is traditionally credited to Lothar Collatz, 
at the University of Hamburg. In his student days in the 1930’s, stimulated by the lectures of 
Edmund Landau, Oskar Perron, and Issai Schur, he became interested in number-theoretic 
functions. His interest in graph theory led him to the idea of representing such number-theoretic 
functions as directed graphs, and questions about the structure of such graphs are tied to the 
behavior of iterates of such functions [25]. In his notebook dated July 1, 1932, he considered the 
function 


=n, if n = 0 (mod 3), 
g(n) = =n ; if n = 1 (mod 3), 

4 ; 

3 , if n = 2 (mod 3), 


which gives rise to a permutation P of the natural numbers 


pe(h 234567 8 9 | 
132574 9 ll 6 : 


He posed the problem of determining the cycle structure of P, and asked in particular whether or 
not the cycle of this permutation containing 8 is finite or infinite, i.e., whether or not the iterates 
g‘*) (8) remain bounded or are unbounded [24]. I will call the study of the iterates of g(n) the 


Jeffrey C. Lagarias: I was first exposed to the 3x + 1 problem in 1967 as a high school student working at the 
National Bureau of Standards. Afterwards I worked on it from time to time. Out of curiosity and frustration I 
gradually became a historian of the problem, accumulating a collection of papers about it. This survey is a happy 
consequence. I obtained a Ph.D. (1974) in analytic number theory at M.I.T. under the supervision of Harold Stark. I 
have been on the staff of AT & T Bell Laboratories since then, and have held visiting positions at the University of 
Maryland (mathematics) and Rutgers University (computer science). My research interests include computational 
complexity theory, number theory, and cryptography. 
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original Collatz problem. Although Collatz never published any of his iteration problems, he 
circulated them at the International Congress of Mathematicians in 1950 in Cambridge, Mas- 
sachusetts, and eventually the original Collatz problem appeared in print ([9], [47], [59]). His 
original question concerning g‘*) (8) has never been answered; the cycle it belongs to is believed 
to be infinite. Whatever its exact origins, the 3x + 1 problem was certainly known to the 
mathematical community by the early 1950’s; it was discovered in 1952 by B. Thwaites [69]. 

During its travels the 3x + 1 problem has been christened with a variety of names. Collatz’s 
colleague H. Hasse was interested in the 3x + 1 problem and discussed generalizations of it with 
many people, leading to the name Hasse’s algorithm [40]. The name Syracuse problem was 
proposed by Hasse during a visit to Syracuse University in the 1950’s. Around 1960, S. Kakutani 
heard the problem, became interested in it, and circulated it to a number of people. He said “For 
about a month everybody at Yale worked on it, with no result. A similar phenomenon happened 
when I mentioned it at the University of Chicago. A joke was made that this problem was part of 
a conspiracy to slow down mathematical research in the U.S. [45].” In this process it acquired the 
name Kakutani’s problem. S. Ulam also heard the problem and circulated the problem at Los 
Alamos and elsewhere, and it is called Ulam’s problem in some circles ({13], [69]). 

In the last ten years the 3x + 1 problem has forsaken its underground existence by appearing 
in various forms as a problem in books and journals, sometimes without attribution as an 
unsolved problem. Prizes have been offered for its solution: $50 by H. S. M. Coxeter in 1970, then 
$500 by Paul Erdés, and more recently £1000 by B. Thwaites [69]. Over twenty research articles 
have appeared on the 3x + 1 problem and related problems. 

In what follows I first discuss what is known about the 3x + 1 problem itself, and then discuss 
generalizations of the problem. I have included or sketched proofs of Theorems B, D, E, F, M and 
N because these results are either new or have not appeared in as sharp a form previously; the 
casual reader may skip these proofs. 


2. The 3x + 1 problem. The known results on the 3x +1 problem are most elegantly 
expressed in terms of iterations of the function 


3n4+1 


if n = 1 (mod 2), 
(2.1) T(n) = 
if n = 0 (mod 2). 


No | 3 


One way to think of the 3x + 1 problem involves a directed graph whose vertices are the positive 
integers and that has directed edges from n to T(n). I call this graph the Collatz graph of T(n) in 
honor of L. Collatz [25]. A portion of the Collatz graph of T(n) is pictured in Fig. 1. A directed 
graph is said to be weakly connected if it is connected when viewed as an undirected graph, Le., 
for any two vertices there is a path of edges joining them, ignoring the directions on the edges. The 
3x + 1 Conjecture can be formulated in terms of the Collatz graph as follows. 


3x + 1 CONJECTURE (First form). The Collatz graph of T(n) on the positive integers is weakly 
connected. 


We call the sequence of iterates (n, T(n), T?(n), T®(n), ...) the trajectory of n. There are 
three possible behaviors for such trajectories when n > 0. 


(i) Convergent trajectory. Some T“)(n) = 1. 
(ii) Non-trivial cyclic trajectory. The sequence 7‘*)(n) eventually becomes periodic and 
T)(n) #1 for any k > 1. 
(iii) Divergent trajectory. lim, _,,, T)(n) = oo. 


The 3x + 1 Conjecture asserts that all trajectories of positive n are convergent. It is certainly true 
for n> 1 that T“)(n) =1 cannot occur without some T“*)(n) < n occurring. Call the least 
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Fic. 1. A portion of the Collatz graph of iteraies of the function T(7). 


positive k for which T‘)(n) < n the stopping time a(n) of n, and set o(n) = 00 if no k occurs 
with T‘)(n) < n. Also call the least positive k for which T‘)(n) = 1 the total stopping time 
g,,(n) of n, and set o,,(n) = oo if no such k occurs. We may restate the 3x + 1 Conjecture in 
terms of the stopping time as follows. 


3x + 1 CONJECTURE (Second form). Every integer n > 2 has a finite stopping time. 


The appeal of the 3x + 1 problem lies in the irregular behavior of the successive iterates 
T‘*)(n). One can measure this behavior using the stopping time, the total stopping time, and the 
expansion factor s(n) defined by 


sup T’)(n) 
k>0 
s(n) = 


9 


n 


if n has a bounded trajectory and s(n) = +00 if n has a divergent trajectory. For example 
n = 27 requires 70 iterations to arrive at the value 1 and 


sup T*) (27) 


Table 1 illustrates the concepts defined so far by giving data on the iterates T‘*)(n) for selected 
values of n. 


6 JEFFREY C. LAGARIAS [January 


TABLE 1. Behavior of iterates T‘*)(n). 
ay x | 
1 00 2 2 


7 3.7 
171. 
6.37 x 108 
1 
1.50 
1.11 x 10% 
1.50 


The 3x + 1 Conjecture has been numerically checked for a large range of values of n. It is an 
interesting problem to find efficient algorithms to test the conjecture on a computer. The current 
record for verifying the 3x + 1 Conjecture seems to be held by Nabuo Yoneda at the University 
of Tokyo, who has reportedly checked it for all n < 2*° = 1.2 x 10!” [2]. In several places the 
statement appears that A. S. Fraenkel has checked that all n < 2°° have a finite total stopping 
time; this statement is erroneous [32]. 


2.1. A heuristic argument. The following heuristic probabilistic argument supports the 3x + 1 
Conjecture (see [28]). Pick an odd integer n, at random and iterate the function T until another 
odd integer n, occurs. Then 4 of the time n, = (3, + 1)/2, 4 of the time n, = (32, + 1)/4, $ 
of the time n, = (32, + 1)/8, and so on. If one supposes that the function T is sufficiently 
“mixing” that successive odd integers in the trajectory of n behave as though they were drawn at 
random (mod 2*) from the set of odd integers (mod 2*) for all k, then the expected growth in size 
between two consecutive odd integers in such a trajectory is the multiplicative factor 


(3) (3) (a) ae] 
2 4 8 | 


Consequently this heuristic argument suggests that on average the iterates in a trajectory tend to 
shrink in size, so that divergent trajectories should not exist. Furthermore it suggests that the total 
stopping time a,.(7) is (in some average sense) a constant multiple of log n. 

From the viewpoint of this heuristic argument, the central difficulty of the 3x + 1 problem lies 
in understanding in detail the “mixing” properties of iterates of the function T(n) (mod 2*) for 
all powers of 2. The function T(n) does indeed have some “mixing” properties given by 
Theorems B and K below; these are much weaker than what one needs to settle the 3x + 1 
Conjecture. 


2.2. Behavior of the stopping time function. It is Riho Terras’s ingenious observation that 
although the behavior of the total stopping time function seems hard to analyze, a great deal can 
be said about the stopping time function. He proved the following fundamental result ([64], [65]), 
also found independently by Everett [31]. 


THEOREM A (Terras). The set of integers S, = {n: n has stopping time < k} has a limiting 
asymptotic density F(k), i.e., the limit 
1 
F(k) = lim —~#{n:n<x and o(n)<k} 
x70 X 


exists. In addition, F(k) ~ 1 as k > o0, so that almost all integers have a finite stopping time. 


The ideas behind Terras’s analysis seem basic to a deeper understanding of the 3x +1 
problem, so I describe them in detail. In order to do this, I introduce some notation to describe 
the results of the process of iterating the function T(n). Given an integer n, define a sequence of 
0-1 valued quantities x;(n) by 


(2.2) TO(n) =x,(n) (mod2), 0<i<o, 
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where T(n) =n. The results of the first k iterations of T are completely described by the 
parity vector 


(2.3) v,.(n) = (x9(n),..-,X%-1(7)), 


since the result of k iterations is 


(2.4) T’(n) =A,(n)n + p,(n), 
where 
Bxo(m)+ +x 4 —1(1) 
(2.5) A, (a) = a 
and 
k-1 Breit ov +x 4-171) 
(2.6) p,(n) = d xi(0) a 


Note that in (2.5), (2.6) both A, and p, are completely determined by the parity vector v = v, (7) 
given by (2.3); I sometimes indicate this by writing A,(v), p,(v) (instead of A,(n), ,(n)). The 
formula (2.4) shows that a necessary condition for T)(n) <n is that 


(2.7) A,(n) < 1, 


since p,(n) is nonnegative. Terras [64] defines the coefficient stopping time w(n) to be the least 
value of k such that (2.7) holds, and + oo if no such value of k exists. It is immediate that 


(2.8) w(n) < a(n). 
The function w(n) plays an important role in the analysis of the behavior of the stopping time 
function o(7), see Theorem C. 


The formula (2.2) expresses the parity vector v = v,(n) as a function of n. Terras’s idea is to 
reverse this process and express n as a function of v. 


THEOREM B. The function Q,: Z— Z/2*Z defined by 


k-1 


Q,(n) = » x,(n)2' 
i=0 
is periodic with period 2*. The induced function QO,: Z/2*Z — Z/2*Z is a permutation, and its 
order is a power of 2. 


Proof (sketch). The theorem is established by induction on k, using the inductive hypotheses: 
(1) x,(n) is periodic with period 2'** for 0 < i < k — 1. In fact 


(2.9) x,(n + 2') = x,(n) + 1 (mod 2) 
forO0 <i<k-1. 
(2) O,(n) is periodic with period 2*. 
(3) A,(n) and p,(n) are periodic with period 2". 
(4) QO, is a permutation whose order divides 2*. Also 
(2.10) O,(n + 24!) = O,(n) + 2*~! (mod 2*). 
I omit the details. @ 
The cycle structure and order of the first few permutations Q, are given in Table 2. 
(One-cycles are omitted.) It is interesting to observe that the order of the permutation Q, seems to 


be much smaller than the upper bound 2° proved in Theorem B. Is there some explanation of this 
phenomenon? 
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TABLE 2. Cycle structure and order of permutation Q,. 


identity 

identity 

(1, 5) 

(1, 5)(2, 10)(9, 13) 


(1, 21)(2, 10)(4, 20)(5, 17)(7, 23)(9, 29, 25, 13)(18, 26) 
(1, 21)(2, 42)(3, 35)(4, 20)(5, 17, 37, 49) 

(7, 23)(8, 40)(9, 29, 25, 13)(10, 34) 

(18, 58, 50, 26)(19, 51)(27, 59)(33, 53) 

(36, 52)(39, 55)(41, 61, 57, 45) 


Theorem B allows one to associate with each vector v = (U),...,0,_1) © (Z/2Z)* of length k 
a unique congruence class S(v) (mod 2*) given by 


S(v) = {n:v =(x0(n),...,x,%_1(n))}. 
The integer 
No (¥) = (Q,) | » o2 (mod 2‘) 
i=0 
with 0 < n,(v) < 2* is the minimal element in S(v) and S(v) is the arithmetic progression: 
S(v) = {no(v) + 24:0 <i < oo}. 


Now I consider the relation between a vector v and stopping times for integers n © S(v). 
Define a vector v = (Up, 01,...,U,_1) of length k to be admissible if 

(1) (u) + +++ +0,_,)In3 < k1n2, 

(2) (Up + ++: +0,)In3 > (+ Dln2, when0 <i< k —2. 


Note that all admissible vectors v of length k have 
(2.11) Ug + oes +v,_, =[k 4], 


where 6 = In2/1n3 = (log, 3)! = .63093 and [x] denotes the largest integer < x. The follow- 
ing result is due to Terras. 


THEOREM C (Terras). (a) The set of integers with coefficient stopping time k are exactly the set of 
integers in those congruence classes n (mod 2") for which there is an admissible vector v of length k 
with n = no(V). 

(b) Let n = no(v) for some vector v of length k. If v is admissible, then all sufficiently large 
integers congruent to n (mod 2“) have stopping time k. If v is not admissible, then only finitely 
many integers congruent to n (mod 2*) have stopping time k. 


Proof. The assertions made in (a) about coefficient stopping times follow from the definition 
of admissibility, because that definition asserts that 
(i) A.W) < 1, 
(i) A,(v)>1 for l<i<gk-1. 
To prove (b), first note that if v is admissible of length k, then 
3¥0F sos FU, 4 


T(n) > no for l<i<k-1, 
and so all elements of S(v) have stopping time at least k. Now define e, > 0 by 
31k 9] 
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where @ = (log, 3)‘, and note that (2.11) implies that 


3¥0F cee +U,_4 


9k 
for all admissible v. Now for n € S(v) for an admissible v, (2.4) may be rewritten as 
(2.13) T(n) =n +(p,(v) — &n). 

Hence when v is admissible, those n in S(v) with 

(2.14) n> & 'P,(V) 


have stopping time k, and w(n) = o(n) = k in this case. 

Now suppose v is not admissible. There are two cases, depending on whether or not some 
initial segment (Up,...,v;) of v is admissible. No initial segment of v is admissible if and only if 
(2.15) (up + ++: +0,_,)log3 > ilog2 for 1<i<k-—1, 


and when (2.15) holds say that v is inflating. If v is inflating, \,(v) > 1 so that T“)(n) > n for all 
n in S(v) by (2.4), so that no elements of S(v) have stopping time k or less. In the remaining case 
v has an initial segment w = (Up, v,,...,U;) with i < k — 1 which is admissible. Now S(v) C S(w) 
and all sufficiently large elements of S(w) have stopping time i + 1 < k by the argument just 
given. 


Theorem C asserts that the set of integers J, with a given coefficient stopping time k is a set of 
arithmetic progressions (mod 2"), which has the immediate consequence that J, has the asymp- 
totic density 


1 
d(1I,) = lim 5 #{ nin <xandneéI,} 
Xx 0c 
which is given by 
d(I,) = i # {v: vis admissible and of length k}. 
Furthermore Theorem C asserts that the set 


S, = {n: n has stopping time k } 


differs from J, by a finite set, so that S, also has an asymptotic density which is the same as that 
of I,. Consequently, Theorem C implies the first part of Theorem A, that the set of all integers 
with stopping time at most k have an asymptotic density F(k) given by 


(2.16) F(k)= ¥ weight (v), 
v admissible 
length (v)<k 

where 


weight (v) = 27 lensth™ 


Now the formula (2.16) can be used to prove the second part of Theorem A, and in fact to prove 
the stronger result that F(k) approaches 1 at an exponential rate as k > oo. 


THEOREM D. For all k > 1, 


(2.17) 1 — F(k) = lim = #{ nin <xand o(n) > k} <2, 
where 
(2.18) n=1-H(6) = .05004.... 


Here H(x) = —x log, x — (1 — x)log,(1 — x) is the entropy function and 6 = (log, 3)"'. 
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Proof. Let C = C, U C,, where 
C, = {v: vis admissible and length (v) < k} 
and 
C, = {v: vis inflating and length (v) = k}. 


Then C has the property that for any binary word w of length k there is a unique v € C withva 
prefix of w. Now for any v with length (v) < k 


weight(v) = )° weight(w), 
where the sum is over all w of length k for which v is a prefix of w. Hence 


>> weight(v)= >) — weight(w) =1. 


veEC length(w) =k 
From (2.16) this implies that 


L weight(v) =2°“|C,|=1—- F(k), 


ve C> 
where |C,| denotes the number of vectors in C,. The already proved first part of Theorem A shows 
that 
. 1 
1 — F(k) = lim <# {nin < x and o(n) > k}, 
xXx CO 


so that to prove (2.17) it suffices to bound |C,| from above. 
Now the definition (2.15) of an inflating vector implies that 


k-1 

C, Cc f > v, > Ka), 

i=0 

so that ° 

(2.19) IO|< » (‘). 
j>ké@ J 


The right side of (2.19) is just the tail of the binomial distribution. It is easily checked using 
Stirling’s formula that for any constant a > + and any e > 0 the bound 


» (‘) < K€ ba < J(H(a)+ek 


j>kea J 


holds for all sufficiently large k. With more work one can obtain the more precise estimate (Ash 
[8], Lemma 4.7.2) that for any a > 4 


y (‘) < Dak 
j>kea J 
which used in (2.19) implies (2.17). 
Theorem D cannot be substantially improved; it cannot be proved that for any e > 0 we have 
|C,| > 204 —Hk 
for all sufficiently large k depending on e. Hence for any e > 0 
1 — F(k) > 27>(t 94 


holds for all sufficiently large k depending on e. 


1985] THE 3x + 1 PROBLEM AND ITS GENERALIZATIONS 11 


2.3. What is the relation between the coefficient stopping time and the stopping time? Theorem 
C shows that generally they are equal: For any fixed k at most a finite number of those n having 
coefficient stopping time w(n) < k have a(n) # w(n). Terras [64] and later Garner [34] conjec- 
ture that this never occurs. 


COEFFICIENT STOPPING TIME CONJECTURE. For all n > 2, the stopping time a(n) equals the 
coefficient stopping time w(n). 


The Coefficient Stopping Time Conjecture has the aesthetic appeal that if it is true, then the set 
of positive integers with stopping time k is exactly a collection of congruence classes (mod k), as 
described by part (i) of Theorem C. Furthermore, the truth of the Coefficient Stopping Time 
Conjecture implies that there are no nontrivial cycles. To see this, suppose that there were a 
nontrivial cycle of period k and let ny be its smallest element, and note that o(7,.) = oo. Then 
T') (ny) > no for1 <i< k—- 1 and 


(2.20) T (ng) =Ax (M9) 29 + px ( 10) = No- 
Now p,(1,) # 0 since ny isn’t a power of 2, so that (2.20) implies that A,(m,)) < 1. Hence 
w(ny) <k, so that w(no) # a(n). 

The following result shows that the Coefficient Stopping Time Conjecture is “nearly true.” I 
will use it later to bound the number of elements not having a finite stopping time. 


THEOREM E. There is an effectively computable constant ky such that if v is admissible of length 


k > ko, then all elements of S(v) have stopping time k except possibly the smallest element no(v) 
of S. 


Proof (sketch). The results of A. Baker and N. I. Feldman on linear forms in logarithms of 
algebraic numbers ({10], Theorem 3.1) imply that there is an effectively computable absolute 
constant cy > 0 such that for all k,/ > 1, 


|Alog2 — /log3| > k™%. 


Consequently there is an effectively computable absolute constant c, such that for k,/ > c, one 
has 


1 
k_ ah A5kp-4 
|2 3'| > 72 ko, 
and (2.12) then yields 


Ex => ko %, 


Since v is admissible, vy + --- +v,_, < 9k, where 6 = (log, 3) * by (2.11). Therefore 


K-V gag yt F 0K [KA] 3 3 \ 9k 
poy So (FE) cua (3 
i=0 


Qk-i ~0 qitl 
i= 


< k20-9k, 
But all elements of S(v) except n,(v) exceed 2* and 
qk xs Kaitlgd-Ok y e 'p,(v) 
for all sufficiently large k, so the theorem follows by (2.14). @ 


2.4. How many elements don’t have a finite stopping time? The results proved so far can be 
used to obtain an upper bound for the number of elements not having a finite stopping time. Let 


a*(x) =|{n: n<xand a(n) < oo} |. 


The following result is the sharpest known result concerning the size of the “exceptional” set of n 
with o(n) = oo. 
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THEOREM F. There is a positive constant c, such that 
(2.21) |7*(x) — x| < ¢,x'™, 
where n ~ .05004...is the constant defined in Theorem D. 
Proof. Suppose 2*~! < x < 2*. Then 
|{ ain < x and o(n) = 00}| = |7*(x) — x] < S, + S), 


where S, = #{n <2": w(n)>k+1} and S,= #{n <2*: w(n)<k and w(n)# a(n)}. 
Now Theorem D shows that 


(2.22) S, <o,(2*)" <2e,x1-7, 
and Theorem E shows that 
S, < #{v: v admissible and length (v) < k} + ¢,, 


where c; = #{n: w(n) < ky and w(n) # o(n)} is a constant by Theorem C. Now 
# {v: v admissible and length (v) =i} < #{v: u + --: +0,_, =[i8]} = Lia) 


< C4200! 
using the binomial theorem and Stirling’s formula. Hence 
Sy < c52°~* +c, < (2c, + €3) x17". 


Then this inequality and (2.22) imply (2.21) with c, = 2c, +c; + 2c;. & 


2.5. Behavior of the total stopping time function. Much less is known about the total stopping 
time function than about the stopping time function. One phenomenon immediately observable 
from a table of the total stopping times of small integers is the occurrence of many pairs and 
triples of integers having the same finite total stopping time. From Figure 1 we see that 
0,, (20) = 9,,(21) = 6, 9, (12) = 6, (13) = 7, 9, (84) = G,, (85) = 8, Go, (52) = 9, (53) = 9, and 
0,, (340) = o,, (341) = 10. Indeed for larger values of n, multiple consecutive values occur with the 
same total stopping time. For example there are 17 consecutive values of n with o,,(n) = 40 for 
7083 < n < 7099. A related phenomenon is that over short ranges of n the function o,,(n) tends 
to assume only a few values (C. W. Dodge [67]). As an example the values of o,,(n) for 
1000 < n < 1099 are given in Table 3. Only 19 values for o,,(n) are observed, for which a 
frequency count is given in Table 4. Both of these phenomena have a simple explanation; they are 
caused by coalescence of trajectories of different n’s after a few steps. For example the 
trajectories of 8k + 4 and 8k + 5 coalesce after 3 steps, for all k > 0. More generally, the large 
number of coalescences of numbers n, and n, close together in size can be traced to the trivial 
cycle (1,2), as follows. Suppose n, and n, have o,,(n,) = o,,(n,) (mod 2), and let o,,(n,) =" > 
0,,(n,) =r. Then the trajectories of n, and n, coalesce after at most r, — 1 iterations, since 
T'"~Y)(n,) = T™—)(n,) = 2, since the trajectory of n, continues to cycle around the trivial 
cycle. If in addition A, (7) = A,,-1(”2), which nearly always happens if n, and n, are about 
the same size, then the trajectories of 2"~'k + n,, and 2"~'k + n, coalesce after at most r, — 1 
iterations, for k > 0. In particular, o,,(2" 'k + n,) = 6,,(2" 'k + n,) then holds for k > 1. In 
this case the original coalescence of n, and n, has produced an infinite arithmetic progression 
(mod 2"~') of coalescences. The gradual accumulation of all these arithmetic progressions of 
coalescences of numbers close together in size leads to the phenomena observed in Tables 3 and 4. 

Although the 3x + 1 Conjecture asserts that all integers n have a finite total stopping time, the 
strongest result proved so far concerning the density of the set of integers with a finite total 
stopping time is much weaker. 
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TABLE 3. Values of the total stopping time o,,(”) for 1000 < n < 1099. 
1000 1010 1020 1030 1040 1050 1060 1070 1080 1090 
—1009 —1019 —1029 —1039 —1049 —1059 —1069 —1079 —1089 —1099 
72 42 34 80 23 23 80 18 31 31 


91 42 34 26 80 61 80 88 31 


OoOn NHN BR WN YK © 


TABLE 4. Values of o,, (1) and their frequencies for 1000 < n < 1099. 
0,,(n) freq. 0,,(”) freq. 0,,(1) freq. 0,,(1) freq. 
10 1 29 50 3 83 


1 5 
15 1 


THEOREM G (Crandall). Let 
Troral( X) =|{ nin < xand o,,(n) < oo}|. 
Then there is a positive constant c, such that 
Total Xx) > x% 
for all sufficiently large x. 


Assuming that the 3x + 1 Conjecture is true, one can consider the problem of determining the 
expected size of the total stopping time function o,,(n). Crandall [28] and Shanks [60] were guided 
by probabilistic heuristic arguments (like the one described earlier) to conjecture that the average 
order of o,,(”) should be a constant times Inn; more precisely, that 


12 4\"! 
xb Ge(m) ~ 2(In5 In x. 


A modest amount of empirical evidence supports these conjectures, see [28]. 


2.6. Are there non-trivial cycles? A first observation is that there are other cycles if negative 
integers are allowed in the domain of the function. There is a cycle of period 1 starting from 
n = —1, and there are cycles of length 3 and 11 starting from n = —5 and n = —17, respec- 
tively. Bohm and Sontacchi [13] conjecture that these cycles together with the cycles starting with 
n = 0 and n = 1 make up the entire set of cycles occurring under iteration of T(n) applied to the 
integers Z. Several authors have proposed the following conjecture ({13], [28], [41], [64)). 


FINITE CYCLES CONJECTURE. There are only a finite number of distinct cycles for the function 
T(n) iterated on the domain Z. 


One can easily show that for any given length & there are only a finite number of integers n 
that are periodic under iteration by T with period k, in fact at most 2" such integers, as observed 
by Bohm and Sontacchi [13]. To see this, substitute the equation (2.4) into 


(2.23) T’(n)=n, nEZ 
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Xgtrrs +xy-1 Xotces +xXp-] k-1 I 
(2.24) 1) - 2 


9k 
There are only 2* choices for the 0-1 vector v = (x9,...,X,_1), and for each choice of v the 
equation (2.24) determines a unique rational solution n = n(v). Consequently there are at most 2* 
solutions to (2.23). Bohm and Sontacchi also noted that this gives an (inefficient) finite procedure 
for deciding if there are any cycles of a given length k, as follows: Determine the rational number 
n(v) for each of the 2* vectors v, and for each n(v) which is an integer test if (2.23) holds. 

The argument of B6hm and Sontacchi is a very general one that makes use only of the fact that 
the necessary condition (2.24) for a cycle has a unique solution when the values x, are fixed. In 
fact, considerably more can be proved about the nonexistence of nontrivial cyclic trajectories 
using special features of the necessary condition (2.24). For example, several authors have 
independently found a much more efficient computational procedure for proving the nonexistence 
of nontrivial cyclic trajectories of period < k; it essentially makes use of the inequality 


(1 -X,(v)) 2 < p,(¥), 
which must hold for v = (xo, x),..., X,) satisfying (2.24). This approach also allows one to check 
the truth of the Coefficient Stopping Time Conjecture for all n with w(n) < k. The basic result is 
as follows. 


ak x; ZX%ot +x, ° 


i= 


THEOREM H (Terras). For each k there is a finite bound M(k) given by 
(2.25) M(k) = max{ ¢; 'p,(v): v admissible, length (v) =i< k} 


such that w(n) < k implies that w(n) = a(n) whenever n > M(k). Consequently: 
(i) If a(n) < 00 forall n < M(k), then there are no non-trivial cycles of length < k. 
(ii) If w(n) = a(n) for alln < M(k), then w(n) < k implies w(n) = a(n). 


Proof. The existence of the bound M(k) follows immediately from (2.14), and (1i) follows 
immediately from this fact. 

To prove (i), Suppose a nontrivial cycle of length < k exists. We observed earlier that if no 1s 
the smallest element in a purely periodic nontrivial cycle of length < k, then w(n,)) =i < k and 
a(n.) = oo. The first part of the theorem then implies that ny < M(k). This contradicts the 
hypothesis of (i). & 


Theorem H can be used to show the nonexistence of nontrivial cycles of small period by 
obtaining upper bounds for the M(k) and checking that condition (i) holds. This approach has 
been taken by Crandall [28], Garner [34], Schuppar [58] and Terras [64]. In estimating M(k), one 
can show that the quantities p,(v) are never very large, so that the size of M(k) is essentially 
determined by how large 


can get. The worst cases occur when 3!8! is a very close approximation to 2’, ie., when 
i/{i@] is a very good rational approximation to @ = log, 3. The best rational approximations to 
@ are given by the convergents p,/q, of the continued fraction expansion of $= 
[1; 1,1, 2,2, 3,1, 5, 2,23, 2,2,1,1,55,1,4,3...]. Crandall [28] uses general properties of continued 
fraction convergents to obtain the following quantitative result. 


THEOREM I (Crandall). Let ny be the minimal element of a purely periodic trajectory of period k. 
Then 


3, 2No 
2.26 k > min| q,,-—~— — ], 
(2.26) 2 [a qj + o— 
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where p;/q, is any convergent of the continued fraction expansion of log, 3 with j > 4. 


As an application, use Yoneda’s bound [2] that n, > 2*° and choose j = 13 in (2.26), noting 
that g,,; = 190737 and q,, = 10590737, to conclude that there are no nontrivial cycles with period 
length less than 275,000. 

Further information about the nonexistence of nontrivial cyclic trajectories can be obtained by 
treating the necessary condition (2.24) as an exponential Diophantine equation. Davidson [29] calls 
a purely periodic trajectory of period k a circuit if there is a value i for which 


Ng < T(N9) < +++ < T (no) 
and 
TO (no) > TY? (ng) > +++ > TO(n9) = 10, 
i.e., the parity vector v,(%9) = (Xo(M%o),---»X,~_1(Mq)) has the special form 


1, when 0 <j <[k@] —1, 


(2.27) x,( No) ~ ‘ when[kO] <j<k-1, 


where 0 = (log, 3)‘. The cycle starting with ny = 1 is a circuit. Davidson observes that each 
solution to the exponential Diophantine equation 


(2.28) (2¢+> — 36)h=24-1a>l 


gives rise to a circuit of length k = a + b with [k0] = b and n, = 2°h — 1, and conversely. (The 
equation (2.28) is the necessary condition (2.24) specialized to the vector (2.27).) R. Steiner [61] 
showed that (a, b,h) = (1,1,1) is the only solution of (2.28), thus proving the following result. 


THEOREM J (Steiner). The only cycle that is a circuit is the trivial cycle. 


Proof (sketch). Steiner’s method is to show first that any solution of (2.28) with a > 4 has the 
property that (a + b)/b is a convergent in the continued fraction expansion of log, 3, since (2.28) 
implies that 


ave 3 <—__ 1 
b B21 S bin 2(2” — 1) 


He checks that this rational approximation (a + b)/b is so good that it violates the effective 
estimates of A. Baker [10, p. 45] for linear forms in logarithms of algebraic numbers if b > 101°”. 
Finally he checks that (2.29) fails to hold for all that b < 10!” by computing the convergents of 
the continued fractions of log, 3 up to 10'””. m 

The most remarkable thing about Theorem J is the weakness of its conclusion compared to the 
strength of the methods used in its proof. The proof of Theorem J does have the merit that it 
shows that the coefficient Stopping Time Conjecture holds for the infinite set of admissible vectors 
v of the form (2.27). 


(2.29) 0< 


2.7. Do divergent trajectories exist? Several authors have observed that heuristic probabilistic 
arguments suggest that no divergent trajectories occur. 


DIVERGENT TRAJECTORIES CONJECTURE. The function T: Z — Z has no divergent trajectories, 
i.e., there exists no integer ny for which 


(2.30) lim |T“(n,)| = 00. 
k— oo 
If a divergent trajectory {T‘)(n,): 0 < k < oo} exists, it cannot be equidistributed (mod 2). 
Indeed if one defines 
N*(k) =|{ jf: j < k and T/)(ny) = 1 (mod 2)} |, 
then it can be proved that the condition (2.30) implies that 
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N* _ 
(2.31) lim inf (4) > (log, 3)~* = .63097. 


Theorem F constrains the possible behavior of divergent trajectories. Indeed, associated to any 
divergent trajectory D = {T“)(n)): k > 1} is the infinite set Up = {n: n © D and T“(n) > n 
for all k > 1}. Since o(n) = o0 for all n © U,, Theorem F implies that 


(2.32) |{ne Upin<x}|[ <x", 


where 7» = .05004. Roughly speaking, (2.32) asserts that the elements of a divergent trajectory 
cannot go to infinity “too slowly.” 


2.8. Connections of the 3x + 1 problem to ergodic theory. The study of the general behavior of 
the iterates of measure preserving functions on a measure space is called ergodic theory. The 
3x + 1 problem has some interesting connections to ergodic theory, because the function T 
extends to a measure-preserving function on the 2-adic integers Z, defined with respect to the 
2-adic measure. To explain this, I need some basic facts about the 2-adic integers Z,, cf. [14], [49]. 
The 2-adic integers Z, consist of all series 


a=a,+a,2+4,2°4+---, all a; = Oorl, 


where the {a;: 0 < i < co} are called the 2-adic digits of a. One can define congruences (mod 2*) 
on Z, by a=8 (mod2*) if the first k 2-adic digits of a and B agree. Addition and 
multiplication on Z, are given by 


X=a+ B @ X (mod 2*) = a(mod2*) + B(mod 2") for all k, 
X = aB @ X (mod2*) = a(mod 2*) - B(mod 2*) for all k. 
The 2-adic valuation ||, on Z, is given by |0|, = 0 and for a # 0 by |a|, = 2~*, where a, is the 
first nonzero 2-adic digit of a. The valuation | |, induces a metric d on Z, defined by 
d(a,B) =|a— Bl. 


As a topological space Z, is compact and complete with respect to the metric d; a basis of open 
sets for this topology is given by the 2-adic discs of radius 2~* about a: 


B,(a) ={B €Z,: a= B (mod2*)}. 
Finally one may consistently define the 2-adic measure w, on Z, so that 
po ( B,C a)) = 2-5; 
in particular »,(Z,) = 1. The integers Z are a subset of Z,; for example 
—1=141-241-2?4.--. 
Now one can extend the definition of the function T: Z — Z given by (2.1) to T: Z, > Z, by 


- if a = 0(mod2), 


if a = 1 (mod2). 


Ergodic theory is concerned with the extent to which iterates of a function mix subsets of a 
measure space. I will use the following basic concepts of ergodic theory specialized to the measure 
space Z, with the measure p,. A measure-preserving function H: Z, > Z, is ergodic if the only 
l.5-measurable sets E for which H~'(E) = E are Z, and the empty set, ie., such a function does 
such a good job of mixing points in the space that it has no nontrivial p,-invariant sets. It can be 
shown [39, p. 36] that an equivalent condition for ergodicity is that 
N 


lim i 


wal (B,C a)) () B,(B)) = > (B,(a)) w( B,(B)) — Q- (kt) 
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for all a, 8 © Z, and all integers k,/ > 0. This condition in turn is equivalent to the assertion 
that for almost all a € Z, the sequence of iterates 


{ H'(a): i=0,1,2,...} 
is uniformly distributed (mod 2*) for all k > 1. A function H: Z, > Z, is strongly mixing if 
lim p,(H-™(B,(a)) 0 B(B)) = 2-%*? 
N-— 0 


for all a, 8B © Z, and all k,/ > 0. Strongly mixing functions are ergodic. 
The following result is a special case of a result of K. P. Matthews and A. M. Watts [50]. 


THEOREM K. The map T is a measure-preserving transformation of Z. which is strongly mixing. 

Consequently it is ergodic, and hence for almost all a € Z, the sequence 
{T(a): i=0,1,2,...} 
is uniformly distributed (mod 2*) for all k > 1. 

Theorem K implies nothing about the behavior of T on the set of integers Z because it is a 
measure 0 subset of Z,. In fact, the trajectory {T“(n): i = 0,1,2,...} of any integer n can never 
have the property of the conclusion of Theorem K, for if the trajectory is eventually periodic with 
period k, it cannot be uniformly distributed (mod 2**'), while if it is a divergent trajectory, it 
cannot even be equidistributed (mod 2) by (2.31). Consequently, this connection of the 3x + 1 
problem to ergodic theory does not seem to yield any deep insight into the 3x + 1 problem itself. 

There is, however, another connection of the 3x + 1 problem to ergodic theory on Z, that may 
conceivably yield more information on the 3x + 1 problem. For each a € Z, define the 0-1 
variables x; by 

T (a) = x, (mod 2). 
Now define the function Q,,: Z, —- Z, by Q,,(a) = B, where 
(2.33) B=x,+x,24+x,2°+--- 


The value Q,,(a) thus encodes the behavior of all the iterates of a under T. The following result 
has been observed by several people, including R. Terras and C. Pomerance, but has not been 
explicitly stated before. 


THEOREM L. The map Q,,: Z, — Z, is a continuous, one-one, onto, and measure-preserving 
map on the 2-adic integers Z,. 


Proof. This is essentially a consequence of Theorem B. Use the fact that QO, (a) = Q,,(a) 
(mod 2”). For any a,,a, in Z,, if ja, — a,|, < 27”, then a, = a, (mod 2"), so 
0, (a) = Q,,(a%) = Q,,( a2) = Q.,( a2) (mod 2”), 


so that |Q,,(a,) — Q,,(a)| < 27" and Q,, is continuous. If a, # a,, then a, # a, (mod 2”) for 
some n, so that 


0, (a) = Q,,( a) # Q,,( a) = Q,,( a2) (mod 2”) 
and Q., is one-to-one. To see that Q,, is onto, given a one can find B, so that 
Q,,(8,) = « (mod 2"), 


since Q, is a permutation. Then |Q..(8,) — al, < 2~”. Now {B,} forms a Cauchy sequence in 
the 2-adic metric and Z, is compact, hence the limiting value B of {B,} satisfies Q,.(B) = a. 
Now Q;,' is defined, and Q,,(a) = Q;,'(a) (mod 2”) implies that Q>.' is continuous. 


The 3x + 1 Conjecture can be reformulated in terms of the function Q,, as follows. 


3x + 1 CONJECTURE (Third form). Let N* denote the positive integers. Then Q,,(N*) © 3Z. In 
fact 0,(N*) C4Z—-Z. 
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For example Q,, (1) = £%., 27” = —1/3, Q,,(2) = —2/3, and Q,,(3) = —20/3. 

The behavior of the function Q. under iteration is itself of interest. Let Q, denote the set of 
all rational numbers having odd denominators, so that Q, C Z,. The set Q, consists of exactly 
those 2-adic integers whose 2-adic expansion is finite or eventually periodic. The Finite Cycles 
Conjecture is equivalent to the assertion that there is a finite odd integer M such that 


Q,,(Z) S=22 


In fact one can take M = II(2' — 1), where the product runs over all integers / for which there is 
a cycle of minimal length 7. As a hypothesis for further work I advance the following conjecture. 


PERIODICITY CONJECTURE. Q,,(Q;) = Qs. 


For example, one may calculate that Q,,(10) = —26/3, Q,,(—26/3) = —54, 0,(-54) = 
— 82/7, Q0.,(— 82/7) = 2/15. It can be shown that if m has a divergent trajectory, then the 
sequence (xX9(7”), X;(”), X,(”),...) cannot be eventually periodic. As a consequence the truth of 
the Periodicity Conjecture implies the truth of the Divergent Trajectories Conjecture. 


Theorem B has a curious consequence concerning the fixed points of iterates of Q..,. 


THEOREM M. Suppose the kth iterate QO of Q,, has a fixed point a € Z, which is not a fixed 
point of any O for 1 <1< k. Then k is a power of 2. 


Proof. By hypothesis Q%(a) =a and Q&(a)=a,# a, for 1</1<k. All the a,’s are 
distinct for 0 </<k, since OM) (a) = Q&)(a) implies Ots-')(@) =a, since Q,, is one-one 
and onto. Consequently one can pick m large enough so that all the residue classes a, (mod 2”) 
are distinct, for 0 < /<k, where ay = a. Now the action of Q,, (mod 2”) is exactly that of the 
permutation Q,,, hence 


O(a (mod 2”)) = a, (mod 2”) 


for 0 </<k. In particular (ay (mod2”), a, (mod2”),...,a,_; (mod2”)) makes up a single 
cycle of the permutation Q,,, hence k is a power of 2 by Theorem B. @ 


3. Generalizations of the 3x + 1 problem. The 3x + 1 problem can be generalized by consider- 
ing other functions U: N — N defined on the natural numbers N that are similar to the function 
T. The functions I consider to be similar to the function T are the periodically linear functions, 
which are those functions U for which there is a finite modulus d such that the function U when 
restricted to any congruence class k (mod d@) is linear. Some reasons to study generalizations of 
the 3x + 1 problem are that they may uncover new phenomena, they can indicate the limits of 
validity of known results, and they can lead to simpler, more revealing proofs. Here I discuss three 
directions of generalizations of the 3x + 1 problem. These deal with algorithmic decidability 
questions, with the existence of stopping times for almost all integers, and with the fractional parts 
of (3/2)*. 


3.1. Algorithmic decidability questions. J. H. Conway [26] proved the remarkable result that a 
simple generalization of the 3x + 1 problem is algorithmically undecidable. He considers the class 
F of periodically piecewise linear functions g: N — N having the structure 


(3.1) g(n "= TD d) 


specified by the nonnegative integers (d, a),..., @,_,). These are exactly the functions g: N > N 
such that g(n)/n is periodic. 


nifn=k(modd), for0<k<d-1l, 


THEOREM O (Conway). For every partial recursive function f defined on a subset D of the natural 
numbers N there exists a function g: N — N such that 
(1) g(n)/n is periodic (mod d) for some d and takes rational values. 
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(2) There is some iterate k > 1 such that g*)(n) = 2/ for some j if and only if n is in D. 
(3) g(n) = 2/™ for the minimal k > 1 such that g‘"(n) is a power of 2. 


Conway’s proof actually gives in principle a procedure for explicitly constructing such a function 
g given a description of a Turing machine* that computes f. He carried out this procedure to find 
a function g associated to a particular partial recursive function f having the property that 
f(2?") = p,+1, where p, is the nth prime; this is described in Guy [38]. 

By choosing a particular partial recursive function whose domain is not a recursive subset of N, 
e.g., a function f, that encodes the halting problem for Turing machines, we obtain the following 
corollary of Theorem O. 


THEOREM P (Conway). There exists a particular, explicitly constructible function gy: N — N such 
that 2)(n)/n is periodic (mod d) for a finite modulus d and takes rational values, for which there is 
no Turing machine that, when given n, always decides in a finite number of steps whether or not some 
iterate g\“)(n) with k > 1 is a power of 2. 


3.2. Existence of stopping times for almost all integers. Several authors have investigated the 
range of validity of the result that T(n) has a finite stopping time for almost all integers n by 
considering more general classes of periodically linear functions. One such class G consists of all 
functions U = U(m,d, R) which are given by 


7 ifn =0(modd), 


(3.2) U(n) = mn—r if n #0 (modd), and r€ R is such that 


d ° mn =r (mod 4d), 


where m and d are positive integers with (m,d) = land R = {r,: 7, =i (modd),1<i<d-—1} 
is a fixed set of residue class representatives of the nonzero residue classes (mod d). The 3x + 1 
function T is in the class G. H. Mdller [52] completely characterized the functions U = U(m, d, R) 
in the set G which have a finite stopping time for almost all integers n. He showed they are 
exactly those functions for which 


(3.3) m < d4/4-0, 


E. Heppner [41] proved the following quantitative version of this result, thereby generalizing 
Theorem D. 


THEOREM Q (Heppner). Let U = U(m,d, R) be a function in the class G. 

(i) If m < d¢/“4-), then there exist real numbers 8,8 > 0 such that for N = [log x/log d] we 
have #{n: n < x and U')(n) > nx7*1} = O(x!~®) as x > 00. 

(ii) If m > d¢/“4-), then there exist real numbers 83,8, > 0 such that for N = [log x/log d] we 
have #{n: n < x and U'™)(n) < nx®} = O(x'7*) as x > 00. 


J.-P. Allouche [1] has further sharpened Theorem Q and Matthews and Watts [53], [54] have 
extended it to a larger class of functions. 

It is a measure of the difficulty of problems in this area that even the following apparently 
weak conjecture is unsolved. 


EXISTENCE CONJECTURE. Let U be any function in the class G. Then: 
(i) U has at least one purely periodic trajectory if m < d4/\4~»; 
(ii) U has at least one divergent trajectory if m > d¢4/“4—), 


3.3. Fractional parts of (3 /2)*. Attempts to understand the distribution (mod1) of the 
sequence {(3/2)*: 1 < k < oo} have uncovered oblique connections with ergodic-theoretic aspects 
of a generalization of the 3x + 1 problem. It is conjectured that the sequence (3/2)* is uniformly 


* Conway’s proof used Minsky machines, which have the same computational power as Turing machines. 
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distributed (mod 1). (This conjecture seems intractable at present.) One approach to this problem 
is to determine what kinds of (mod 1) distributions can occur for sequences {(3/2)*é: 1 < k < oo}, 
where € is a fixed real number. In this vein K. Mahler [48] considered the problem of whether or 
not there exist real numbers £, which he called Z-numbers, having the property that 


3\* 1 
(3.4) 0<{(5] ) <p kaL2 3. 


where {x} = x —[x] is the fractional part of x. He showed that the set of Z-numbers is 
countable, by showing that there is at most one Z-number in each interval [n,n + 1), for 
n= 1,2,3,... . He went on to show that a necessary condition for the existence of a Z-number in 
the interval [n,n + 1) is that the trajectory (n,W(n),W®(n),...) of n produced by the 
periodically linear function 


a if n = 0 (mod 2), 
(3.5) W(n) = 

a if n = 1(mod2), 
satisfy 
(3.6) W“)(n) #3(mod4),  1<k< oo. 


Mahler concluded from this that it is unlikely that any Z-numbers exist. This is supported by the 
following heuristic argument. The function W may be interpreted as acting on the 2-adic integers 
by (3.5), and it has properties exactly analogous to the properties of T given by Theorem K. In 
particular, for almost all 2-adic integers a the sequence of iterates (a, W(a),W(a),...) has 
infinitely many values k with W‘*)(a) = 3 (mod 4). Thus if a given n € Z behaves like almost all 
2-adic integers a, then (3.6) will not hold for n. Note that it is possible that all the trajectories 
(n,W(n),W(n),...) for n>1 are uniformly distributed (mod 2*) for all k, unlike the 
behavior of the function T(n). 

In passing, I note that the possible distributions (mod 1) of {(3/2)*é; 1 < k < oo} for real 
have an intricate structure (see G. Choquet [16-22] and A. D. Pollington [55,56]). In particular, 
Pollington [56] proves that there are uncountably many real numbers é such that 


1 3\* 24 
35 <{(5) é} S 55° k = 1,2,3,..., 


in contrast to the at most countable number of solutions € of (3.4). 


4. Conclusion. Is the 3x + 1 problem intractably hard? The difficulty of settling the 3x + 1 
problem seems connected to the fact that it is a deterministic process that simulates “random” 
behavior. We face this dilemma: On the one hand, to the extent that the problem has structure, we 
can analyze it—yet it is precisely this structure that seems to prevent us from proving that it 
behaves “randomly”. On the other hand, to the extent that the problem is structureless and 
“random”, we have nothing to analyze and consequently cannot rigorously prove anything. Of 
course there remains the possibility that someone will find some hidden regularity in the 3x + 1 
problem that allows some of the conjectures about it to be settled. The existing general methods in 
number theory and ergodic theory do not seem to touch the 3x + 1 problem; in this sense it seems 
intractable at present. Indeed all the conjectures made in this paper seem currently to be out of 
reach if they are true; I think there is more chance of disproving those that are false. 

If the 3x + 1 problem is intractable, why should one bother to study it? One answer is provided 
by the following aphorism: “No problem is so intractable that something interesting cannot be 
said about it.” Study of the 3x + 1 problem has uncovered a number of interesting phenomena; I 
believe further study of it may be rewarded by the discovery of other new phenomena. It also 
serves as a benchmark to measure the progress of general mathematical theories. For example, 
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future developments in solving exponential Diophantine equations may lead to the resolution of 
the Finite Cycles Conjecture. 

If all the conjectures made in this paper are intractable, where would one begin to do research on 
this deceptively simple problem? As a guide to doing research, I ask questions. Here are a few that 
occur to me: For the 3x + 1 problem, what restrictions are there on the growth in size of members 
of a divergent trajectory assuming that one exists? What interesting properties does the function 
Q,, have? Is there some direct characterization of Q,, other than the recursive definition (2.33)? 
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1. Introduction. We present a theory to deal with geometric problems of a varied nature. The 
apparently disparate concepts of Fourier analysis and Hermitian forms are fundamental to the 
theory. Indeed, our exposition might serve as an introduction to these important notions since it 
relies only on the elementary properties of complex numbers, vectors, and polynomials. 

Our approach evolved from the decomposition theory of n-gons initiated by Bachmann and 
Schmidt [1], which in turn has its roots in Fourier analysis: an n-gon is written as a linear 
combination of “regular” n-gons, with the scalars being the associated finite Fourier coefficients. 
Typical examples to illustrate the theory are provided in Section 2. Many of these have been 
discussed already in an earlier paper [12]. The deeper metric properties of polygons then are 
approached through Hermitian forms on n-tuples of complex numbers. In Section 4 Hermitian 
forms are applied to isoperimetric inequalities for polygonal regions. This leads us in the final 
section to consider regions bounded by continuous curves. For this we extend our results 
concerning finite dimensional vector spaces to the corresponding results in Hilbert space. The 
finite Fourier series extend quite naturally to their more familiar infinite cousins. 


All three authors came to Regina during the late 60’s when the former college was expanding into a major 
university. Chris Fisher received his Ph.D. from the University of Toronto (1972) under the supervision of H. S. M. 
Coxeter. He has also been visiting Professor at the University of Bologna, the Technical University of Munich, and 
the Free University of Brussels. On the basis of his contacts, he has enriched substantially the guest speaker program 
of his comparatively remote university. Dieter Ruoff studied at the University of Zurich. He wrote his Ph.D. thesis 
under the guidance of B. L. van der Waerden on a topic in the foundations of geometry, and he has worked in this 
field since then. Before emigrating to Canada he spent two postdoctoral years with F. Bachmann in Kiel. He now 
specializes in incidence geometry. Like Fisher, he often visits Europe pursuing professional and personal contacts 
and his interests in the arts. J. Shilleto, a Berkeley Ph.D., has chosen geometry as his primary interest after beginning 
his career in mathematical logic and doing some topology. He believes strongly in a developmental approach to 
teaching mathematics and is currently engaged in writing a geometry text which reflects this view. 
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2. Affine n-gon Theory 


The Vector Space of n-gons. An n-gon & in the Euclidean plane is an ordered n-tuple of 


points, = (ap, a,,...,a,_,). These points are called vertices, and the line segments joining a, 
to a,, a, tO ay,...,a,_, tO dp are called sides. Note that we do not require an n-gon to be 
convex. We also allow any number of the vertices to coincide; in fact the n-fold repeated origin 
O = (0,0,...,0) turns out to be an important n-gon. 
By virtue of our identification of the vertex 
a; = ( Xj y;) 


with the complex number 
,= x ti (4.4 ER, i=V—7), 


an n-gon in the plane is at the same time a vector in C” (that is, an n-tuple of complex numbers). 
Thus a linear combination of n-gons @= (a;),;<, and B= (bh) is identified with the linear 
combination of the corresponding vectors in C”: 


aL+ b¥= (aa, + bb, ) 


J<n 


a,bEeC. 


J<n? 


In other words, aw+ b@ is the n-gon whose vertices are obtained as linear combinations of the 
vertices of x and &. So (re'’).@ is the image of W under a dilatation by the real factor r and a 
rotation through the angle @, both about the origin. Also, for example, (1/2) #%+ (1/2)@ is the 
n-gon whose /th vertex is the midpoint of the segment joining the jth vertices of w and &. 


Multiplication of n-gons by Polynomials. Note that the identification of n-gons with vectors 
compels us to define two n-gons to be equal just in case they are equal as vectors in C”. For 
example, (@),4,,...,@,_,) and (a@,,a5,...,da,) are distinct n-gons (unless ay =a, = -:: = 
a, -1) even though they have the same set of vertices and edges. 


Notation. Let x* be the n-gon obtained from & by a cyclic shift of the entries of 
(dp, @,,...,a, 1) k places to the left. That is, 


XW = (4,,47,...,4,-1,Q9), 

x° B= (d,,43,...,49,4)), 

etc. 

In matrix form we have that 
0 O 0 1 
1 O 0 O 
x = (y,4),...,4,-1)X; where x = ol 0 9 

0 O wae 1 O 


Extending the multiplication above to arbitrary polynomials p(x) =c) + ¢,x + --: +c,x* 
(c, € C) we define p(x). to be the linear combination cp 7+ cxW@t --> +c, x". 

It is also possible to describe p(x) as (ao,4,,...,4,-,)I1 where II = cyx® + ax 
+ +++ +¢,x* is an n X n circulant matrix (i.e., a square matrix (c,,) for which ¢,; = ¢¢ 41413 
see [8] or [9]). However, the polynomial notation is more familiar and readily brings out its 
inherent geometrical features. We shall see this demonstrated in the examples to follow. 


LEMMAS AND THEOREMS. 


(2.1) An n-gon W=(a;),<, whose vertices repeat after m steps satisfies (x” —1)¥%#=O 
(where, as always, O = (0,0,...,0)). 
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Indeed, (x” — 1)#=O means precisely a,,,, — a, = 0 for all j = 0,1,...,2 —1. As an im- 
portant special case, (x” — 1) ¥%= O for every n-gon &. It follows that x"~“ represents a shift 
of vertices k places to the right. We usually write this as x~ “27. In the same spirit, 

1 ( x" —-1 


1 n—-1 n—2 — 
(xen XM + +1) = | 


Jer 
is the n-gon whose vertices all coincide with the centroid of #. 
We define the n-gon W= (ay, a,,...,a,_1,) to be k-regular if there exists a point such that the 


rotation through the angle 2k7/n about it moves ay to a,, a, tO ay,...,a,_1 tO Ao (see Fig. 
2A). 


a, ai41 


Fic. 2A. A k-regular n-gon. 


Throughout this paper we denote the nth roots of unity as powers of 
w= em / ne 


Then, in our shift notation, we have the following result. 
(2.2) WL is a k-regular n-gon whose centroid is 0 if and only if there is an integer k for which 
(x-—w*) =O. 


Although one usually calls such n-gons “regular” [6, p. 36], we employ the prefix k to 
emphasize that each side subtends an angle of 2ka7/n at the center. When k = 1, # is convex 
and labelled counterclockwise; when k = —1, # is convex and clockwise. For k relatively prime 
to n, M isa regular star n-gon; generally speaking it is some regular (n/d)-gon (d = the greatest 
common divisor of n and k) described d times. We shall denote by #, the special k-regular 
n-gon centered at 0 with a, = 1, namely, 


R. =(1,wk, wk... wonky, 


The image of a k-regular n-gon under an affinity is called k-affinely regular. Recall that an 
affinity is the product of a nonsingular linear transformation and a translation [6, §13.3]. An 
affinity therefore preserves ratios of lengths of parallel line segments. Affinely regular polygons 
are discussed in [5], [6, §13.4], [12], and [23]. It is a simple geometric fact (see Fig. 2B) that a 
k-regular n-gon centered at 0 fulfills 


1 
(2.3) 5 (4; + 4,7) = cos(2ka/n)a;,, 
This condition leads to the following characterization. 


(2.4) An n-gon & is k-affinely regular with centroid 0 if and only if for some integer k 
(x? —¢,x + 1) v= O, 
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where c, = 2cos(2ka/n). 


Note that (2.4) allows the vertices of ~ to be collinear; this is somewhat nonstandard but is 
convenient for our algebraic approach. We are obliged, however, to prove that when the vertices of 
& are not collinear, (2.4) characterizes the n-gons with centroid at 0 that are images of k-regular 
n-gons under affinities. 


Fic. 2B. Justifying (2.4): 3(w + wkU*?)) = cos(2ka/n)wkUtY, 


Proof. Since (2.3) is preserved by affinities that fix the origin, it (and its equivalent formulation 
(x* — ¢,x + 1)¥M= O) is a necessary condition. Conversely, one defines the affinity a so that it 
maps the centroid and first two vertices of #, to the corresponding points of .. Then one uses 
(2.3) inductively to prove that the remaining vertices of #, and © correspond under a. O 


Here are two familiar special cases. All triangles are affinely regular since a triangle with 


centroid 0 satisfies (x* + x + 1)#”= O-c, = —1. Also, the affinely regular quadrangles are 
parallelograms because (x* + 1).= O holds if and only if . is a parallelogram with centroid 
O-c, = 0. 


To see how our notation works, let us prove the well-known fact that the midpoints of the sides 
of a quadrangle form a parallelogram: To simplify matters let x be a quadrangle whose centroid is 
0; that is, $(x° + x7 + x+1)#= O. Its midpoint figure @ satisfies @= 4(x + 1).2”. Thus 
O=4(x3 + x7 +x4+)V=(x* + LIE(x4 DM] = (x? + D&B, which shows that @ is a 
parallelogram (with centroid 0). 


The next theorem was inspired by [10, p. 63, #57], which dealt with the case of a triangle. 


(2.5) THEOREM. Let squares be erected externally on the sides of an affinely regular n-gon 
whose centroid is 0. Then the segment joining neighboring vertices of adjacent squares is perpendicu- 
lar to the corresponding radius vector of & and is 2 — c, times as long. (The quantity c, is the 
parameter 2 cos (2k7/n) of (2.4).) 


Proof. Labelling as in Fig. 2C, we have that 
b,-—a,= —i(aj,,—a4,) and c,—a,= —i(a,;-a,_,). 
SO 
B-A= —i(x-1)L%, 
@-@M=-i(1—x )v  (by(2.1) x w= x"), 
and therefore 
B-€=—-i(x-24+x')w#v= —i((x —c¢, +x 1)-(2- c,)) 8 = i(2 — c,) &. 
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FIG. 2C. Theorem (2.5): 0 is the centroid of .; squares are erected externally on its sides, and neighboring vertices 
of adjacent squares are joined. 


b, \ 
| \ 


Fic. 2D. The Napoleon-Barlotti theorem: b, is the center of a k-regular n-gon erected externally on a side of ¥. 


The last equality uses (x —c, + x~')#@=O which is essentially (2.4). Thus @- ¢= 
i(2 — c,)&, which is precisely what was to be proved. 0 


(2.6) THEOREM (Napoleon-Barlotti). For a given n-gon &, let B be the n-gon whose vertices are 
the centroids of k-regular n-gons erected externally on the sides of X. Then & is k-regular if and only 
if L is k-affinely regular. 

Proof. We may assume & has centroid 0. By virtue of (2.2), (2.4), and the fact that 
(x — w*)\(x — w>*) = x? — 2cos(2ka/n)x +1, we need only to show that (x — w*) x 
(x — w-*)= O if and only if (x — w*)@= O. As shown in Fig. 2D, the construction of the 
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vertices b, of @ may be restated by the equation 


w*(a4,-b)=a,-b (j=0,....n-1,w = e277"), 


More simply, 


w*(xW¥—- B)=A- B. 
SO 
(a) (x-w“)w=(1-—w*)&, 
and hence (x — w*)(x — w7*)#= O if and only if (x — w*)#= 0.0 


This result with n = 3, when © and @ are triangles, is popularly attributed to Napoleon, 
whose role in the story is a source of speculation. An extensive search (by V. G. Cavallaro [4]) 
uncovered a 1917 Italian textbook in which the triangle problem was introduced with the phrase 
“Theorem proposed for demonstration by Napoleon to Lagrange” (Exercise 494, page 186 of A. 
Faifofer, Elementi di Geometria, 20th ed., Sorteni e Vidotti, Venezia, 1917; it can be found also in 
some earlier editions such as the 18th (1912)). In addition to this reference the search turned up 
the result, unaccompanied by any mention of Napoleon, in an 1843 text. It was not until 1955 that 
the theorem for n > 3 (and k = 1) was formulated (by Barlotti in [2]). There have been several 
recent related results as well as an extension of the theorem to more general settings (see [12], [13], 
[23]). 

The n-gon ¢ of centroids of k-regular n-gons erected internally on the sides of the k-affinely 
regular n-gon W# satisfies the equation 


(b) (x — w*) v= (1 - w*)G. 


Equivalently, @ can be considered the n-gon of centroids of (n — k)-regular n-gons erected 
externally on the sides of . We shall call @ and @ the Barlotti n-gons of #. Subtracting (b) 
from (a) we obtain that 


l1-w* 1 — w* 
A = E, 
wk —w* wk —w* 
and so 
1 w* 
2.7 A = Br A w* # —1). 
(2.7) we +1 we +1 ( ) 


Note that when is k-affinely regular, (2.6) implies that (w* + 1)7'@ and w*(w* + 1)7'@ are 
k- and (n — k)-regular respectively; thus we have proved that every k-affinely regular n-gon is the 
sum of a k-regular n-gon and an (n — k)-regular n-gon. This fact is only the tip of the iceberg. 

The fundamental result in [23], which is a clarification and simplification of the main theorem 
of Bachmann and Schmidt [1, §6.2], says that an n-gon can be written as a linear combination of 
“simpler” n-gons. For us the following special case of this theorem will be important. 


(2.8) Bo, Z,,...,B,_; form a basis for the vector space C” of n-gons over the complex numbers. 


This is actually a restatement of a well-known fact about finite Fourier series (see [18] or [25], for 
example). More precisely, for an n-gon Y= (a,;),.,, there exists a unique set of n complex 


numbers a), @,,...,@,—1, the finite Fourier coefficients of X, such that W= Li, . ,a,A,. In terms 
of the vertices of . this means that 
(2.8’) a,= Daw (w= er/n), 

k<n 


It is a standard argument ([18, §1] or [25, §1]) to prove (2.8) by showing that each a, satisfies 


1 _ 
(2.9) a, =~ yaw. 


j<n 


1985] PERPENDICULAR POLYGONS 29 


Note that, in particular, a, is the centroid of #. 


3. Hermitian Forms. We shall prove that signed area is the quadratic form associated with 
some Hermitian form on the vector space C” of all n-gons. This will permit proofs about areas 
using the techniques developed above. 

Let us denote by /(.#, #) the standard inner product L; .,a,b, defined on C”. (J is used to 
suggest inertia since [(.%, 2) is the moment of inertia of with respect to the origin.) Then (for 
L,#B,ECEC", a €C) the following is true: 


(a) I(V%+ BG) =1(H#,%)+1(F,@) 
and 
I(#7,B+ G) =1(87,B)+1(L,@), 
(b) al( %,B) =I(aWv,B) =1(L,aB), 
(c) I(B, xf) = If, 2) . 
When / is replaced in the equations above by an arbitrary form P: C” x C” — C”, properties (a) 
and (b) define sesquilinear (or conjugate bilinear) forms, while (a), (b) and (c) define Hermitian 


forms (see, for example, [14, Chapter XI, §1]). There is another simple property of J worth 
mentioning, namely 
(d) I(xW,xB) = 1(H,B). 

By a standard argument (see [14, p. a for example), for every sesquilinear form there is an 
n X n matrix (hjx) such that P(7,#)=2L, Kf jn, b,. Property (c) forces (h,,) to be a Hermi- 
tian matrix, that is (h,,)* = (h,,), where (h,,)* is defined to be (h, j). Property (d) is equivalent 
to the condition ae (hx) be a circulant matrix; this property can be restated in terms of the 


polynomial p(x) = 1, <,c;x/, where c; = ho, as the following theorem (whose proof is straight- 
forward). 

(3.1) A form P satisfies (a), (b) and (d) if and only if there exists a polynomial p(x) for which 
P(%,B) = I( p(x) LM, @) for all n-gons LH and B. 

Defining (©; < ,¢;*x x/)* Lij<ncjx" 7, we see that for all # and &, I( p(x)#,#) = 
I( XW, p*(x)B); hence a form I( p(x)—, —) determined by a polynomial p(x) is Hermitian if and 
only if p(x) = p*(x). We shall call forms satisfying (a), (b), (c) and (d) polygonal forms. These are 
the ones determined by polynomials p(x) for which p(x) = p*(x) and include those of 
particular interest to the geometer. See [9], [18] and [25] for alternative approaches. 


The quadratic form associated with a Hermitian (even sesquilinear) form P(.~/, @) is defined 
by P(.#):= P(W, M). It determines the Hermitian form since for all ~ and &, 


(3.2) 2P(#,B) =(P( + B) — P(x) — P(B)) + i( P( + iB) — P(x) — P(B)) 


(see [9, p. 308]). We now introduce the form F(—), with F standing for the German Fidche, that 
assigns area to an n-gon; we use (3.2) to pass to the associated polygonal form F(—,—). F(#) 


Aya) = (X41 Ya.) 


a, = (x,y) 


Fic. 3A. Area = 3(x,X,41 — X41) = (1/41) 4,414, — 4,4;41). 


30 J. CHRIS FISHER, D. RUOFF, AND J. SHILLETO [January 


will assume both positive and negative values and thus represent a signed area. This is in contrast 
to I(@), which is positive definite (i.e., [(.) > 0 unless #= O). 

The area F(.<) of the n-gon # is defined to be the sum of the signed areas of the triangles 
Oa,a,;,, (as in Fig. 3A): 


1 _ _ 
Fi= F(#)1= Ai » (4,414; — a;0j41)5 
j<n 


from this it follows by an application of (3.2) that 


1 _ 
F( Wf, B) = Ai » (4,415; — a;bj41). 


j<n 
Finally (abbreviating as earlier x"~! by x~'), we have that 
x—x! 
ra y,4) 

which establishes that signed area F is the quadratic form of a polygonal form determined by the 
polynomial (x — x~')/4i. 

The third and final form we shall consider here is S = S(.#), the sum of the squares of the 
side lengths of the polygon ». We shall use it later in linking area and perimeter. As above, we 
can calculate the corresponding form S(.%, #) by using (3.2). Thus from the definition 


S() = Llajai — a,\’ = Vay) — a, )( G41 — a;), 


(3.3) F(t, B) = [1 (xa, @) — I, xB)) - 7 


we obtain that 
S(.#, B) = i (aj41 — aj)( B+ _ b), 
and hence that 
(3.4) S(7%,B) = I((x — 1) #%,(x -1)f) = 1((1 — x) #,(1 — x) B) 
= I((2 -(x + x7')) #,B). 

Therefore the mapping assigning the sum of the squares of the sides to a polygon is a quadratic form 
arising from a polygonal form determined by the polynomial 2 — (x + x7). 

Note that S is positive definite on the subspace of C” comprising the n-gons whose centroid is 


0 (but it is only positive semi-definite on the whole space C”). 
As an illustration of the use of these definitions we shall prove (cf. [18, (8.4)], [25, Th. 4], [26]): 


(3.5) If P is a k-affinely regular n-gon with centroid 0, then 
S(#) =(2-¢,)I(#), 
where, as always, c, = 2.cos(2ka/n). 
Proof. Let 0 be the centroid of . Because (x + x~').W~= c,.# (from (2.4)) we have that 
S(.#) = 1((2 -(« + x7!) #, #) = 1((2 - 4) H, 8) =(2-¢)I(#).0 


The theorem below is fundamental to all that follows. It permits us to extend automatically the 
validity of many types of statements from regular n-gons to arbitrary n-gons. 


(3.6) BASIS THEOREM. The regular n-gons &2),2,,...,%,—, form an orthogonal basis with 
respect to any polygonal form. 


Proof. We already know (from (2.8)) that 2,...,%,_, form a basis. So we only need to 
prove, for any polynomial p(x) and for distinct integers 7 and k between 0 and n — 1, that 
I( p(x)2,, #,) = 0. Now p(x)B, = p(w/)B, because x#, = w/®, for each j (from (2.2)); 
therefore 
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I( p(x) %,%) = p(w!) (A, A) = Ow!) De worn 


Because the sum of the first n powers of any nontrivial nth root of unity is 0, the last expression 
is 0 and the theorem follows. 0 


From this theorem we can conclude immediately that for all n-gons ~=%,_,a,B, and 
B=%,-,8,A, and for any polygonal form P, 


(3.7) P(#,B) = & a, B,P(%,) and P(.#) = 2 ja, |°P( 2, ). 


A more substantial application of the Basis Theorem is the following addition to the 
Napoleon-Barlotti Theorem (2.6). It is a well-known result (see [7, 3.38] for an alternative 
approach) that the area of an arbitrary triangle equals the sum of the areas of its Barlotti triangles 
(which we defined after (2.6)). More generally, 


(3.8) THEOREM. If © is a k-affinely regular n-gon with centroid at 0 and & and @ are its 
Barlotti n-gons, then 


(3.8, ) (2+ c,) P(#%) = P(#) + P(@), 
where P is any polygonal form. 


For example, P could be the area, the sum of the squared sides, or the polar moment of inertia (F, 
S, or I). 


Proof. First observe that (3.8,) just says 0 = 0 when w* = —1 (and c, = —2); so let us 
assume w* # —1. Also note that |w*| = 1 and 


jw< + 1)? =(w* +1)(w*¥ +1) =24+ 4. 


By (2.6) & and @ are k- and (n — k)-regular respectively; hence, by the Basis Theorem, they are 
orthogonal. Therefore, recalling (2.7), namely 


= + , 
wk +1 wk +1 
we obtain 
1 |w* |? 
P(X) = ———- P( @) + ———_ P( 
(7) we + 1)? (4) Jw*< + 1)? (#) 
1 


= 5g (PCB) + P))-0 


The theorem above has a partial converse which extends [13, Theorem 3]. Its proof will follow 
from the next result. 


(3.9) THEOREM. For any polygonal form P which is positive definite on the subspace of n-gons 
centered at 0, 


4 
equality holds if and only if ¥ is affinely regular and convex. 


2 + 
P(As-w) < “1 P( of); 


Proof. We represent Y as W=)L,.,a,A,. Since W has centroid 0, aj = 0 (see 2.9). 
Recalling (2.2) (that x, = w*B,) and noting that c, = 2cos(2ka/n) < c, forl <k <n, 


p(=#)- x Ja, ?P(~5—&, | 
2 kon 2 
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| 
M 
Q 
aa 
a) 


| 
M 
Q 

= 

Pd 
4 

- 
» 
SS 
~~” 


a * 


& la 


2+¢ 
S 4 ; dy lax \?P(%,) 
k<n 


2+ 


“7 *k p( #,) 


=" py wv), 


Our assumption on P forces P(&,)> 0 for all k # 0, which explains the inequality in the 
manipulations above; as a further consequence, equality holds if and only if a, = 0 for every 
k #1, n —1,10e., if and only if ~ is affinely regular and convex. 0 


The inequality in (3.9) is a comparison of the values of a form applied to an n-gon # and to 
its midpoint figure $(x + 1). If we replace Ww by (x — 1)&, we have an upper bound for 
P(x? — 1)&, namely 


P(x? —1)@) <(2+4)P((x-1)4), 


which is attained only for affinely regular convex n-gons. For the form J this result coincides with 
Theorem 2 in [13], which compares the sum of the squared sides of an n-gon & with the sum of 
the squares of the diagonals from each 5, to 5,,, 

We are now ready for the promised converse to (3.8). 


Let & be an n-gon that has its centroid at 0, @ and @ the outer and inner Barlotti n-gons of ®, 
and P any polygonal form that is positive definite on the subspace of n-gons with centroid 0. If 
(2 + c,)P(H%) = P(B)+ P(@), then & is convex and affinely regular. 


Proof. We substitute the formulas from Theorem (2.6), 


g=2 and ¢=2—— 
l-—w- l1—w 


into the given equation (3.8,) to obtain 


A, 


(2 + ¢,) P(#) = p=" + P( I = ~ od 
== = (P(x wt) + P(x w) #)), 
so that 
(4 — c?) P(.) = 4P(W) — co, P(x, 8) — 4 P( 8, xX). 
Therefore 
c,P( WS) = P(x, AL) + P(L,xL) 
= P((x + 1) #7) —2P(), 
and finally, 


(2+ ¢,) P(H) = P((x+ 1) #). 


The result now follows immediately from (3.9). 
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4. Isoperimetric Inequalities. In this section we will apply Hermitian forms to obtain various 
isoperimetric inequalities. We have designed this section to be as independent of the first part of 
the paper as is possible. It depends only on the Basis Theorem (3.6) and the fact that signed area 
F = F(®#) and sum of squared sides S = S$(.%) of an n-gon # are quadratic forms arising from 
Hermitian forms. We first establish the following statement. 


(4.1) For any n-gon &@, S > 4tan(a/n)|F|, and equality holds if and only if & is regular and 
convex. 


Fic. 4A. F(2,) = nsin(ka/n)cos(ka/n); S(A,) = 4nsin?(ka/n). 
Proof. For each k the n-gon #, can be subdivided into n congruent triangles each of which 
has a base of 2 sin(k/n) and a height of cos(k2/n) (see Fig. 4A). So 
F(&,) = nsin(ka/n)cos( ka/n) 
and 
S(&,) = 4nsin(ka/n). 
Because |tan(ka/n)| > tan(7/n) for k = 1,2,...,n —1, 
S(&,) = 4nsin(ka/n) 

= 4|tan(ka/n) F( 2, )| 

> 4tan(a/n)|F( 2, )|. 
The required inequality now follows from the Basis Theorem (3.6) since, for @= L, < 0,2; 


» a 2, = » Joa, |°S( B,) 


k<n k<n 


» Jou, | F( R,) 


k<n 


F( > a. % 


k<n 


s = s| 


> 4tan( 7 /n) 


= 4tan( 7 /n) 


= 4tan(a/n)|F|. 


An inspection shows that equality holds if and only if either all a, = 0 except possibly a,, or all 
a, = 0 except possibly a, _,, ie., Y% is both regular and convex. O 
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There is another proof of this inequality which brings out the essential properties of the 
polynomials po(x) = 2 — (x + x7") and p;(x) = (x — x71)/4i discussed in Section 3. Note 
that 


(x + 1) ps(x) = —4i(x — 1) pp(x). 
The function 


Ps(e”) _ 

pr(e”) 

increases for 0 < 6 <a and decreases for 7 < 8 < 27. Therefore its minimum value among 

0=20/n,4a/n,...,2(n — 1)a/n occurs for 0 = +2a/n. Recalling that p(x)&, = p(w*)B, 

for any polynomial p(x) (see the proof of (3.6)), we have that 

Ps(w *) 
Pr(w") 


f(8) = 


ef —] 
eF 4] 


S(R,) = ps(w") (A) = [pr(w") (2) 


5 | 20). 
Pr(») 


The remainder of the proof is as above. 


[pr(w")|I(2,) = 4tan(a/n)| F(A, )|. 


Now we can prove our next result. 


(4.2) For an n-gon & with perimeter L, L* > 4ntan(m/n)|F|, and equality holds if and only if 
SS is regular and convex. 


Outline of the proof. When x is equilateral L? = nS. (Just let the common edge be e so that 
L = ne and S = ne’.) Thus for equilateral n-gons our claim is an immediate consequence of (4.1). 
The argument up to here is much like that of Schoenberg in [25], who attributes the result for 
equilateral n-gons to Jacob Steiner. For ways to extend the result to arbitrary n-gons we refer the 
reader to [17, Chapter 2], [21, Chapter IV], or [22, Chapter X]. An argument of a topological 
nature is common to all known proofs of (4.2) and appears to be inescapable (see [17, Section 
2.4]). We shall skip the details. 


Let us instead improve (4.1) in another direction by obtaining a more precise bound on the 
difference S — 4tan(7/n) F. The first step is to prove for k = 2,3,...,m — 1 that 


S(&,) — 4tan(a/n) F(&,) > 8nsin’(a/n). 


To this end we apply the formulas S(&,) = 4nsin’(a/n) and F(&,) = nsin(ka/n) obtained 
above as well as some trigonometric manipulations and see that 


S(B,) — 4tan( s/n) F(&,) = 4nsin’?(ka/n) — 4ntan(2/n)sin(ka/n)cos( ka/n) 
> 4nsir(2a/n) — 4n tan( 2/n)sin(27/n)cos(27/n) 
= 8nsin’(a/n). 
The inequality above is easily checked for 2 < k < n — 2; when k = n — 1, equality holds. 
Next, because S(#,) = 4 tan(7/n) F(2&,) for k = 0,1 it follows that, for = L, . ,a,B,, 


S( of) — 4tan( n/n) F(x) = Y tau?(S( 4) — 4tan(/n) F(%)) 


n-l1 
> 8nsir(a/n) >> \a,/’. 
k=2 
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We shall now estimate £7_}|a,|* in terms of the vertices of ». In order to express our 
conclusion most simply we shall assume that a) = 0, ie., that the centroid of is 0 (see (2.9)). 


Moreover, also by (2.9), a, = “5 a,w/ and so 


j<nvy 


1 
lal<— Dla) 


j<n 


We know that 


Mat) = ¥ lag %) = L la? Dw?) =D wl 


k<n k<n j<n k<n 
and hence 
1 
2 _ 2 
© lal? =— D la? 
k<n j<n 
Thus 
n—-1 
2 _ 2 2 
» |a,|" = » la,|“ — |a,| 
(4.3) k=2 k<n 
1 » 1 ? 
>— ) Ia ~ 3 » 14) 
jan j<n 


The expression (4.3) is equal to 0”, the variance of the set {|ao|,|@,|,...,|@,-;|}; it measures the 
extent to which w# differs from an n-gon inscribed in a circle whose radius is > j<nl4jl 
Combining the facts above we conclude for any n-gon #& centered at 0 that 
S(.#) — 4tan(7/n) F(.W) > 8no’ sin (a/n). 

As in (4.2) this inequality has an immediate consequence for equilateral n-gons. 

(4.4) THEOREM. For any equilateral n-gon & with centroid 0, perimeter L, and area F, 

L? — 4ntan(a/n) F > 8n’o* sir’ (a /n), 

where a? is the variance of |ao|,|a,|,.--,|@,—1|. 

5. Extension to Closed Curves. The results of Section 4 extend naturally to a closed curve 
whose perimeter L is well-defined. We shall denote the curve by # and its arc length 
parametrization by a(s):[0, L] ~ C, where a(L) = a(0). We can consider the points of a(s) as 


the limit points of the vertices a, of inscribed equilateral n-gons as n — oo. The finite Fourier 
series a; = Li, < ,a,w/* then becomes the Fourier series 


a(s) _ > a,eremsi/E 
kez 


where a, = = fa(s) e 2k*/" as 
0 


Let us emphasize the obvious: A parameter other than s will lead to a different Fourier series. 
The quantities L, F, and o” exist as integrals which are limits of the values for n-gons as 
n-increases to oo. Explicitly 


(5.1) F = F() =5 [‘tma(s)a"(s) ds. 


This formula can be recognized as Green’s Theorem for areas expressed in the notation of 
complex numbers. The variance o” of |a(s)| is given by an integral obtained from (4.3), namely 
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(5.2) 0 = (las) — uy ab, 


where 


1 sL 
n= zs ja(s)| ds. 


The latter quantity » can be thought of as the “average distance” of the curve from 0. When 0 is 
the centroid of the curve, p is its “average radius”. 


(5.3) THEOREM. Given a closed curve & with centroid 0 and perimeter L, which surrounds a 
region whose area is F > QO, then 


L? > 4nF + 81707. 
Moreover L* = 4¢qF if and only if @ is a circle. 


Proof. Inscribe an equilateral n-gon %, in WX, or, to avoid a detour discussing whether this is 
always possible, follow W. Blaschke’s approach, which consists in defining ~%, as an equilateral 
n-gon whose vertices all lie on & except possibly for one which is “close” to W (see [21, p. 75]). 
Then (5.3) follows by taking the limit of the inequality in (4.4) for the polygons w,: 


L? — 4ntan(a/n) F, > 8n’o7sin’(7/n). 


As n> o, L, > L, ntan(a/n) > 7, F, > F, 0, > 6, and n’sin’(a/n) > m’. If L = 4aF, 
then o” must equal 0 and so, by (5.2), |a(s)| = w for every s. 0 


Theorem (5.3) is an example of what was termed in [19] and [20] a Bonnesen-style inequality—an 
isoperimetric inequality incorporating an error term that measures, in terms of some geometric 
property, the deviation of the curve from a circle. Our error term is not related to any of the 
results mentioned in [20]; a similar (but weaker) inequality was obtained in [24] by quite different 
methods. 

Of course Theorem (5.3) can be proved directly, without recourse to our n-gon theory. This is 
the approach taken in a fascinating 1902 paper [16] of A. Hurwitz, the mentor of Hilbert and 
Minkowski. That paper contains among its many geometric applications of Fourier series a proof 
of the isoperimetric inequality that is close in spirit to ours (although Hurwitz failed to interpret 
the error term). His proof was reproduced in [11] and appears frequently in textbooks. Surpris- 
ingly, the Hurwitz paper contains virtually all of the published applications of Fourier series to 
geometry, even though it is clear that there is an untapped potential in his methods. Likewise, our 
n-gon theory demonstrates that there are many geometric applications of finite Fourier series as 
well; however, except for [25] their use has been restricted almost entirely to numerical techniques 
in such fields as applied harmonic analysis (see [3] or [15]). We hope that both our explanations 
and references provide the reader with some useful tools for geometry. 
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is as old as calculus itself and textbooks abound with worked examples. The biochemist, however, 
is interested in a restricted form of the problem in which the coefficients of the polynomials 


f(x) =a) ta,x+--+ +4a,x", 
e(x)=b, + bx+--- +b, x", 


are required to be nonnegative and only the part of the graph of r(x) in the positive quadrant is 
of significance. We shall not attempt to explain the numerous situations in life sciences where the 
study of such graphs is important nor the chemical reactions which give rise to the rational 
functions r(x). Such information can be found elsewhere [1], [2], [3], [4]. Let it suffice to mention 
that, to the experimental biochemist, theoretical results concerning the shapes of rational func- 
tions are of considerable interest. So in this paper we collect together a number of mathematical 
theorems concerning the turning points, inflexions, and sigmoidicity of rational functions. 
As an example, we quote the following result. 


1.1. THEOREM. The number of critical points in the positive quadrant of a nonconstant rational 
function 


Ap t+ ayxt+-+- +a,x" 
by + bx + s+ +b x? 


with all coefficients a;,b, nonnegative, cannot exceed the minimum of the degrees m,n. If m=n, 


then this number cannot exceed m — 1. 


The proof of this theorem is taken up in Section 2, where we further show that the result is best 
possible. A necessary condition for achieving the maximum number of critical points is also 
indicated. Related problems concerning the distribution of critical points and inflexions are 
discussed in Section 3. Finally in Section 4 we define the notion of sigmoidicity for a certain class 
of rational functions and solve the problem of maximising sigmoidicity up to degree four. 


2. Critical Points. Consider two polynomials 
f(x) =a) + a,x +--+ +a,x", 
g(x) =bt+bxt+-+- +b,x", 
with a,b,, # 0. The Wronskian is defined by 
W(f.8) =fg’ -f'3 = —W(g.f), 


where f’ and g’ are the derivatives of f and g, respectively. We are particularly interested in the 
situation where all coefficients a,,b, are nonnegative. The positive zeros of W(f,g) are the 
positive critical points of the rational function r = f/g because r’ = W(g,f)/g7. We begin, 
however, with a version of Rolle’s theorem for suitably differentiable functions f and g. Our main 
results follow as corollaries. 


2.1. THEOREM. Let I = [a,b] be an interval of the real line and suppose that neither g nor g’ has 
a zero in I. Then between two zeros of W(f,g) in I there is at least one zero of W(f’, g’). 


To prove the theorem we first expand the identically zero determinant 


4} 


§ § 


g’ 
g g g”\=0, 
f f’ f” 
to obtain 


gW(8',f’) = 8'W(g,f) — 8° W(8,f). 
An application of Rolle’s theorem to W(g, f)/g’ then shows that between two zeros of W(g, f) 
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there is at least one zero of W(g’, f’). The argument requires that g,g’ do not vanish in the 
interval. 

This concludes the proof of Theorem 2.1. It should be noted of course that the roles of f and g 
may be interchanged because W(f, g) = — W(g,f). 


For our first corollary we assume that the coefficients of g are all nonnegative. The same is 
then true for the derivatives of g. We take J = [0, 00) as our interval. 


2.2. COROLLARY. Let f(x), g(x) be polynomials of degrees n, m, respectively, and suppose g(x) 
has nonnegative coefficients. If n < m, then the number of positive zeros of W( f, 2) cannot exceed 
n. If n = m then this number cannot exceed n — 1. 


Theorem 1.1 is an immediate consequence of this corollary. The proof of Corollary 2.2 is by 
induction on degrees, but first of all we make a few preliminary remarks about the nature of the 
result. The degree of W(f, g) is m + n — 1 unless m = n in which case it is m + n — 2 or less. 
Theorem 1.1 does therefore impose a significant restriction on the number of critical points f/g in 
the positive quadrant. When enumerating zeros of polynomials we always take account of 
multiplicity. For example if f(x) = x", g(x) =x",m # n, then 


W(f,g) =(m—n)x™*r, 

which has m + n — 1 zeros at x = 0. Incidentally this does not contradict Corollary 2.2 which 
only refers to (strictly) positive zeros. Let us suppose then, contrary to the statement of the 
corollary, that W(f, g) has n + 1 positive zeros in case n < m or n positive zeros in case n = m. 
Applying Theorem 2.1 iteratively n times with J = [0, 00), we arrive at a situation where the new f 
is constant say f(x) = c and the Wronskian W(/f, g) = cg’ has at least one positive zero. This is a 
contradiction and proves Theorem 2.1 in case n < m. If n = m, we apply Theorem 2.1 iteratively 
n — 1 times to arrive at a situation where the new f,g are linear expressions ax + b,cx + d, 
respectively. If at any stage of the argument the Wronskian becomes identically zero, say 
W(f’, 2’) = 0, then f(x) = kg(x)+/ for some constants k, / and W(f,g) = /g’ and the 
conclusion follows. Otherwise at the last stage where ‘f(x)=ax+b, g(x)=cx+d, the 
Wronskian W(f, g) = bc — ad is supposed to have one positive zero, which is a contradiction. 

A special case of a rational function of biological interest [4] with deg( f) = deg(g) = n that 
can have no positive turning points is 


r(x) = xF’/nF, 
where F(x) = dy + a,x + a,x* +--+ +a,x"; a, > 0. 


Turning points occur at zeros of FF’ + x(FF” — F’) and all coefficients of this polynomial 
are nonnegative since 


FF’ + x( FF” — F?) =) jaa(j-i)x'"', O<itj<2n, 
i,j 


and so the coefficient of x'*/~' is a,a,(j — i)’. 


However, we now present examples to show that the maximum number of turning points 
permitted by Theorem 1.1 can be achieved. First of all let 


f(x) = (x +1)" +(x - 1)", a(x) = (e+ 1)". 
In this case 
W(f,g) = —2n(x +1)" "(x-)"", 
which has a zero of multiplicity n — 1 at x = 1. Next consider 


g(x) = GD" Gy", + +(x -—1) +1, 


(m-— 1)! 
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the first m + 1 terms in the expansion of e*~'. It can be checked that all coefficients of g(x) are 
nonnegative when g(x) is written out as a polynomial in x. Taking 


—(x-1)" (x-1)"" 
f(x) = n! + (n—1)! 


+ +++ +(x-1) +1, 


we find that 


m! 


wpe) =(2-H"{ &- Gah, 


which has a zero of multiplicity n at x = 1. 
A necessary condition for achieving the maximum number of turning points can be stated in 
terms of the expressions 


Cf, g) = a,b,_, 7 a;_,5,. 
We interpret a, = 0 unless 0 <i <n and b = O unlessO0 <j < m. 

2.3. PROPOSITION. Let f(x), g(x) be polynomials with nonnegative coefficients and suppose 
W(f,2) has the maximum possible number of positive zeros for the given degrees of f and g. Then the 
expressions C,;( f, g) alternate in sign in the sense that C;C,,, <9 for alli. 

To prove the result we note first of all that 

C.F, g’) = k(k + Nai, g), 


which expresses C,(f,g) in terms of C,_,(f’, g’) except k = 1. If the proposition is true for 
f’, g’, it follows immediately for f, g, except possibly for C,C,. However, this case is also covered 
by making the substitution x ~ 1/x. To see how this works consider the case m = n. Take 


p(x) = x"f(1/x), 
q(x) = x"g(1/x), 
whereupon C,(/f, g) = —C,_;41(p,q). In particular, 


Ci(f,8)G(s.8) = G(p.4a)G_1(2,4)- 


If we assume that W(f, g) has the maximum possible number of positive zeros, then the same is 
true of W( p,q), W(f’,g’) and W(p’,q’). This establishes an inductive step and to start the 
induction we consider two cases. First of all let 


f(x) =a) + a,x + a,x’, 
g(x) =b + bx + bx’, 


and suppose f/g has one positive turning point (the maximum permitted number according to 
Theorem 1.1). In this case 


W(f,8) = —Q,x* — 2 ayby — agb,)x- C,, 
which has one positive zero and must therefore have a negative one. Hence 
CG, < 0, 
with all other C,C,,, equal to zero. In the second case we take 
f(x) = a) + ax, 
g(x) =b t+ bxt+ +--+ +b,x", 
such that f/g has a positive turning point. Then 
W(f,g) =(n- 1)a,b,x" +(nagb, +(n — 2)a,b,_,) x" + -°° 
+2a,b,x +(ayb, — a,b). 
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The coefficients here are all nonnegative except possibly 
—C, = ayb, — a,b, 


and this must be negative by Descartes’ rule of signs because W(f, 2) is assumed to have a 
positive zero. Hence 


Ci > 0, C, = — a,b, < 0, 
and therefore C,C, < 0. This completes the inductive argument and the proof of Proposition 2.3. 


3. Inflexion Points. After turning points, the next most salient features of a graph are its 
inflexions. 


3.1. THEOREM. Let f(x), g(x) be polynomials with nonnegative coefficients of degree n,m 
respectively. If n < m, then the number of positive inflexions of f/g cannot exceed m+n -— 1. If 
m = norm =n — 1, then this number cannot exceed 2m — 2. If m <n —1, then the number of 
positive inflexions cannot exceed 2m. 


We should note that the statement of Theorem 1.1 is symmetrical in m,n because the positive 
turning points of f/g and g/f are the same. This is not true for inflexions. The positive inflexions 
of f/g are the positive turning points of W(/, g)/g?. Now g” has nonnegative coefficients. If 
n < m, then 


degree W( f,g) <m+n-—1 < 2m = degree g’. 


We may, therefore, apply Corollary 2.2 with W( f, g) replacing f, and g* replacing g, to prove the 
first part of Theorem 3.1. To deal with the cases m =n or m=n-—1 we can always find 
constants k,/ such that the degree of the numerator in 


f/g+kx+1l=(f+(kx + 1)g)/g 


will be less than the degree of the denominator. The inflexion points are not changed by adding 
the linear expression kx + 1. Now we apply the previous case. It remains to deal with the case 
m <n — 1 and for this we need a preliminary result. 


3.2. LEMMA. Let f, g be polynomials with nonnegative coefficients of degrees n,m respectively. If 
n>m-+1, then the 2mth derivative of the Wronskian, W°™(f, 2), has degreen — m—1 and 
nonnegative coefficients. 


The proof relies on Leibniz’s formula for iterated differentiation of a product, applied to the 
Wronskian W(g,f) = gf’ — g’f, namely 


2m 2m 
m 2 1 m-i1 2 1 m—il 
Wr20(g,f) = ¥ (20) gopom-ion— $f (2) porgom rey 


i=0 i=o\ ! 
Now g“ vanishes for i > m since g is a polynomial of degree m. Hence, with i — 1 in place of i 
in the second summation, we have 


m — 2 I m-i “ 2 i m-i 
wong, f) = & (2m) gorpom-vor— $F ( 2m ) gorgam-ren 


i=o\ i=1 
= gf emt) 4 x (2”") -( 2m J] goren- 
= Wa} \i-1 : 


In the range 1 <i <™m, the difference of binomial coefficients, (2) -(2" J, is positive. This 
completes the proof of the lemma. 


We return to the final part of Theorem 3.1. Let us assume, contrary to the statement of the 
theorem, that n >m+1 and f/g has 2m+1 positive inflexions. Applying Theorem 2.1 
iteratively 2m times with J = [0, 00), W(f, g) in place of f, and g? in place of g, we arrive at a 
situation where the Wronskian of the functions W°”(f, g) and (g”)°” is supposed to have a 
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positive zero. But (g*)°” is a nonzero constant k, say, and the Wronskian is then KWE™tY, 
This has nonnegative coefficients by Lemma 3.2 and we have a contradiction, which completes the 
proof of Theorem 3.1. 


To make further progress with shape-classification for graphs of positive rational functions in 
the positive quadrant, we might ask about the relative distribution of turning points and 
inflexions. A simple example to consider is 


(x) = 2a DE) en) yg 
(1 + x) 


where K is a sufficiently large positive constant to lead to nonzero numerator coefficients. It can 
be checked that ¢(x) has n — 1 turning points and n — 1 inflexions in the positive quadrant 
which interlace. 

Using the abbreviation T for turning point and J for inflexion, and reading the graph of t(x) 
in the positive quadrant from left to right, we have a regular pattern 


... LITT... 


of turning points and inflexions which we may call the “symbol” of the graph. A primitive form of 
shape classification would entail the listing of all possible symbols for rational functions with 
positive coefficients of given degree n over m. An analysis of low degree cases, reveals, for 
example, that the symbol 


I?TITI 


is realisable by cubics over cubics [9]. Here the maximum numbers of inflexions and turning 
points permitted by Theorems 1.1 and 3.1 are simultaneously realisable. For higher degrees the 
symbolic classification remains a problem. It can be solved in certain simple cases. For example, 
the reciprocal 1/g(x) of a polynomial g(x) with positive coefficients cannot have any positive 
turning points but it is possible to find g(x) of degree m such that 1/g(x) has m — 1 positive 
inflexions. In other words the symbol J”~! is realisable. This symbol is maximal and by reading 
the graph of 1/g(x) from suitable points to the right of the origin we can then realise symbols J” 
forO <r<m. 
A special case of biochemical interest is a rational function of the form 


n 
y= Ly: 
i=l 


where y, = 1,K,x/(1 + K,;x), n;, K, > 0. Clearly y’ > 0 and y” < 0 for x > 0, and so the y(x) 
curve has no turning points or inflexions. Biochemists also use transformation of axes and a 
popular plot is in (1/x,1/y) space. There are no inflexions of y(x) in this space either and to see 
this we first note that 


d*(1/y)d(1/x) <0 2y"(xy’ — y) - yy" <0. 
To establish the result by induction we suppose that the graph of 1/y against 1/x is strictly 
concave down for n = k — 1, which is known to be the case for n = 2 and 3[1]. Then we consider 
the effect of adding a further term y,. 

Now if 2 y’(xy’ — y) — xyy” < 0 for y=y, +y, + +++ +y,_1, the sign after adding y, will 

depend upon P + Q, where 

P= 2y'(xy’ — y) — yy" + AVC — Mn) — IK 
and 

Q = 2yj(2xy’ — y) — yy (ZY! + xy") — ry’. 


As P < 0 we only need to consider a typical term, ¢;,, of Q = a1 Vk resulting from y, and 
y,- Since 
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2 
_ —2njn, K)K,(K, — K,) x 
Dik ~~ 3 3 
(1+ K,x) (1+ K,x) 
Q < 0 and the desired result is established. 


To conclude this section we examine one important example which has received attention in 
the biochemical literature [4]. We first point out that the Hessian of a polynomial p(x) is defined 
by 


4} / 2 
H( p) = npp” —(n- 1)(p’), 
where n = degree ( p). Taking 
f(x) = np(x) — xp'(x), 
g(x) = p(x), 
we find that 
H(p) = W(f,8). 
Theorem 2.1 then yields the following result. 

3.3. PROPOSITION. Let p(x) be a polynomial of degree n with nonnegative coefficients and 
suppose the Hessian is not identically zero i.e., p(x) # (ax + b)". Then H( p) has at most n — 2 
positive zeros. 

Polynomials, p(x), of the form 


p(x) =1 +(1) Kix +(3) KK.» +.) +Ki)Ky--» K,x",  K,>0, 


have importance in biochemistry and the coefficients 

y, = log( Ki41/K;); 
known as cooperativity coefficients, have attracted much attention [4]. When H(p) has the 
maximum number of positive zeros, then the positive, rational function (np(x) — xp’(x))/p’(x) 


has the maximum number of positive critical points. However, for this rational function we find 
that 


(ny? 2 
C= i(n—i)(7) (K.Ky 0 Ka) KCK, ~ Kia), 


and so the sign of C,C,,, is the same as the sign of y,y,,,. Our previous result (2.3) shows that if 
H(p) has the maximum number of positive zeros, then 


Vi¥j+1 < 9. 


Using a theorem of Newton that was established by Sylvester, discussed in a textbook by 
Todhunter [8] but largely overlooked since [5], we can make further conclusions about the nature 
of the zeros of p(x) when y,y,,, < 0. If the sequence log( K,/K,),log( K;/K,),...,log(K,,/K,,_1) 
has alternating signs — + — --- +, then p(x) has at least n — 2 nonreal zeros if n is even, and 
exactly n — 1 nonreal zeros if n is odd. If the signs alternate in the order +— + --- +, then 
there are still n — 1 nonreal zeros if n is odd, but n nonreal zeros if n is even. 


4. Sigmoidicity. In this section we consider rational functions of the form r(x) = xuv(x), 


where 
a,x" 1+ +++ +a,x + ao 
bx" tb yx" tee) +d x + by 


v(x) = 


with n > 1,a, and b > Ofor0 <i<n-—1land0 <j <2, bob, # 0. We are looking therefore at 
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rational functions of degree n over n whose coefficients are nonnegative and whose graphs go 
through the origin. We further assume that the graph of r(x) is S-shaped or “sigmoid” as it issues 
from the origin, as in Fig. 1: 


Xo Xx 
FIG. 1 


The sigmoidicity of such curves has been regarded as of great importance to the biological 
function and numerous attempts to explain and quantify this phenomenon have been made [2], 
[3], [6], [7]. 

Sigmoidicity occurs if r’”’(0) > 0, which is equivalent to v’(0) > 0. Let x, be the first positive 
value of x for which a line through the origin meets the graph of r(x) tangentially as illustrated 
above. Equivalently x, is the first positive turning point of v(x). 

The “sigmoidicity” of r(x) can be measured by the ratio of the area of the triangle with 
vertices at (0,0), (9,0), (X 9, r(Xp)) to the area under the graph of r(x) for x between 0 and Xp. 
This works out to be 


—_ Xor(xXo) _ _ 1 | 
2f" r(x) dx 2p" ete) dx 


The problem is to maximise sigmoidicity over all r(x) of given degree n over n subject to the 
prescribed conditions. The expression above for sigmoidicity is invariant under rescaling of x and 
r(x), i.e., multiplication of these quantities by positive constants. It is therefore convenient to 
normalise the problem by introducing the function 


v( xX) 
u(x )= v( Xo) . 


Then u(1) = 1, u’(1) = 0 and the problem boils down to minimising the integral 
I “xu( x) dx 
0 


over all 


xr + Fax + ao 
es +++ +bxt by ’ 


u(x) = 


for given n > 1 subject to the conditions that all a,,b, are nonnegative, b)b, # 0, u'(0) > 0 
u(1) = 1 and x = 1 is the first zero of u’(x). 

The first step is to establish a result about pointwise minimisation of functions u(x) in the 
interval 0 < x <1. Let C denote the class of functions u(x) as above and note that the 
conditions u(1) = 1, u’(1) = 0 are equivalent to 


a,-1 + @,_5 + 7 +a, + dy = b, +b, + se +b, + bo, 
(n—1)a,_, +(n-2)a,_, + +++ +a, = nb, +(n- 1)b,-, + °°: +8. 


1985] CRITICAL POINTS AND SIGMOIDICITY OF POSITIVE RATIONAL FUNCTIONS 45 


For each k,n — 1 < k <1, the following rational function is a member of C: 


w(x) = La kD) (R= DB] xt Lk = nt kB) 


x" + B 
providing B lies in the range 
n— n-k-+1 
k k-1 
for k > 1 and B >n-—1 for k = 1. This guarantees the positivity of the coefficients, and the 
conditions w(1) = 1, w’(1) = 0 are satisfied because 


n-k+1-(k-1)B+k-n+kB=1+8, 
k[n-k+1-(k-1)B])+(k-1[kK-—n+ kp] =n. 
Thus for each k and each parameter 8 in the stipulated range we have a w(x) in the class C. 
The special nature of w(x) is that in the numerator the only nonzero coefficients belong to 


adjacent powers of x and in the denominator the only nonzero coefficients belong to the constant 
term and highest power of x. 


K cps 


4.1. PROPOSITION. For each member u(x) of the class C there is at least one w(x) such that 
w(x) pointwise minimises u(x) for all x > 0. 


This means that w(x) < u(x) for x > 0. The condition that x = 1 is the first zero of u’(x) is 
irrelevant in this proposition. We may take x = 1 to be any turning point of u(x). The idea of the 
proof of Proposition 4.1 is first of all to reduce the numerator of u(x) keeping the denominator 
fixed and then to increase the denominator keeping the numerator fixed. Consider then three 
numerator coefficients 


Ae» Gj4154)5 
for k > 1+ 1. We can replace them by 
a, = a, +A, a7.) = Ay, + ha; =a, +», 
without changing the conditions u(1) = 1, w’(1) = 0 providing 
A+ pt+v=0, 
kX +(1/4+1)p+l=0. 


This gives 
Vp 
N= FT TR=1? 
_ —(k-T1)p 
— k-1l-1- 


The change in the numerator of u obtained by replacing a,,a,,,,a, by a;,a;,1, a; iS 
[ 
— yk pe _]- 
e(x) Cool (k-Ixt+k—-I-1]. 
Now the derivative of the function 


d(x) =x*"'-(k-I)x+k-I-1 


is given by 
d(x) =(k- D(x"! - 2), 
with second derivative 


d’'(x) =(k-D(k-1-1)x*-"?, 
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It follows that d(x) > 0 for all x > 0 with minimum value equal to 0 at x = 1. Hence by 
choosing v negative we ensure that e(x) < 0 for all x > 0 and the numerator of u(x) is thereby 
pointwise decreased. The formulae above for a;,a;,,,a;,A,) Show that the effect of taking v 
negative is to decrease a, and a, but to increase a,,,. Consequently we can eliminate either a, or 
a,;,,, depending on which is the smaller, whilst decreasing u, preserving the nonnegativity of all 
coefficients and, of course, remaining in the class C. Starting with the triple a,_1,a,,a) and 
working inwards, we can eliminate all coefficients a; except possibly certain two adjacent ones 
whilst decreasing the numerator of u(x) and therefore u(x) itself. 

Now we carry out a similar programme on the denominator of u(x) which of course we wish 
to increase. Using an analogous notation with the denominator coefficients b, replacing the 
numerator coefficients a,;, we see that, by taking v positive, we increase b, and b, but decrease 
b,,, whilst increasing the denominator of u(x). This time we can eliminate the middle term of 
any triple of coefficients, preserving positivity of the rest. Starting with the triple b,,b,_,,5,_, 
and working to the right, we eliminate all denominator coefficients except b, and b,. This leads 
finally to a member of C of the form 

w(x) = ax® + ap_,x*7} 
bx" + bo 
pointwise minimising u(x) for all x > 0. We get the form of w(x) stated above using the 
conditions w(1) = 1,w’(1) = 0 to work out a;,a;,_, in terms of B = bj/b’. This completes the 
proof of Proposition 4.1. 


As far as the original problem of minimising {jxu(x) dx is concerned, we know now that we 
may restrict attention to the special functions u(x) = w(x). There is a 1-parameter family of such 
w(x) for each k,n —1< k <1. How much further can we go in pointwise minimisation? To 
answer this question we consider the derivative of w(x) with respect to the parameter B, 


dw x*-11-(k —1)x"*1 + kx” —(n=k+Ixtn—k] 
ap (x" +p)? 
Let 
f(x) = -(k-1)x"** + kx" -(n-—k+1)x4+n-k, 
and note that 
f(x) = -—(n + 1)(k - 1) x" + knx""' -(n-k +41), 
f(x) = —n(n +1)(k — 1)x"71 + kn(n - 1) x"? 


We have f(0) =n —k>0,f(1) = 0,7 f’) = 0, f"() = n(n +1 — 2k). It follows that if n +1 

> 2k, then dw/dB is nonnegative in the interval 0 < x < 1. Hence w is an increasing function of 
a in this interval and w is minimised by taking the lowest permissible value of 8, namely 
B =(n-—k)/k. For this value of B we get 


excl, 2k<ntl. 
w(x) = Tok <*S n 


Of these w(x) the largest value of k gives the smallest w(x). In the case n = 3 for example, the 
choices of k are k = 1,2, and since 2-2 =3+1,ie,2k =n+1, we see that the function 


x2 
w _ 
(+) 2x7 41 


pointwise minimises all members of C in the interval 0 < x < 1. Consequently for cubics over 
cubics sigmoidicity is maximised by the rational function 


3x3 


2x3 +17 
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or equivalently x* /(x° + 1) after rescaling. Similarly for quadratics over quadratics sigmoidicity 
is maximised by x”/(x? + 1). 

When we come to the case n = 4 a new phenomenon arises. We have three one-parameter 
families of w(x) corresponding to k = 1,2,3. For k = 1,2 we have 2k < n+ 1 and for these 
values of k we saw above that 


nx* 


kx" +n—k 


are minimising functions within each family and decrease pointwise as k increases for 0 < x < 1. 
It follows that the function 


_ 4x 

3x4 417 
which belongs to the family k = 3, actually minimises all members of both families k = 1,2. 
However, within the family k = 3, there is no pointwise minimising function. In fact the graphs of 


any two members of this family cross over at some point in the interval 0 < x < 1. To see why 
this is true let us consider the general member of the family k = 3, namely 


2(1 — B)x? +(3B — 1) x? 
x? +B 


where 1/3 < B < 1. The difference between two members with parameters 8, B’ of the family is 
given by 


(B’ — B)x?[2x° — 3x4 + 2x -1] 
(x* + B)(x* + B’) 


which vanishes between x = 0 and x = 1 since 


9 


2x5 — 3x4 42x —1=(x—1)(x — .6573)(x —714)(x—n), 
where r,, = —.5786 + .6526i. This turns out to be a general phenomenon for 2k >n + 1. 
Hence for n > 4 we have reached the end of what can be achieved by pointwise minimisation as 


regards the original problem of sigmoidicity. We can summarise the discussion above by 
announcing a partial improvement of Proposition 4.1. 


4.2. PROPOSITION. In the interval 0 < x <1 each member u(x) of the class C is pointwise 
minimised by at least one w(x) of a k-family with 2k >n+1, ifn > 4. Forn = 2,3 the functions 
2x/(x? + 1), 3x?/(2x? + 1) are absolutely pointwise minimising in the class C over the interval 
O<x<l. 


To complete this work we present a special argument to show that sigmoidicity for quartics 
over quartics is still maximised by the function 4x*/(3x* + 1) even though the argument of 
absolute pointwise minimisation used for the cases n = 2,3 will not work for n = 4. We wish then 
to minimise the integral 


[oxu(x) dx, 


and we know from earlier work that we may restrict attention to functions 
2(1 - 34+(3B - 1) x? 
ux) ~ 22 = B)** +(38 = I) 
x4 +B 
taken from the family k = 3. Let us consider therefore the parametrised integral 


2(1 - 4+(3B -— 1)x? 
1(8) = [OE a 
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We differentiate under the integral sign to get 
dl pix®[-2x5 + 3x4-2x 41) 
—_—_ = a d. 
dp Jo (x4 +4 By 

and, as we saw earlier, the integrand changes sign at x = t = .6573. We break the integral for 


dI/df into parts and estimate the size of each, 


dI tx? [-2x° + 3x*- 2x41] dx + [2 Lo2e 4B Wet x 
t 


“pe (x4 + By’ (x* +p) 
Over the interval 0 < x < ¢, the integrand is positive and over t < x < 1 it is negative. Hence 
T 
> f'x'[-2x8 + 3x4 — 2x 4 1] ax 
Bon 4 By °9 
+ f'[-2%° + 3x4 -2x+4 1] dx 
(7474+ B) * 
1 
-——— [°° [-2x° + 3x4 —2x +1] dx 
(14+ B) °° 
-—1_[-243-241 
(74 4 By 9 8 5 4 
- /!  t 
(74 4 By 360 
> 0. 


Hence / is an increasing function of B and is a minimum therefore at the least permissible value 
of B namely B = }. This gives our final result for quartics over quartics (after renormalising). 


4.3. PROPOSITION. Maximum sigmoidicity for quartics over quartics is achieved by the function 


x4 


xt +410 


References 


1. W. G. Bardsley, The 3:3 function in enzyme kinetics. Possible shapes of v/S and (1/v)/(1/S) plots for 
third degree steady-state rate equations, J. Theoret. Biol. (1977) 65, 281-316. 

2. W. G. Bardsley and R. E. Childs, Sigmoid curves, non-linear double-reciprocal plots and allosterism, 
Biochem. J. (1975) 149, 313-328. 

3. W. G. Bardsley and A. J. Wright, A new approach to the measurement of sigmoid curves with enzyme kinetic 
and ligand binding data, J. Mol. Biol. (1983) 165, 163-182. 

4. W. G. Bardsley, R. Woolfson, and J.-P. Mazat, Relationships between the magnitude of Hill plot slopes, 
apparent binding constants and factorability of binding polynomials and their Hessians, J. Theoret. Biol. (1980) 85, 
247-284. 

5. W. G. Bardsley, R. Woolfson, and R. M. W. Wood, Some mathematical results concerning Hessians of 
binding polynomials and cooperativity coefficients, J. Theoret. Biol. (1980) 85, 45-51. 

6. D. E. Koshland, Jr., G. Némethy, and D. Filmer, Comparison of experimental binding data and theoretical 
models in proteins concerning subunits, Biochemistry (1966) 5, 365-385. 

7. J. Monod, J. Wyman, and J.-P. Changeux, On the nature of allosteric transitions: A plausible model, J. Mol. 
Biol. (1965) 12, 88-118. 

8. I. Todhunter, An elementary treatise on the theory of equations, Macmillan, 1882. 

9. J. M. Wardell, W. G. Bardsley, J. P. Kavanagh, and R. M. Wood, The probability of obtaining complex 
kinetic curves for enzyme mechanisms with cubic terms in the pseudo-steady state rate equations, J. Theoret. Biol. 
(1982) 95, 465-487. 


CENTER SECTION 
(Vol. 92, No. 1, January 1985) 


Telegraphic Reviews 


Edited by Lynn Arthur Steen, with the assistance of the Mathematics Departments 
of Carleton, Macalester, and St. Olaf Colleges. Books submitted for review 
should be sent to Book Review Editor, American Mathematical Monthly, St. Olaf 
College, Northfield, Minnesota 55057. 


Telegraphic reviews are designed to give prompt notice of all new books in the mathematical 
sciences. Certain of these books will be selected for more extensive review in the Reviews section 


of the Monthly. 


Special Codes: 


T: Textbook 13-18: Grade Level 

S: Supplementary Reading 1-4 : Time in Semesters 
P: Professional Reading KK : Special Emphasis 
L: Undergraduate Library 2? > Questionable 


rr TE NE RN A NER Saeed — SNe 


Martin Gardner. U of Chicago Pr, 1983, x + 262 pp, $7.95 (P). [ISBN: 0-226-28250-3] Paperback 
reprint of the well-known 1971 W.H. Freeman collection, with the word "Games" changed to "Diver- 
sions" in the title. LAS 


General, S, L**. Martin Gardner’s New Mathematical Diversions from Scientific American. Martin 
Gardner. U of Chicago Pr, 1983, 253 pp, $7.95 (P). [ISBN: 0-226-28247-3] Reprint of the 1966 Simon 
and Schuster original edition (with some updates, e.g., on the 1976 proof of the four-color conjec- 


ture). A classic, now available for a new generation of students. LAS 


Elementary, T(13: 1). Introductory Geometry. Mary Kay Hudspeth. Addison-Wesley, 1983, 557 pp, 
$18.95 (P). [ISBN: 0-201-10690-6] This typescript text covers the fundamentals of high school 
geometry and introduces formal logic and trigonometry; intended for developmental students with 
knowledge of first year algebra, but presumes no prior knowledge of geometry. JNC 


Mathematics Appreciation, $??7?(13). Generalisations et Generalites. Florentin Smarandache. Edition 
Nouvelle, 1984, 53 pp, (P). Trivial generalizations of known results, plus a collection of the 


author’s favorite mathematical oddities. JRG 


Precalculus, T(13-14: 2-4). Basic Technical Mathematics. Peter K.F. Kuhfittig. Brooks/Cole, 1984, 


RT SNR Nhe REARS 


pp, $28.95. [ISBN: 0-534-03151-X] Designed for students in technical and engineering programs, both 
texts make use of technical applications and notation and include calculator procedures and prob- 
lems. The first text covers standard precalculus topics including some analytic geometry. The 
second text includes all the material from the first text as well as first year calculus. JNC 


Precalculus, T(13: 1). Trigonometry. Charles P. McKeague. Academic Pr, 1984, xv + 458 pp, $20. 
[ISBN: O-12-484785-4] Standard trigonometry text; includes complex numbers and appendix on loga- 
rithms. Many exercises. RM 


Precalculus, T*(13: 1). Precalculus: Functions & Graphs. Bernard Kolman, Arnold Shapiro. Academic 
Pr, 1984, xiv + 663 pp, $21. [ISBN: 0-12-417894-4] A careful presentation of the standard precal- 
culus topics emphasizing functions and graphing techniques; each chapter includes one or two 
"feature" applications, interspersed progress checks and warnings, and concludes with lists of 
terms, symbols and key ideas as well as review exercises and two progress tests. JNC 


Education, T(13: 1). Modern Elementary Mathematics, Fourth Edition. Malcolm Graham. HarBrace J, 
1984, xii + 575 pp, $25.95. [ISBN: 0-15-561043-0] Primarily for pre-service teachers of K-8; a new 
chapter on mathematical reasoning and problem solving precedes material on sets, functions, number 
systems, elementary geometry (including rigid motions) and some statistics. Other revisions include 
additional illustrations and problems and some calculator exercises. (First Edition, TR, January 
1971; Extended Review, January 1972; Second Edition, TR, October 1975; Third Edition, TR, November 
1979.) JNC 


Education, T(13: 1). Modern Mathematics: An Elementary Approach, Sixth Edition. Ruric E. Wheeler. 
Brooks/Cole, 1984, xv + 694 pp, $23.95. [ISBN: 0-534-02843-8] For elementary education and liberal 
arts majors; covers problem solving, elementary number theory, geometry, probability and statistics, 
calculators and computers; new features include discussion of cognitive reasoning techniques and 
"personal" notes to the students introducing each section. Also new is an accompanying Student 
Manual. (Second Edition, TR, November 1970; Third Edition, TR, October 1973; Fourth Edition, TR, 
December 1977; Alternate Edition, March 1982; Fifth Edition, November 1982.) JNC 


History, P*, L*. A History of Mathematics Education in England. Geoffrey Howson. Cambridge U Pr, 
1982, x + 294 pp, $54.50. [ISBN: 0-521-24206-1] The first account of mathematics education in Eng- 


C2 Center Section January 


land, set as a series of nine biographies of representative English mathematics educators, spanning 
the period from 1530 to 1960. These effective and interesting profiles reveal not only individual 
accomplishments, but also the educational and cultural context of each person’s life. A prelude and 
a postlude offer highlights of events preceding 1500 and following 1960; appendices contain sample 
examinations and syllabi. Although the biographical approach is popular in tone, copious notes pro- 
vide scholars with detailed pointers to primary sources. LAS 


History, L**. Understanding Relativity: Origin and Impact of a Scientific Revolution. Stanley Gold- 
berg. Birkhauser Boston, 1984, xvii + 494 pp, $24.95. [ISBN: 3-7643-3150-X] Explication (for the 
lay reader) of the special and general theories, followed by description and discussion of early 
response to and eventual assimilation of the theory in Europe and the U.S. Interesting discussion 
of sociological implications of this major scientific advance. Technical material in appendices. 
Bibliographic essay. JRG 


Combinatorics, T*(15-16: 1, 2), S*, P, L*. Applied Combinatorics. Fred S. Roberts. Prentice-Hall, 
1984, xviii + 606 pp, $37.95. [ISBN: 0-13-039313-4] A versatile text which stresses algorithmic 
thinking and provides illustrations and applications to a wide variety of fields. Subject matter 
includes elementary combinatorics, graphs, generating functions, recurrences, inclusion/exclusion, 
Polya theory, experimental design, coding, combinatorial optimization. Large collections of exer- 
cises. LCL 


Number Theory, S(15-18), L*. Number Theory: An Approach Through History From Hammurapi to Legendre. 
André Weil. Birkhauser Boston, 1984, xxi + 375 pp, $24.95. [ISBN: 3-7643-3141-0] A thoroughly 
engaging treatise covering the history of number theory until 1801. The three principal chapters 
focus on Fermat, Euler, and Lagrange and Legendre. Appendices to chapters deal with such topics as 
Mordell’s theorem, addition theorem for elliptic curves, quadratic reciprocity, and quadratic forms. 
A must for any teacher of number theory at any level. SG 


rw re 


tography, Physics, Biology, Digital Information, and Computing. M.R. Schroeder. Ser. in Inform. 
Sci., V. 7. Springer-Verlag, 1984, xvi + 324 pp, $24.50. [ISBN: 0-387-12164-1] An informal intro- 


duction to some applications of elementary number theory, primarily in coding but with some less 
publicized examples, e.g., fast Fourier transforms. Some previous acquaintance with number theory 
useful but not essential. Good chapter-by-chapter references to more in-depth articles in the 
literature. DD 


Number Theory, S, P, L*. Factorizations of be+1, b = 2,3,5,6,7,10,11,12 up to high powers. John 
Brillhart, et al. Contemporary Math., V. 22. AMS, 1983, lxviii + 178 pp, $22 (P). [ISBN: 0-8218- 
5021-0] A book of tables which gives known factorizations of numbers of the form b'+l for the values 
of b mentioned in the title. Also includes an historical account of hardware and theoretical 


development and a list of references. CEC 


Linear Algebra, T(13-14: 1), L. Introductory Linear Algebra with Applications. John W. Brown, 
Donald R. Sherbert. Prindle, Weber & Schmidt, 1984, xi + 436 pp. [ISBN: 0-87150-700-5] An 
interesting departure in both content and spirit from most linear algebra books, this freshman- 
sophomore text grew out of a revision of the typical finite mathematics course in order to give a 
clearer perspective and emphasis on the linear algebra portions of it. Theory is stated but largely 
unproved. Emphasis is on applications, including linear programming, systems of equations, Markov 
chains, least squares, and generalized inverses. Exercises, answers, index. JS 


Linear Algebra, T**(13-14: 1, 2). Elementary Linear Algebra, Second Edition. Stanley I. Grossman. 
Wadsworth, 1984, xv + 426 pp, $30. [ISBN: 0-534-02738-5] Changes include the rewriting of some sec- 
tions to improve clarity and to promote understanding. Format changes in uses of color and print. 
An attractive book. Provides many options for depth and extent of coverage. Have a look. (First 
Edition, TR, April 1980.) JK 


Linear Algebra, T(14-16: 1), L. Linear Algebra with Applications. Gareth Williams. Allyn & Bacon, 
1984, viii + 504 pp, $30. [ISBN: 0-205-08010-3] Includes an adequate introduction to the elementary 
theory of linear algebra but the emphasis is largely toward applications and computational aspects. 
Wider than usual choice of examples, in such areas as population problems, Markov chains, coding 
theory, linear programming, and computer solutions. Exercises, answers, appendices. JS 


Group Theory, P. Finite Group Algebras and their Modules. P. Landrock. London Math. Soc. Lect. 
Note Ser., V. 84. Cambridge U Pr, 1983, x + 274 pp, $27.50 (P). [ISBN: 0-521-27487-7] Text on 
modern representations and group algebras in characteristic p, concentrating on "post-Brauer” 
results. DD 


Group Theory, S(17-18), P. American Mathematical Society Translations, Series 2, Volume 121: The 
Kourovka Notebook: Unsolved Problems in Group Theory, Seventh Augmented Edition. AMS, 1983, v + 112 
pp, $33. [ISBN: 0-8218-3079-1] This Seventh Edition contains the complete list of 422 problems from 
the Sixth Edition (151 have been solved) together with update comments, bibliographic information, 


etc., and an additional chapter with 58 new problems. LCL 


Algebra, T*(15-16: 1), S*, P, L*. Discrete Mathematics and Applied Modern Algebra. Henry B. Laufer. 
Prindle, Weber & Schmidt, 1984, vi + 538 pp. [ISBN: 0-87150-702-2] A course in applied modern alge- 
bra (groups, Boolean algebra, rings and fields) with motivation and applications from coding theory, 
fast-arithmetic, finite state machines, graph theory, formal language. Pleasing format, easy to 


1985 Center Section C3 


read text, many examples and exercises. LCL 


Algebra, T(17), S, L. Algebra. Larry C. Grove. Pure & Appl. Math. Academic Pr, 1983, xiii + 299 
pp, $32. [ISBN: 0-12-304620-3] Suitable for a first-year graduate course or for independent reading 
by an advanced undergraduate. The text covers the basics of groups, rings, fields, modules, and 
algebras. The approach is classical with no mention of category theory or homological methods. SG 


Algebra, T(18: 1), S, P. Infinite Dimensional Lie Algebras, An Introduction. Victor G. Kac. Prog. 
in Math., V. 44. Birkhauser Boston, 1983, xvi + 245 pp, $18.75. [ISBN: 0-8176-3118-6] A detailed, 
largely self-contained and comprehensive treatment of the present state of development of the class 
of infinite-dimensional Lie algebras now known as Kac-Moody algebras; particular attention to affine 
Lie algebras and highest weight modules. Exercises, notes and commentary, bibliography, index. JS 


Algebra, T(18: 1), S, P. Skew Fields. P.K. Draxl. London Math. Soc. Lect. Note Ser., No. 8l. Cam- 
bridge U Pr, 1983, ix + 182 pp, $19.95 (P). [ISBN: 0-521-27274-2] Based on lecture notes given at 
Penn State in 1981, there are three parts: I. Skew Fields and Simple Rings; II. Brauer Groups; and 
III. K-Theory. Some exercises, bibliography. JS 


Algebra, P. Lecture Notes in Mathematics-1050: Formally p-adic Fields. Alexander Prestel, Peter 
Roquette. Springer-Verlag, 1984, v + 167 pp, $9.50 (P). [ISBN: 0-387-12890-5] Formally p-adic 
fields are fields of characteristic zero having at least one p-valuation. Investigates all p-adic 
valuations on a given field, the model theory of such fields (deducing the Ax-Kochen-Ersov Theorem) 


and considers in particular function fields over p-adically closed fields. DD 


Algebra, P. Commutative Semigroup Rings. Robert Gilmer. Lect. in Math. U of Chicago Pr, 1984, xii 
+ 380 pp, $ll (P); $27. [ISBN: 0-226-29392-0; 0-226-29391-2] Studies the structure of commutative 
semigroup rings, e.g., polynomial rings over commutative coefficient rings, asking when such an 
object is a Krull domain, when two are isomorphic, etc. DD 


Algebra, T(17-18: 1), S, P. A First Course of Homological Algebra. D.G. Northcott. Cambridge U Pr, 
1980, xi + 206 pp, $13.95 (P). [ISBN: 0-521-20196-9] Paperback reprint of the 1973 hardcover origi- 
nal edition (TR, February 1974). LAS 


Finite Mathematics, T(13-15: 1, 2). Applied Finite Mathematics. Gary G. Gilbert, Donald 0. Koehler. 
McGraw-Hill, 1984, xix + 549 pp, $27.50. [ISBN: 0-07-023222-9] Emphasis on mathematical modelling. 
Many examples and applications adapted from research articles in applied areas. Includes discussion 
of errors in the modelling process. Many exercises, from routine to real applications. Topics 
include linear programming, graphs, probability and Markov chains, game theory, finance. RM 


Calculus, T(13: 3). Calculus with Analytic Geometry, Second Edition. M.A. Munem, D.J. Foulis. 
Worth, 1984, xvi + 1048 pp, $37.95. [ISBN: 0-87901-236-6] Changes from the First Edition (TR, 
November 1978) include adding several new topics and some computer-generated graphs, de-emphasizing 
the use of tables, and using metric units in the applications. Two study guides and a computer 
workbook and diskette are also available. BK 


Calculus, T*(13). Essentials of Calculus for Business, Economics, Life Sciences, Social Sciences. 
Louis Leithold. Harper & Row, 1984, xv + 556 pp, $30.50. [ISBN: 0-06-043954-8] Concise, clear 
statements of definitions and theorems (without proofs) and careful explanations of concepts are 
motivated and illustrated by applications from the indicated areas; includes chapters on exponential 
and logarithmic functions and on differential calculus of functions of several variables; an appen- 
dix covers the calculus of trigonometric functions. JNC 


Calculus, T(13: 1, 2), S. Algebra and Fundamentals of Analysis. Ed: G.N. Yakovlev. Transl: Leonid 
Levant. MIR, 1981, 567 pp, $12.95. [ISBN: 0-8285-2571-4] A textbook for students at secondary 
technical schools and colleges in the USSR. Standard fare in elementary mathematical analysis, 
including chapters on computational mathematics and basics of probability. Numerous worked-out 
examples. Almost 700 problems, all with answers. Physically compact and inexpensive. Suitable for 
self-study. A good buy. JK 


Calculus, T*(13-14: 1-3). Calculus and Analytic Geometry. Donald W. Trim. Addison-Wesley, 1983, 
xix + 923 pp, $37.95. [ISBN: 0-201-16270-9] Attractive textbook featuring lots of exercises (some 
requiring a calculator) and refreshing figures. Usual fare but the author’s informal style and 
intuitive approach, the variation in the order of topics and the recasting of standard exercises 
enhance this presentation. Emphasis is on problem solving through an understanding of concepts. 
Answers are given to the even-numbered exercises. JK 


Calculus, T(13-14: 1-3). Calculus, Third Edition. Stanley I. Grossman. Academic Pr, 1984, xxiii + 
1336 pp, $31. [ISBN: 0-12-304371-9] Changes in examples, exercises, theorems and explanations to 
improve clarity. Some rearrangement of topics. Trigonometry now appears earlier in the text. 
Detailed biographical sketches have been added. Continues to be a solid textbook. (First Edition, 
June-July 1979; Second Edition, TR, November 1981.) JK 


Calculus, T*(13: 2, 3), S, L. Calculus with Analytic Geometry, Fourth Edition. Edwin J. Purcell, 
Dale Varberg. Prentice-Hall, 1984, xv + 864 pp, $36.95. [ISBN: 0-13-111807-2] Brief but brisk and 
to the point. Emphasizes functions as main objects of study; differentiation and integration as 
linear operators on functions. Trigonometric functions are reviewed early. Full chapter on elemen- 
tary numerical methods. Exercises unusually varied and imaginative; nearly all sets include graphi- 
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cal or calculator problems (Third Edition, TR, October 1978). PZ 


Calculus, T(113: 2, 3), S. Calculus and Analytic Geometry, Fourth Edition. Douglas F. Riddle. 
Wadsworth, 1984, xvii + 1246 pp, $43.25. [ISBN: 0-534-01468-2] Set of topics is standard. First 
chapters relatively elementary, intuitive; rigor increases later. Limits, continuity treated in 
three widely separated chapters--first intuitively, then more rigorously in geometric setting, 
finally in epsilon-delta format. Transcendental functions follow fundamental theorem. General 
emphasis on geometry, in text and exercises. (First Edition, TR, April 1970; Second Edition, TR, 
October 1974; Third Edition, TR, March 1979.) PZ 


Calculus, S*®***(13-14). Interface: Calculus and the Computer, Second Edition. David Smith. Saunders 
College, 1984, xv + 254 pp, (P). [ISBN: 0-03-070663-7] This book is a first in many ways, even 
though it is a Second Edition (First Edition, TR, November 1976). It is not the first well-written 
book about calculus, though it is among the rare few; neither is it the first book to treat the com- 
puter as a tool for enhancing calculus. But it is the first to bring the two together with so much 
delight and substance. It teaches BASIC (with appropriate apologies) on the sly while guiding the 
reader, who might profitably be a calculus teacher, to new insights into calculus and computing. It 
focuses on numerical approaches that develop understanding by requiring programming; it only men- 
tions symbolic manipulation. But the ideas presented go definitely beyond those of numerical 
methods into history and modelling. After the very first chapters where pidgin BASIC is developed, 
the rest of the algorithms are presented in a language independent flow-chart style. Thus the book 
is suitable in conjunction with any machine as powerful as the more sophisticated programmable cal- 
culators. Good references, indexes, reference charts and the like. JAS 


Calculus, T(13-14: 1). Calculus for Business. Richard S. Paul, Ernest F. Haeussler, Jr. Reston, 
1984, vii + 374 pp, $23.95. [ISBN: 0-8359-0635-3] Strictly for business. "Technical proofs, condi- 
tions, etc., are sufficiently described but are not overdone;" omissions include mean value theorem, 
volumes of revolution, proof of chain rule, Self-contained with respect to economic terms. 
Thirty-page reference appendix on algebra, with exercises. Attention is drawn to common mistakes in 
"pitfall" paragraphs. RB 


Calculus, T(13). Calculus: Basic Concepts and Applications. R.A. Rosenbaum, G. Philip Johnson. 
Cambridge U Pr, 1984, xvi + 422 pp, $24.95. [ISBN: 0-521-25012-9] An intuitive calculus text con- 
centrates on a single variable, and achieves brevity by using short problem lists, plausibility 
instead of proof, and fast moving prose. Attractive possibility for a short course. AWR 


Real Analysis, S(16-17), P, L. Asymptotic Analysis. J.D. Murray. Appl. Math. Sci., V. 48. 
Springer-Verlag, 1984, vii + 164 pp, $22. [ISBN: 0-387-90937-0] Written mainly for the person whose 
"primary aim is to get answers to practical problems," this is a readable, nonrigorous approach to 
asymptotic solutions for various methods including steepest descent, Stationary phase, integral 
transitions, differential equations, and singular perturbations. Includes exercises, bibliography, 
index. (1974 Oxford University Press original edition, TR, November 1975.) JS 


Complex Analysis, T(16-17: 1, 2), L. The Theory of Analytic Functions: A Brief Course. A.I. Mar- 
kushevich. Transl: Eugene Yankovsky. MIR, 1983, 423 pp, $10.95. Introduction to standard topics 
of complex analysis at advanced undergraduate or beginning graduate level. Emphasizes geometric, 
physical, engineering points of view in interpreting results. Relatively complete: Nevanlinna 
theory, quasiconformal mappings, and several-variable theory are introduced. Many diagrams, but 
almost no exercises. PZ 


Differential Equations, P. Hyperbolic Partial Differential Equations: Populations, Reactors, Tides 
and Waves: Theory and Applications. Ed: Matthew Witten. Intern. Ser. in Modern Appl. Math. & Com- 
puter Sci., V. 6. Pergamon Pr, 1983, viii + 244 pp, $37.50. [ISBN: 0-08-030254-8] A retitled spe- 
cial issue of Vol. 9, No. 3 of the journal Computers and Mathematics with Applications, containing 
papers devoted primarily to the McKendrick/Von Foerster hyperbolic partial differential equations 
model of population growth. LAS 


Differential Equations, T(15-16: 1), S*, L. Ordinary Differential Equations in R™: Problems and 
Methods. L.C. Piccinini, G. Stampacchia, G. Vidossich. Transl: A. LoBello. Appl. Math. Sci., V. 
39. Springer-Verlag, 1984, xii + 385 pp, $32 (P). [ISBN: 0-387-90723~8] Excellent readable, contem- 
porary account of topics usually covered in differential equations in a course other than the first. 
Based on the notes of G. Stampacchia. Exercises extend and complement the text. Not appropriate 
for students in the U.S. who have completed only the usual sequence of calculus courses. A very 
attractive treatment of the material. JK 


Differential Equations, S(18), P. Almost Periodic Functions and Differential Equations. B.M. Levi- 
tan, V.V. Zhikov. Transl: L.W. Longdon. Cambridge U Pr, 1982, xi + 211 pp, $34.50. [ISBN: 0-521- 
24407-2] Develops general properties of almost periodic functions and applies this theory to linear 
and non-linear operator differential equations. Clearly written translation from the Russian. His- 


torical references after each chapter. No problems. BH 


Differential Equations, T*(17: 3), S*, P*¥, L. Elliptic Partial Differential] Equations of Second 
Order, Second Edition. David Gilbarg, Neil S$. Trudinger. Grund. der math. Wissenschaften, B. 224. 
Springer-Verlag, 1983, xiii + 513 pp, $45.50. [ISBN: 0-387-13025-X] Covers the theory of the linear 
and non-linear elliptic Dirichlet problem, from the classical and modern points of view. Pays par- 
ticular attention to the equations of mean curvature, minimal surface, and Monge-Ampére. Provides 
the necessary background in Sobolev spaces, fixed point theory, and norm estimates. Mentions appli- 
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cations, e.g., stochastic control theory. Offers historical notes and problems. May be accessible 
to advanced undergraduates. (First Edition, TR, August-September 1978.) YN 


Numerical Analysis, S(13). Applied Numerical Methods for the Microcomputer. Terry E. Shoup. 
Prentice-Hall, 1984, x + 262 pp, $25.95 [ISBN: 0-13-041418-2]; Numerical Methods for the Personal 
Computer, 1983, xii + 238 pp, $18.95. [ISBN: 0-13-627208-8] Standard numerical methods are presented 
for the solution of engineering-oriented problems on a microcomputer. Programs for many of the 
methods are listed in a particular version of BASIC. Some general discussion of microcomputers and 


choice of methods. RWN 


Numerical Analysis, T(14-15). Introduction to Numerical Computation. J. Thomas King. McGraw-Hill, 
1984, xi + 355 pp, $31.95. [ISBN: 0-07-034639-9] A textbook for an undergraduate course in numerical 
analysis. Covers standard topics: the solution of nonlinear equations, polynomial interpolation, 
elimination methods for systems of linear equations, numerical integration and differentiation, and 
discrete variable methods for initial value problems. Example programs are written in WATFIV. AO 


Numerical Analysis, T(14-15: 1, 2), L. Applied Numerical Analysis, Third Edition. Curtis F. Gerald, 
Patrick 0. Wheatley. Addison-Wesley, 1984, x + 621 pp, $29.95. [ISBN: 0-201-11577-8] Major changes 
from earlier editions include the rewriting of all computer programs (programs are now in Fortran 
77), many new exercises, as well as new material on trigonometric approximation, the Fast Fourier 
Transform, and adaptive integration. AO 


Numerical Analysis, T*(17-18: 1), P. Matrix Computations. Gene H. Golub, Charles F. van Loan. Ser. 
in the Math. Sci., V. 3. Johns Hopkins U Pr, 1983, xvi + 476 pp, $49.50; $24.95 (P). [ISBN: 0- 
8018-3010-9; 0-8018-3011-7] A clear, relatively comprehensive text on numerical linear algebra, 
including linear systems, least squares and eigenvalue problems. Presents approaches that have been 
of recent value for large matrices, error and sensitivity analyses, and special methods. RWN 


Numerical Analysis, T(15-16: 1, 2). Introduction to Numerical Computations, Second Edition. James 
S. Vandergraft. Computer Sci. & Appl. Math. Academic Pr, 1983, xiv + 372 pp, $29.50. [ISBN: 0-12- 
711356-8] Two topics have been added in this edition: Newton formulas for interpolation, and adap- 
tive methods for integration. (First Edition, TR, August-September 1979.) AO 


Functional Analysis, T(18: 1), S, P. Locally Convex Spaces and Operator Ideals. Heinz Junek. 
Teubner-Texte zur Math., B. 56. BG Teubner, 1983, 180 pp, 17M (P). A systematic approach to the 
study of locally convex vector spaces by way of looking at ideals of operators. Special attention 
is paid to (F)-, (DF)-, and nuclear spaces. Assumes background in Banach and Hilbert Space. Refer- 
ences, index. JS 


Functional Analysis, P. Approximation of Hilbert Space Operators, Volume II. Constantin Apostol, et 
al. Research Notes in Math., No. 102. Pitman, 1984, 524 pp, $29.95 (P). [ISBN: 0-273-08641-3] 
Chapters 9-16, plus appendices, continuing the work begun by Domingo Herrero in Volume I. 
Emphasizes norm-approximation problems of invariant subsets of operator algebras. LAS 


Functional Analysis, T**(18: 1), S**, P**. Sequences and Series in Banach Spaces. Joseph Diestel. 
Grad. Texts in Math., No. 92. Springer-Verlag, 1984, xiii + 261 pp, $38. [ISBN: 0-387-90859-5] An 
engaging introduction, in 14 brief chapters, to results and methods of structure theory of Banach 
spaces. Focuses on recent developments but gives consistent attention to history of the topic. 
Style is informal, readable, contagiously enthusiastic, and generous to the reader--helpful comments 
and paraphrases of important results abound. Exercises and notes follow each chapter. PZ 


tors. Shmuel Kantorovitz. Springer-Verlag, 1983, 179 pp, $9.50 (P). [ISBN: 0-387-12673-2] 


a 


Bruno Iochum. Springer-Verlag, 1984, vi + 247 pp, $13 (P). [ISBN: 0-387-12901-4] On the underlying 
order structure of a Jordan algebra. Of interest to quantum theorists. JD-B 


Functional Analysis, P. Infinitely Divisible and Stable Measures on Banach Spaces. Werner Linde. 
Teubner-Texte zur Math., B. 58. BG Teubner, 1983, 201 pp, 19M (P). Spectral representations, tail 
behavior, and other properties of stable and infinitely divisible measures on Banach spaces. An 
infinitely divisible (Radon) measure admits n-th roots for all n-~such a measure is also stable if 
each root 1S a translation and dilation of the original measure. The subject is a branch of proba- 


bility theory on Banach spaces. PZ 


Analysis, $(16-17), L. Functional Equations: History, Applications and Theory. Ed: J. Aczél. Math. 
& Its Applic. D Reidel, 1984, ix + 244 pp, $47.50. [ISBN: 90-277-1706-0] Part of a series dedicated 
to showing relationships between apparently disparate areas of mathematics, this book collects 
papers and panel contributions from the functional equations section of a conference, Mathematics at 
the Service of Man. Its value is greatly enhanced by attention to historical development, though 
the pleasure of such reading is diminished by the strained, sometimes fractured English prose of 
writers more at home with other languages. AWR 


Analysis, S*(18), P. Geometrical Methods of Nonlinear Analysis. M.A. Krasnosel’ skil, P.P. Zabreiko. 
Grund. der math. Wissenschaften, B. 263. Springer-Verlag, 1984, xix + 409 pp, $48. [ISBN: 0-387- 
12945-6] Written for "mathematicians and mechanical engineers" with a "basic knowledge of functional 
analysis and topology." Describes theory of rotation of vector fields and applies this theory to 
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the study of solution sets of overator equations, giving many mathematical and physical examples. 
Text is well motivated, clearly written, and provides good overview of material as it is developed. 
Unfortunately, no problems. BH 


Analysis, P. Inégalités isopérimétriques et applications en physique. Jacqueline Mossino. Hermann, 
1984, 182 pp, 130 F (P). [ISBN: 2-7056-5963-3] 


Algebraic Geometry, P. Intersection Theory. William Fulton. Ergebnisse der Math. und ihrer 
Grenzgebiete, 3. Folge, Band 2. Springer-Verlag, 1984, xi + 470 pp, $39. [ISBN: 0-387-12176-5] A 
comprehensive modern treatment of the subject. Serves well as a text for graduate students in alge- 
braic geometry or as a basic reference for practicing algebraic geometers. SG 


Differential Geometry, P. Séminaire Sud-Rhodanien de Géométrie. P. Dazord, N. Desolneux-Moulis. 
Hermann, 1984. Feuilletages et quantification géométrique, 139 pp, 130 F (P). [ISBN: 2-7056-5972- 
2]; Géométrie symplectique et de contact, 113 pp, 110 F (P). [ISBN: 2-7056-5971-4] The first two of 
three volumes containing papers given at a conference on symplectic geometry in Lyons in June 1983. 


JD-B 


Differential Geometry, P. Affine Differential Geometry. Su Buchin. Gordon & Breach, 1983, iv + 248 
pp, $63.75. [ISBN: 0-677-31060-9] From Shanghai: the study of geometric objects invariant under 
affine transformations, after Blaschke and the author’s own work of the 1920°s and “30%s. Review of 
Blaschke’s Vorlesungen, affine surface theory, relations between affine and projective geometry. 
Presumes elementary differential geometry (curves and surfaces). Exercises. RB 


Differential Geometry, P. Introduction to the Geometry of Foliations, Part B: Foliations of Codimen~ 
sion One. Gilbert Hector, Ulrich Hirsch. Aspects of Math., V. 1. Friedr. Vieweg & Sohn (Distr: 
Heyden & Son), 1983, x + 298 pp, $22 (P). [ISBN: 3-528-08568-1] An extension of Part A (published 
1981) to extend the geometry of foliations of surfaces to general manifolds of codimension one. JAS 


Differential Geometry, P. Category Theoretic Methods in Geometry. Ed: A. Kock. Pub. Ser., No. 35. 
Aarhus U, 1983, vi + 290 pp, (P). Proceedings of the workshop which took place in Aarhus, June 1- 
15, 1983. JAS 


Differential Geometry, T(17: 1, 2), S, P*. Modern Geometry-~Methods and Applications, Part I: The 
Geometry of Surfaces, Transformation Groups, and Fields. B.A. Dubrovin, A.T. Fomenko, S.P. Novikov. 
Transl: Robert G. Burns. Grad. Texts in Math., V. 93. Springer-Verlag, 1984, xv + 464 pp, $48. 
[ISBN: 0-387-90872-2] First in a projected three-volume text for a modern course in geometry 
appropriate for physicists. Excellent preface contains authors” thoughts on what should be compul- 
sory subjects in a university-level mathematical education. Topics selected with a view toward 
their utility include basic notions of surface theory, the algebra and the differential calculus of 
tensors, and the calculus of variations. Done in "as simple and concrete a language possible." 


Smoothly translated. Some examples and exercises. Brief bibliography. JK 


Differential Geometry, P. Lecture Notes in Mathematics-1045: Differential Geometry. Ed: A.M. 
Naveira. Springer-Verlag, 1984, viii + 194 pp, $10.50 (P). [ISBN: 0-387-12882-4] Proceedings of a 
symposium held at Pefisicola, Spain in 1982. JG 


Differential Geometry, P. Proceedings of the 1980 Beijing Symposium on Differential Geometry and 
Differential Equations. Ed: $S.S. Chern, Wu Wen-tsun. Gordon & Breach, 1982, $325 set [ISBN: 0- 
677-31000-5]; Volume 1, ix + 625 pp; Volume 2, v + 474 pp; Volume 3, v + 640 pp. Papers from a 
month-long international symposium, the first of its kind ever held in China. Volumes 1 and 2 con- 
tain the invited "comprehensive reports," including papers by M.F. Atiyah, E. Bombieri, Gu Chao-hao, 
L.K. Hua, P.D. Lax, H. Wu, and Wu Wen-tsun. Volume 3 contains about 60 contributed papers, mostly 
by Chinese mathematicians. LAS 


Geometry, S(13), L. Experiments in Four Dimensions. David L. Heiserman. TAB Books, 1983, vii + 501 
pp, $17.50 (P). [ISBN: 0-8306-1541-5] A detailed "how-to" guide for generating the specifications 
(i.e., vertex coordinates, angles, edge lengths, etc.) of four-dimensional objects and three- 
dimensional views of these objects; painstakingly leads the reader with knowledge of precalculus 
through coordinate systems, algebraic representations of transformations and object specifications 
in one, two, three, and four dimensional space; exercises. JNC 


Algebraic Topology, T(17-18: 1), S, P. Techniques of Geometric Topology. Roger A. Fenn. London 
Math. Soc. Lect. Note Ser., No. 57. Cambridge U Pr, 1983, viii + 280 pp, $24.95 (P). [ISBN: 0- 
521-28472-4] Illustrates basic techniques of algebraic topology as applied to geometric aspects of 
low dimensional topology (dimension less than or equal to about 4). Reviews of homology, homotopy; 
covering spaces including their homology; knot theory; Massey products; exercises. A suitable 
sequel to a semester course in algebraic topology or for independent reading, emphasizing both foun- 
dational and current geometric aspects. RB 


Algebraic Topology, T(18: 1), P. Spectra and the Steenrod Algebra: Modules Over the Steenrod Algebra 
and the Stable Homotopy Category. H.R. Margolis. Math. Lib., V. 29. Elsevier Sci, 1983, xix + 489 


pp, $65. [ISBN: 0-444-86516-0] "[T]he book constitutes both a new presentation of results in the 
literature--in effect, an introduction to stable homotopy theory--and a presentation of original 
results [by the author]." Focuses on global structure of topological categories, as opposed to 


specific problems. RB 
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Siegfried Moran. Math. Stud., V. 82. Elsevier Sci, 1983, xii + 295 pp, $42.50 (P). [ISBN: 0-444- 
86714-7] Knot theory developed via the theory of braids, after Artin, Birman. Largely self- 
contained, including discussions of group presentations, fundamental group, Seifert-van Kampen 
theorem, etc.; some knowledge of algebra and topology is presumed. Stress on algebraic aspects of 
the subject in general and Alexander matrices of knots and links in particular. RB 


Optimization, T, S, P. Multicriterion Optimization in Engineering with FORTRAN Programs. Andrezj 
Osyczka. Transl: B.J. Davies. Ser. in Engin. Sci. Halsted Pr, 1984, viii + 178 pp, $51.95. [ISBN: 
0-470-20019-7] Written with the object of making theoretical results of multicriterion optimization 
accessible to practicing engineers. Attempt is made to keep mathematical demands on the reader to a 
minimum, and concrete examples are used throughout. No problems or exercises. AWR 


Probability, T(17), P. Nonlinear Filtering and Smoothing: An Introduction to Martingales, Stochastic 
Integrals and Estimation. Venkatarama Krishnan. Wiley, 1984, xiii + 314 pp, $34.95. [ISBN: 0-471- 
89840-6] Written for "engineering graduate students with a basic knowledge of probability theory." 
Refers many proofs to other texts. Stresses physical understanding and applications. BH 


Statistics, S(13-16), L. The Business of Risk. Peter G. Moore. Cambridge U Pr, 1983, x + 244 pp, 
$39.50; $9.95 (P). [ISBN: 0-521-24174-X; 0-521-28497-X] A general discussion of methods to assess 
and handle risk, with specific examples from the fields of commerce, finance, industry, medicine, 
health, and public policy. Assumes only a rudimentary knowledge of mathematics and statistics. LCL 


Statistics, T(16-18: 1), S, P, L. Pattern Recognition Approach to Data Interpretation. Daane D. 
Wolff, Michael L. Parsons. Plenum Pr, 1983, xiii + 223 pp, $29.50. [ISBN: 0-306-41302-7] Chapter 
one discusses choices of data and organization, choices of tools and their applications, phailoso- 
phies of interpreting results. Chapter two provides some mathematical foundations for the tech- 
niques included, Chapter three presents three major packages: SPSS, BMDP, ARTHUR. The last chapter 
contains natural science applications. References. Appendices. Subject index. Computer program 
index. RJA 


Statistics, S(17), P. Linear Statistical Analysis of Discrete Data. Mikel Aickin. Wiley, 1983, xvi 
+ 358 pp, $36.95. [ISBN: 0-471-09774-8] This book integrates application, computation, and theory 
required for statistical analysis of discrete data. Material includes log-linear models for cross- 
classifications, logistic regression, inference for nested models, and covariate inference. 
Designed to be accessible, with instruction, to those with a course in statistics beyond the e]emen- 
tary level, calculus, and linear algebra. MT 


Statistics, T(17: 1). Elementary Multivariate Analysis for the Behavioral Sciences: Applications of 
Basic Structure. Clifford E. Lunneborg, Robert D. Abbott. Elsevier Sci, 1983, xvi + 522 pp, $39. 
[ISBN: 0-444-00753-9] Chapters 2-6 review matrix concepts such as matrix algebra, rank, orthogonal- 
ity, and differentiation. Chapters 7-9 discuss both theory and applications of multiple regression. 
Chapters 10-12 are the heart of the text; they introduce multivariate distributions and the mul- 
tivariate general linear model, including T-squared, Wilks” Lambda, and discriminant analysis. The 
final three chapters discuss models for categorical data and the basic structure and analysis of 
interdependence. MT 


Statistics, $(13). Statistical Illusions: Problems. Schuyler W. Huck, Howard M. Sandler. Harper & 
Row, 1984, xiii + 175 pp, $10.95 (P) [ISBN: 0-06-042973-9]; Solutions, ix + 181 pp, $10.95 (P). 
[ISBN: 0-06-042969-0] The "problems" book contains 100 vignettes, each culminating in a leading 
question with an expected, "commonsense," (but incorrect) answer. The companion "solutions" volume 
shows why the immediate, intuitive answer is incorrect, and explains the correct solution. Can be 
used as a supplement in a beginning statistics course; questions span such areas as descriptive 
statistics, probability, estimation, hypothesis testing, correlation, nonparametrics. LCL 


Statistics, P*. Experimental Design, Statistical Models, and Genetic Statistics: Essays in Honor of 
Oscar Kempthorne. Ed: Klaus Hinkelmann. Statistics, V. 50. Dekker, 1984, x + 409 pp, $55. [ISBN: 
0-8247-7151-6] Collection of 21 articles written by 32 former students, colleagues and friends of 
Kempthorne, who did pioneer work in the areas of experimental design and genetic statistics (2 on 
his contributions to Iowa State University, 9 on the design and analysis of experiments, 6 on linear 


and nonlinear models, and 4 on statistical and population genetics). RSK 


Statistics, T(14-17), S*. Analysis of Unbalanced Data: A Pre-Program Introduction. Ching Chun Li. 
Cambridge U Pr, 1982, x + 145 pp, $19.95. [ISBN: 0-521-24749-7] Can be used as a text for a short 
course or module to introduce the analysis of two-way classification data with unequal numbers of 
observations in the cells. Detailed explanations are clearly given and illustrated using only ele- 
mentary algebra. An appendix gives the relation between the methods used in the text and the more 
sophisticated generalized inverse approach. RSK 


ee ee El 


the Life Sciences. J.H. Pollard. Cambridge U Pr, 1981, xvi + 349 pp, $18.95 (P). [ISBN: 0-521- 
29750-8] Divided into three parts: Part I deals with numerical techniques, Part II deals with sta- 
tistical techniques, and Part III combines numerical and statistical techniques in discussing least 
squares regression. This is not a text but rather a statistical handbook designed as a reference 
for scientists. MT 
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Statistics, T(17: 2). Intermediate Statistical Methods and Applications: A Computer Package 
Approach. Mark L. Berenson, David M. Levine, Matthew Goldstein. Prentice-Hall, 1983, xviii + 579 
pp, $29.95. [ISBN: 0-13-470781-8] This text includes discussion of analysis of variance, regres- 
sion, principal components, discriminant analysis, and analysis of cross-classified categorical 
data. After presentation of each statistical procedure, the book describes the commands necessary 
to conduct the analysis using each of three statistical computing packages: SPSS, SAS, and BMDP. MT 


Statistics, T?(14: 1), S*. Dice, Data and Decisions: Introductory Statistics. K.W. Kemp. Ser. in 
Math. & Its Applic. Halsted Pr, 1984, 314 pp, $64.95. [ISBN: 0-470-27496-4] This book is an intro- 
duction to statistics for students who have had differential and integral calculus. Concepts of 
probability, expectation, and random variables are introduced in the first chapters, but in an order 
that may prove difficult to follow; one should examine this book carefully before adopting it as a 
text. However, there are many illustrative examples which make the book a good student reference. 
MT 


Statistics, T(14~-17: 1, 2), P. Biostatistical Analysis, Second Edition. Jerrold H. Zar. Prentice- 
Hall, 1984, xiv + 718 pp, $34.95. [ISBN: 0-13-077925-3] Presents a broad collection of data analysis 
techniques used by biological researchers, assuming only a background of elementary algebra. 
Advanced topics include parametric and nonparametric analysis of variance techniques, including mul- 
tiple comparison procedures, multiple regression and correlation, and circular distributions. Con- 
tains 190 pages of tables and graphs, but no discussion of elementary probability. RSK 


Statistics, T*(15: 1, 2). Introduction to Mathematical Statistics, Fifth Edition. Paul G. Hoel. 
Wiley, 1984, x + 435 pp, $28.95. [ISBN: 0-471-89045-6] Principal changes from the Fourth Edition 
(TR, November 1971; Extended Review, June 1974) include earlier discussion of small sample methods, 
a rewriting of the chapter on sampling theory, some additional material on computer programs, and 
many new problems, often involving real data. The problem sets now also include a final collection 


of mixed problems in addition to the problems grouped by section. RSK 


Computer Literacy, S, P, L. Information, Technology and Civilization. Hiroshi Inose, John R. 
Pierce. WH Freeman, 1984, xx + 263 pp, $9.95 (P); $19.95. [ISBN: 0-7167-1515-5; 0-7167-1514-7] A 
report commissioned by the Club of Rome (ten years after their highly publicized "Limits to Growth") 
to examine the impact on society of information technology. The authors see information serving a 
new world of "communities of interest," held together by common stakes in information, data, and 
communication. LAS 


Computer Literacy, L. Computer Literacy: The Basic Concepts and Language. John V. Lombardi. Indi- 
ana U Pr, 1983, x + 114 pp, $5.95 (P). [ISBN: 0-253-21075-5] A brief introduction to microcomputers 
and related technology for those without previous experience. Includes material on word processing, 
spreadsheets, communications, and programming languages. AO 


Computer Programming, T(13-14: 1). Programming Assembler Language, Second Edition. Peter Abel. 
Reston, 1984, x + 581 pp. [ISBN: 0-8359-5661-X] An introduction to assembly language programming on 
the IBM 370 series. Includes general descriptions of the system, computer arithmetic, data han- 
dling, storage access and macros. (First Edition, TR, June-July 1980.) RWN 


Computer Programming, T(13-14: 1, 2). PASCAL, Programming and Problem Solving. Sanford Leestma, 
Larry Nyhoff. Macmillan, 1984, xii + 585 pp, $19.95 (P). [ISBN: 0-02-369460-2] Intended for a 
two-semester introduction to Pascal programming and computer problem solving. By year’s end, final 
chapter discusses RPN, introduction to trees, quicksort, merging: advanced programming topics of 
program specification, external documentation, software maintenance are excluded. Top-down problem 
solving demonstrated in only two examples. RB 


Computer Programming, $(13-18). The Complete Forth. A.F.T. Winfield. Wiley, 1983, xi + 131 pp, 
$15.95 (Pp). [ISBN: 0-471-88235-6] The FORTH-79 standard dialect is used. Text directed at a micro- 
computer environment. First five chapters present basic language features: stack, words, actions, 
conditionals, loops. Next follows editing and disk operations for FORTH. Second half of text 
discusses specialized techniques of FORTH programming and two large complete programs. Chapter sum- 
maries and exercises. Bibliography. Answers to problems. Glossary of terminology. Index. FORTH 
reference card. RJA 


Computer Programming, T(13-15: 1), P. Learning and Applying APL. B. Legrand. Transl: Julian Glyn 
Matthews. Wiley, 1984, xiii + 400 pp, $24.95. [ISBN: 0-471-90243-8] An introduction to the program- 
ming language APL, developed by Ken Iverson of Harvard University in 1965. APL is an operator- 
oriented language that is particularly well suited for solving mathematical problems relating to 
arrays and matrices. The text is a thorough introduction to the language and would be appropriate 
for either the beginner or the experienced professional wishing to learn another language. MS 


Computer Programming, T**(13-18: 1), S, L. Paradigms and Programming with Pascal. Derick Wood. 
Computer Sci Pr, 1984, xiii + 425 pp, $25.95. [ISBN: 0-914894-45-5] Text is divided into four parts. 
First comes a quick review of Smaller Pascal, a subset of Pascal used in the text. Part two 
presents programming methodology issues in the context of four important problems that are examined 
fully. Also, there is a chapter on good I/O interfaces and one on the general issue of efficiency 
in high-level programming. Part three introduces a number of important paradigms (e.g., recursion, 
divide and conquer, various searches, backtracking, etc.) along with appropriate illustrative exam- 
ples of problems and solutions. Part four very briefly introduces more extremely interesting para- 
digms along with project suggestions. Chapter references and exercises. Index. RJA 
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Computer Programming, T*(13-18: 1). VAX PASCAL. J.N.P. Hume, R.C. Holt. Reston, 1984, xiv + 395 
pp. [ISBN: 0-8359-8247-5] Interweaves elements of computers, computer programming, the VAX program- 
ming environment, Pascal, and computer science. An introductory programming text that addresses 
some issues not usually found at this level; e.g., sorting and searching methods and their effi- 
clency, insertions into and deletions from trees, other high-level programming languages as well as 
assembly and machine languages, compilers. Of special note are the chapters on scientific calcula- 
tions and numerical methods. Chapter exercises. Several appendices. Index. RJA 


Computer Programming, T(13). An Introduction to Programming and Pascal. William J. Collins. Mac- 
millan, 1984, xi + 388 pp, $19.95 (P). [ISBN: 0-02-323780-5] Emphasis on programming--algorithms, 
pseudo-code, stepwise refinement (explained with "solution trees"). Many Pascal constructs intro- 
duced early, loops and arrays before procedures. Includes detailed case studies, but skimpy on 
exercises. RM 


Computer Programming, T(14-16: 1), S, P. An Introduction to APL. S$. Pommier. Transl: Bronwen A. 
Rees. Computer Sci. Texts, No. 17. Cambridge U Pr, 1983, vii + 136 pp, $29.95; $11.95 (P). [ISBN: 
0-521-24977-5; 0-521-27109-6] APL is a function-oriented language designed by Ken Iverson. In APL, 
the basic data type is the array, and the language contains dozens of powerful operators for direct 
manipulation of these array structures. This text is a description of the APL language. It 
includes numerous examples of APL programs and a discussion of the types of problems for which APL 
is especially well suited. MS 


Computer Programming, T(15-16: 1), P, L. Micro-PROLOG: Programming in Logic. K.L. Clark, F.G. 
McCabe. Prentice-Hall, 1984, xi + 401 pp, $16.95 (P). [ISBN: 0-13-581264-X] Tutorial on logic pro- 
gramming with the microcomputer implementation of PROLOG (available under UNIX and on Z-80 and 
8088/86 based machines). Nice, straightforward treatment, with chapters on applications to expert 
systems, the critical path method, two person games. RM 


Computer Programming, S. Apple BASIC Made Easy. David A. and Marianne L. Gardner. Prentice-Hall, 
1984, ix + 246 pp, $19.95. [ISBN: 0-13-038928-5] A readable introduction to Apple BASIC. Includes 
the elementary commands, conditional statements, loops, arrays, and graphics. Some programming 
problems. Lots of cartoons. CEC 


Computer Programming, 1(13: 1). Introduction to Computer Science using the Turing Programming 
Language. R.C. Holt, J.N.P. Hume. Reston, 1984, xii + 404 pp. [ISBN: 0-8359-3168-4] Turing is a 
"Pascal-like" programming language that has been designed for use in teaching programming concepts 
to beginning programmers. It has simplified syntax, more user-friendly error messages, and simpli- 
fied data structures. According to the author, it has been successfully used in a large number of 
introductory computer programming courses in place of either BASIC, FORTRAN, or Pascal. MS 


Computer Programming, T(13-14: 1). Problem Solving with Structured FORTRAN 7/7. D.M. Etter. 
Benjamin/Cummings, 1984, xxii + 416 pp, $21.95 (P). [ISBN: 0-8053-2522-0] Structured problem solving 
with heavy emphasis on stepwise refinement, pseudocode, structured control statements, format free 
1/0. Many applications, chapters on file handling, character strings. RM 


Computer Programming, T(13-14: 1). VAX FORTRAN. Charlotte Middlebrooks. Reston, 1984, xii + 516 
pp, $15.95 (P). [ISBN: 0-8359-8243-2] An introduction to FORTRAN 77 intended for beginning students 
with no prior experience in the area of programming. The authors have chosen to make the text very 
machine specific. The text is geared around the DEC VAX-11 computer system. The text includes a 
discussion of the VAX-ll command language, editor, and filing system. Users of other computers 
would find this text very inappropriate. MS 


Computer Programming, P. Programming the IBM Personal Computer: Fundamentals of BASIC. Neill Gra- 
ham. Holt, Rinehart & Winston, 1984, xiii + 285 pp, (P). [ISBN: 0-03-059561-4] Introduction to 
BASIC on the IBM PC. Emphasis on simpler aspects, suitable for self study. RM 


Computer Programming, S(16). INMOS Limited: OCCAM, Programming Manual. Ed: C.A.R. Hoare. 
Prentice-Hall, 1984, 85 pp, $16.95 (P). [ISBN: 0-13-629296-8] OCCAM is a programming language 
designed to support concurrent applications. It is especially useful in designing systems in which 
many microprocessors are networked together to solve a problem. This text is a programmer’s refer- 
ence manual to the OCCAM language. It assumes the reader is already familiar with the concepts of 
block-structured programming languages and operating systems. MS 


Computer Programming, T(14-15: 1). Assembly Language Made Easy for the TRS-80. Chao Chien. Holt, 
Rinehart & Winston, 1984, xii + 228 pp, $18.45 (P). [ISBN: 0-03-070441-3] A tutorial on the TRS-80 
Models I, II, III computer systems and the assembly language of the TRS-80, called EDTASM. The text 
assumes a familiarity with the general concepts of high level languages, and frequently uses BASIC 
to explain concepts or make comparisons with assembly language. MS 


Computer Programming, T(13-14). Introduction to PASCAL and Structured Design. Nell Dale, David 
Orshalick. DC Heath, 1983, xvii + 586 pp, $19.95 (P). [ISBN: 0-669-06962-0] An introductory text 
in computer programming and the block structured language Pascal that makes no assumptions about 
previous experiences in either programming or mathematics. It covers the entire Pascal language and 
also includes discussions about such related topics as algorithms, top down design, debugging, and 


testing methods. MS 
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Data Structures, T(15-17: 1), L. Sequential Program Structures. Jim Welsh, John Elder, David Bus- 
tard. Ser. in Comp. Sci. Prentice-Hall, 1984, xii + 385 pp, $26.95. [ISBN: 0-13-806837-2] Focuses 
on the structures, both of programs and data, that arise in sequential (i.e., not concurrent) com- 
puter programs. Emphasizes the use of stepwise refinement, modular programming, abstract data 
types, and appropriate data structures. Stacks, queues, lists, trees, and graphs are among the data 
structures discussed. Example programs are presented in Pascal Plus. AO 


Software Systems, T(14-15: 1), L. An Introduction to System Programming--Based on the PDPll. Der- 
rick Morris. Springer-Verlag, 1983, 187 pp, $16.80 (P). [ISBN: 0-387-91230-4] Designed for an 
introductory course in systems programming. Covers what is necessary to make a "bare" machine 
usable at the assembly language level. AO 


Software Systems, L. The Computer Graphics Glossary. Stuart W. Hubbard. Oryx Pr, 1983, vi + 95 pp, 
$24.50. [ISBN: 0-89774-072-6] Brief one or two sentence definitions and explanations of between 500 
and 1000 terms. Mostly names of equipment and high-level concepts that would be found at the appli- 
cations program level in computer graphics. JAS 


Software Systems, T*(13-18: 1), S*, P, L. Smalltalk-80: The Language and its Implementation. Adele 
Goldberg, David Robson. Addison-Wesley, 1983, xx + 715 pp, $34.95. [ISBN: 0-201-11371-6] Smalltalk 
derives many important ideas from the simulation language Simula. The language is interpreted, 
interactive and object-oriented, e.g., a user can draw lines on a screen using a screen pen. Text 
is divided into four parts: (1) concepts and syntax of the language; (2) specification of the 
system’s functionality; (3) modelling discrete, event-driven simulations; (4) specification of the 
language’ s virtual machine. Indexes. RJA 


Software Systems, P. Computer Graphics: Theory and Applications. Ed: Tosiyasu lL. Kunii. 
Springer-Verlag, 1983, x + 530 pp, $45. [ISBN: 0-387-70001-3] Proceedings of InterGraphics “83 held 
in Tokyo, April 11-14, 1983. These papers present the state-of-the-art in computer graphics tech- 
nology and its applications. AO 


Software Systems, P. Working with RT-ll. David Beaumont, Anne Summerfield, Julie Wright. DEC, 
1983, x + 206 pp, $19 (P). [ISBN: 0-932376-31-2] An introduction to the use of a single-user real- 
time operating system for DEC computers. First volume of four planned. RWN 


Computer Science, T(14-17: 1). MICOS: A Microprogrammable Computer Simulator. Lubomir Bic. Com- 
puter Sci Pr, 1984, xi + 111 pp, $15.95 (P). [ISBN: 0-914894-76-5] MICOS was developed at the 
University of California at Irvine and is written in UCSD Pascal. It creates an environment where 
both the assembly language and the underlying microcode are accessible to the user. Logically the 
MICOS "machine" as delivered is moderately sophisticated (level of common microcomputers) and is of 
course microprogrammable to the users own specifications. This book describes the system and its 
instructions. It appears that a diskette, presumably with an applications-level manual, is avail- 
able for $15 from the publisher. JAS 


Computer Science, T(16-17: 1), P. Parallel Programming in ANSI Standard Ada. George W. Cherry. 
Reston, 1984, viii + 213 pp. [ISBN: 0~-8359-5434-X] This text addresses the issue of parallel pro- 
gramming; its notation to represent these programs is ANSI standard Ada. Some of the application 
areas addressed are parallel sort, root finding, search, and text copying algorithms. The emphasis 
is on how these parallel algorithms can be effectively and efficiently implemented in Ada using the 
Ada rendezvous mechanism. MS 


Computer Science, S(16-17), P, L. Fortran Optimization. Michael Metcalf. APIC Stud. in Data Pro- 
cessing, V. 17. Academic Pr, 1982, xii + 242 pp, $24. [ISBN: 0-12-492480-8] Presents techniques 
that can be used by compiler writers to generate efficient machine code from FORTRAN programs. Also 
discusses strategies a programmer can use to make FORTRAN programs more efficient. AO 


Computer Science, P. Lecture Notes in Computer Science-163: VLSI Engineering: Beyond Software 
Engineering. Ed: Tosiyasu L. Kunii. Springer-Verlag, 1984, vii + 308 pp, $14 (P). [ISBN: 0-387- 
70002-1] Focuses on VLSI-related engineering problems as they are used in the production of 
application-oriented computers. A collection of 14 papers presented at a symposium on VLSI 
engineering held in Japan in October, 1982. The papers are technical research papers addressing 
research issues in the manufacture of high density VLSI circuits. MS 


Computer Science, P. On Conceptual Modelling: Perspectives from Artificial Intelligence, Databases, 
and Programming Languages. Ed: Michael L. Brodie, John Mylopoulos, Joachim W. Schmidt. Topics in 
Inform. Sys. Springer-Verlag, 1984, xi + 510 pp, $28. [ISBN: 0-387-90842-0] Many advanced areas of 
computer science, including artificial intelligence, database design, and programming, must build 
very high-level abstractions which are independent of a particular problem or computer system. 
Examples include knowledge representation techniques or semantic networks. This text tries to tie 
together the ideas of abstraction under the title conceptual modelling. It contains nineteen 
advanced papers on this subject including discussions and analysis of the subject matter. MS 


Control Theory, T(16-17: 1, 2), P. Adaptive Filtering Prediction and Control. Graham C. Goodwin, 
Kwai Sang Sin. Inform. & System Sci. Ser. Prentice-Hall, 1984, xii + 540 pp, $41.95. [ISBN: 0-13- 
004069-xX] A textbook for advanced undergraduate or graduate students that incorporates many recent 
research results. Both deterministic and stochastic systems are covered. AO 
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Control Theory, T(17-18: 1). Kalman Filtering Theory. A.V. Balakrishnan. Univ. Ser. in Modern 
Engin. Optimization Software (Distr: Springer-Verlag), 1984, xii + 222 pp, $24 (P). [ISBN: 0- 
911575-26-X] A textbook for use in a one-quarter or one-semester graduate course. Focuses on those 
aspects of the subject that can be given a firm mathematical foundation. AO 
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tions to Stochastic Systems Theory. Harold J. Kushner. Ser. in Signal Processing, Optimization, & 
Control. MIT Pr, 1984, xvii + 269 pp, $40. [ISBN: 0-262-11090-3] A leading researcher in the area 
of stochastic processes as applied to control and communication theory explains how known techniques 
can be used more directly in certain applications, and presents some new methods. AWR 


Systems Theory, P*. Interdisciplinary Mathematics, V. XXII: Topics in the Geometric Theory of Linear 
Systems. Robert Hermann. Math Sci Pr, 1984, xvi + 291 pp, $50. [ISBN: 0-915692-35-X] First volume 
in a projected series “aimed at developing new geometric methodology to be used in engineering and 
the physical sciences." Directions of future volumes are indicated in author’s collection of papers 
in the Journal of Mathematical Physics since 1980. Forward on "Mathematical Engineering: Problems 
and Opportunities" is must reading even if one does not agree with author’s point of view. JK 


Systems Theory, P. Stochastic Phenomena and Chaotic Behaviour in Complex Systems. Ed: P. Schuster. 
Ser. in Synergetics, V. 21. Springer-Verlag, 1984, viii + 271 pp, $31.50. [ISBN: 0-387-13194-9] 
Collection of the invited contributions of 20 participants in an interdisciplinary workshop of the 


UNESCO working group on systems analysis. AWR 


Applications, $(16-18), P, L*. Mathematical Models of Morphogenesis. René Thom. Transl: W.M. 
Brookes, D. Rand. Ser. in Math. & Its Applic. Halsted Pr, 1983, 305 pp, $59.95. [ISBN: 0-470- 
27499-9] Translation of the second (1980) French edition, a collection of Thom’s papers on morpho- 
genesis dating from 1967 to 1981. Topics range from physics to biology, from linguistics to semiot- 
ics; all seek to provide "Speculative and philosophical" bases for catastrophe theory as a "stimulus 
to imagination." LAS 


Applications, P. Transactions of the First Army Conference on Applied Mathematics and Computing. US 
Army Research Office (P.O. Box 12211, Research Triangle Park, NC 27709), 1984, xxii + 925 pp, (P). 
Papers from a May 1983 conference held at George Washington University, combining two symposia--the 
Conference of Army Mathematicians and the Numerical Analysis and Computers Conference. Over 50 
papers on a wide variety of applied topics. LAS 


Applications (Architecture), T(15-17), S, L. Incidence and Symmetry in Design and Architecture. 
Jenny A. Baglivo, Jack E. Graver. Urban & Architectural Stud., V. 7. Cambridge U Pr, 1983, xi + 
306 pp, $54.50; $15.95 (P). [ISBN: 0-521-23043-8; 0-521-29784-2] Applications of graph theory (e.g., 
routing, bracing, planning) and transformation geometry (e.g., symmetries of the plane) to architec- 
ture. Topics are geometrical (topological) in nature with many excellent illustrations and 
diagrams. Exercises involve computation, simple theory, as well as paper folding and mirror experi- 


ments. LCL 


Applications (Artificial Intelligence), T?(17: 1), P. Automated Reasoning: Introduction and Applica- 
tions. Larry Wos, et al. Prentice-Hall, 1984, xiv + 482 pp, $28.95. [ISBN: 0-13-054453-1] Intro- 
duction to the theory and practice of automated logical reasoning programming using resolution-based 
inference rules and strategies. Theoretical foundations, language of clauses, representations, and 
applications to logic circuit design and validation, research in mathematics and logic, program 
verification, and expert systems. RM 


Applications (Biology), S(17), P, L. Graphs and Genes. B.G. Mirkin, S.N. Rodin. Transl: H. Lynn 
Beus. Biomath., V. ll. Springer-Verlag, 1984, xiv + 197 pp, $27. [ISBN: 0-387-12657-0] Graph 
theory is applied to the analysis of gene structure, gene semantics and gene evolution. A true 
melding of mathematical and biological research. Includes a list of references. CEC 


Applications (Biology), P. The Fluid Mechanics of Large Blood Vessels. T.J. Pedley. Mono. on 
Mechanics & Appl. Math. Cambridge U Pr, 1980, xv + 446 pp, $97.50. [ISBN: 0-521-22626-0] Aimed at 
mathematicians interested in cardiovascular fluid mechanics. Provides a comprehensive overview of 
what 1s known empirically and of the mathematical understanding of the observed phenomena. Much of 
the book describes areas stil] being actively researched. AO 


Applications (Biology), T(14-15: 1), S, L. Demography Through Problems. Nathan Keyfitz, John A. 
Beekman. Problem Books in Math. Springer-Verlag, 1984, viii + 141 pp, $28. [ISBN: 0-387-90836-6] 
Six chapters treat aspects of population theory: age-independent populations, life tables, projec- 
tion and forecasting, etc. Each chapter contains a brief, intuitive introduction to mathematical 
concepts, a large collection of problems, and complete or sketched solutions. Assumes a few facts 
about matrices and a thorough understanding of elementary calculus. PZ 
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An Analysis of Large Economies with Non-Convex Preferences. Walter Trockel. Springer-Verlag, 1984, 


viii + 205 pp, $14 (P). [ISBN: 0-387-12881-6] Comprehensive treatment of the existing literature on 
this subject; special emphasis on the "ergodic" approach to smoothing demand by aggregation. JRG 


Applications (Economics), T(14-16: 1), S$, L. An Introduction to Mathematical Models in Economic 


Dynamics. David L. Clements. Polygonal, 1984, $18.95. [ISBN: 0-936428-07-4] Introduction to 
mathematical economics. Begins with linear one-sector continuous and discrete models, proceeds to 
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generalizations: nonlinear and macroeconomic models, trade cycles, stabilization, growth models. 
Assumes a good mastery of elementary calculus and differential equations--the necessary difference 
equation material is covered in an appendix. Interesting, attractive application of discrete and 
continuous mathematics. PZ 


Applications (Engineering), P. Problems and Methods of Optimal Structural Design. N.V. Banichuk. 
Transl: Vadim Komkov, Edward J. Haug. Math. Concepts & Methods in Sci. & Eng., V. 26. Plenum Pr, 
1983, xxi + 313 pp, $42.50. [ISBN: 0-306-41284-5] An exposition of recent work done by Soviet 
mathematicians on problems of optimum shape and structural properties of elastic bodies subjected to 
external forces. Includes a bibliography of Western work in this area. AWR 


Applications (Engineering), P. Proceedings of Symposium on Finite Element Method. Gordon & Breach, 
1982, 478 pp, $92.50. [ISBN: 0-677-3102-0X] 


Applications (Information Theory), T(15: 1), S, P, L. Probability and Information. A.M. and I.M. 
Yaglom. Theory & Decision Lib., V. 35. D Reidel, 1983, xx + 421 pp, $69. [ISBN: 90-277-1522-X] The 
first English edition of this widely read book which originally appeared in 1957. Self-contained, 
extremely readable and a substantial list of references. No exercises. Includes introductions to 
probability theory, entropy and information along with applications. CEC 


Applications (Medicine), P. Lecture Notes in Medical Informatics-23: Selected Topics in Image Sci- 
ence. Ed: O. Malcioglu, Z.-H. Cho. Springer-Verlag, 1984, ix + 308 pp, $25.50. [ISBN: 0-387- 
12898-0] Papers on diagnostic imaging. Part one concerns algorithms and detectors. Part two deals 
with applications in emission tomography, digital radiography, ultrasound and nuclear magnetic reso- 


nance imaging. RJA 


Applications (Meteorology), P. Problems and Prospects in Long and Medium Range Weather Forecasting. 
Ed: D.M. Burridge, E. Kallén. Springer-Verlag, 1984, xii + 274 pp, $17.50 (P). [ISBN: 0-387-12827- 
1] Pioneering work by E. Lorenz shows that the atmosphere is inherently unstable: "Even if we have 
an almost perfect model with almost perfect initial data, we will never be able to make an accurate 
weather prediction more than a few weeks ahead." Papers in this volume, from a September 1981 
conference in Reading, England, explore the practical consequences of this theoretical limit, and 
are led off by a survey by Lorenz. LAS 


Applications (Physics), P. Mechanics Boundary Layers and Function Spaces. Diarmuid 0. Mathina. 
Dublin Inst for Adv Stud, 1984, 216 pp, (P). Monograph on theory of thin elastic plates and shells. 
The main goal is to make precise features of the passage from the three-dimensional to the simpler 
two-dimensional theory. It is achieved by a suitable integration in the thickness coordinate of the 
three-dimensional system equations. PZ 


Applications (Physics), P. Theory of Dispersed Multiphase Flow. Ed: Richard E. Meyer. Academic Pr, 
1983, ix + 388 pp, $28. [ISBN: 0-12-493120-0] Proceedings of an advanced seminar on the motion of 
multiphase fluids held in the Mathematics Research Center in Madison, Wisconsin in May 1982. Topics 
range from aerosols to ice crystals, from long chain polymers in liquids to dilute suspension of 
particles in the atmosphere. LAS 


Applications (Physics), T(18), P*. Symplectic Techniques in Physics. Victor Guillemin, Shlomo 
Sternberg. Cambridge U Pr, 1984, xi + 468 pp, $49.50. [ISBN: 0-521-24866-3] An introduction to sym- 
plectic geometry from a modern point of view. Applications in physics are an integral part of the 
presentation. AO 


Applications (Physics), P. Stochastic Quantum Mechanics and Quantum Spacetime: A Consistent Unifica- 
tion of Relativity and Quantum Theory Based on Stochastic Spaces. Eduard Prugovecki. Fundamental 
Theories of Physics. D Reidel, 1984, xxi + 302 pp, $48.50. [ISBN: 90-277-1617-X] The author pro- 
poses a unification of relativity with quantum mechanics using stochastic phase spaces and stochas- 
tic geometries. AO 


Applications (Psychology), P*. The Stochastic Modeling of Elementary Psychological Processes. James 
T. Townsend, F. Gregory Ashby. Cambridge U Pr, 1983, xix + 501 pp, $69.50. [ISBN: 0-521-24181-2] 
Major goals are to introduce the fundamentals of probabilistic modeling of certain simple cognitive 
processes and to develop mathematical theories encompassing broad classes of models. Good set of 
references. RSK 


Reviewers 


RJA: Richard J. Allen, St. Claf; MA: Melissa Anderson, St. Olaf; FA: Fahrad Anklesaria, Macalester; 
DA: David Appleyard, Carleton; PB: Peder Bolstad, St. Olaf; RB: Richard Brown, Carleton; JNC: Judith 
N. Cederberg, St. Olaf; CEC: Clifton E. Corzatt, St. Olaf; JD-B: John Dyer-Bennet, Carleton; JRG: 
Jennifer R. Galovich, St. Olaf; SG: Steven Galovich, Carleton; BH: Bruce Hanson, St. Olaf; PH: Paul 
Humke, St. Olaf; RSK: Richard S. Kleber, St. Olaf; JK: Joseph Konhauser, Macalester; LCL: Loren C. 
Larson, St. Olaf; RM: Richard Molnar, Macalester; RWN: Richard W. Nau, Carleton; LN: Linda Ness, 
Carleton; YN: Yves Nievergelt, St. Olaf; AO: Arnold Ostebee, St. Olaf; TR: Teresa Reardon, St. Olaf; 
AWR: A. Wayne Roberts, Macalester; MS: Michael Schneider, Macalester; JS: John Schue, Macalester; 
SS: Seymour Schuster, Carleton; JAS: J. Arthur Seebach, Jr., St. Olaf; KS: Kay Smith, St. Olaf; LAS: 
Lynn Arthur Steen, St. Olaf; MT: Michael Tveite, St. Olaf; MU: Milton Ulmer, Carleton; TAV: Theodore 
A. Vessey, St. Olaf; MW: Martha Wallace, St. Olaf; FLW: Frank L. Wolf, Carleton; PZ: Paul Zorn, St. 
Olaf. 


A great algebraist turned super administrator. (See p. 61.) 
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= 4G 1)? - 1? 4 


where we used the substitution jx? + §* = 1. In particular, when slicing T with 7, the planar 
surface area S, is equal to S(1) which is easily evaluated, viz., 


3 
(10) S, = 4f'13 — 1) (t+ 1) dt = -23.-1)7(t +12)" =8. 
1 1 
The volume V of the small region obtained by slicing 7 with 7 is given by 
3 3 
(11) v= f's(s) at 4? as PG - 970-1)? 
1 1 ¢ | 42 _ ¢? 
= af (3 -1)'7°(t- 1)'arecos| 7) dt. 
In the latter integral we substitute ¢ = 2 + cos, and the result is written as 
(12) V= 4f'(2 sin’ @ — 1 + cos¢sin’ ¢)arccos — do + 4 [“arceos — do. 
0 2+ cos¢ 0 2+ cos¢ 


Here, the first integral, to be denoted by V,, is reduced through an integration by parts, yielding 
G&G sin’? @ — cos ¢)sin’ > 
(13) y= 4 Usin'¢ ~ cose)sin’ $ __ 


0 (2+ cos¢)/(1 + cos$)(3 + cos¢) 


The integrand in (13) is reduced to a rational function by the standard substitutions cos ¢ = 1, 
(1 —- 1)/GB + 1)” =s, viz., 


11-3t-/(1-1t\'” 
(14) na 5f, 2+t a7 (a5) dt 
—3 + 145? + 5*)5? 
-2 (3 4 Mas" + SS" iy = 4/3 tog(2 + v3) — 28 — aa 
3 Jo (3 — s2)(1 + 5?) 3 3 
The second integral in (12) follows from (1) and (7). Thus we find for the volume V, 
2 

(15) V= =. — = - 2 + 473 log(2 + V3) — 2log?(2 + V3) = 13.03207..., 


where the numerical value has been checked by numerical integration of (12). The curved surface 
area §. of the torus slice is given by 


(16) S.= 4f"(2 + cos #)arceos do, 
0 


_ | 
2+ cosd 


which can be evaluated by the same methods as before. Omitting further details we present the 
final result 


(17) S. = 30? — 20 + 473 log(2 + V3) — 41og?(2 + V3) = 25.51226..., 


where the numerical value has been checked by numerical integration of (16). 
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BY ALL MEANS 


FRANK BURK 
Department of Mathematics, California State University, Chico, CA 95926 


It is easily shown that 


(1) 5 < tam x < x < sinh x < Fsinh2x, x>0. 

Vsinh’x + cosh’x 
If we take reciprocals in (1) and reverse inequalities, let x = In/b/a, where 0 < a < b, and 
multiply by $(b — a), we have 


2 b-—a a+b a* + b? 
) 11? <inb-ma~ 2 ~V 2” 


ab 


i.e., respectively, the harmonic mean, geometric mean, logarithmic mean, arithmetic mean, and the 
root-mean-square. 


A PROBABILISTIC PROOF OF A THEOREM OF SCHUR* 


INGRAM OLKIN 
Department of Statistics, Stanford University, Stanford, CA 94305 


Proofs that an n X nm symmetric matrix A is positive definite (or semidefinite) are usually 
based on showing that the quadratic form xAx’ or the eigenvalues A,(A),...,A,(A) are positive 
(nonnegative). We propose another procedure based on the following probabilistic fact: An n X n 
matrix = is positive semidefinite if and only if it is the covariance-matrix of a random vector 
X= (X,...,X,)- 

If = is positive semidefinite, then there exist Gaussian random variables having > as its 
covariance matrix (see, e.g., [2]). If X is a random vector with mean vector p, then the covariance 
matrix 2 is 2 = E(x — p)’(X — p), where E is the expectation operator, and hence > is positive 
semidefinite. (Since covariances are location invariant we can assume that the means are zero.) 

We illustrate the use of this fact to prove the following theorem of Schur (see, e.g., [1], p. 95): 


THEOREM. If A = (a,;), B= (b,,) are positive semidefinite, then C = (a;,,b,,) is positive 
semidefinite. 


Probabilistic Proof. Since A and B are positive semidefinite, there exist independent random 
vectors X = (X,,...,X,) and Y = (¥,,..., Y,) with zero means and covariance matrices A and 
B, respectively. Define Z, = X,Y,, i= 1,...,n. Then 

EX,X,) 


c,, =Cov( Z,, Z;) = E(X,Y,)(X,¥,) = ( iy 
= Cov( X,, X,)Cov(¥,, ¥,) = 


so that C = (c,,) is positive definite. 


*Supported in part by the National Science Foundation. 
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This technique can also be used to prove the following theorem about Kronecker products. 
(See, e.g., [1], pp. 235-243.) 


THEOREM. If A = (a;,) and B= (b,,) are positive semidefinite, then the Kronecker product 
A ® B= (a;,B) is positive semidefinite. 


As a final demonstration of these ideas, we show that the symmetric n X n matrices 


n n—1l n—2 a | 
24n-1) %n-2) --- 2 
4a (m= 2) 3] 
n 
a; a; a; vee a; 
a; + a3 a* + a3 ve a; + a3 
B= a; +a +a} vee ay +a,+a} 


aj +--+ +a? 


are positive semidefinite by showing that they are covariance matrices. 

(i) If X,,..., X, are independent random variables from a uniform distribution on [0,1] and 
Xa) < ++: < Xn) are the order statistics, then, except for a constant, the matrix A is the 
covariance matrix of (X(1),..-, X(ny)- 

(ii) The matrix B is the covariance matrix of random variables 


Z,=X,,Z,=X,+%,...,Z,=X, ++: +X, 


where X,,..., X,, are independent random variables with zero means and variances a?,..., a2, 


respectively. 
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ENUMERATION IN MUSIC THEORY 


DAVID L. REINER 
Argonne National Laboratory, 9700 South Cass Avenue, Argonne, IL 60439 


The calculation of the number of chords and tone rows using Pdlya’s Theorem and Burnside’s 
Lemma can add a little variety to the applications of these theorems usually given in combinator- 
ics and algebra texts. 

The n-scale is taken to be equal (well) tempered and consists of the integers from 0 to n — 1. 
We equate octave notes, so our scale is mathematically Z, with addition. For example, ordinary 
Western music has n = 12, Debussy used a whole tone scale with n = 6, and Ives used a quarter 
tone scale with n = 24. (Of course, in terms of frequencies of pitches, the physical scale is a 
multiplicative group, but this makes no difference here.) 

A k-chord in the n-scale is an equivalence class (defined in Section 1) of subsets of k elements 
each of Z,, and an n-tone row is an equivalence class (defined in Section 2) of permutations in S,. 
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1. Chords. The equivalence relation is induced by one of two permutation groups. We can use 
the transpositions (not to be confused with permutations which exchange two elements) T’: Z, > 
Z,.a*> i+ a(modn); a jazz guitarist generally uses chords which can be transposed easily so 
not so many fingerings must be memorized. Or we can use the larger group of transpositions (as 
above) and the inversion J:a — —a(modn). A typical group element looks like T'I: a 
i— a(modn). This use of the word “inversion” is standard in serial music and must be 
distinguished from the usual meaning of octave transposition of one or more notes of a chord. For 
example, under the larger group, a C major chord is equivalent to a C minor chord: {C, E,G} = 
{0,4,7} ~ {0,3,7} = {C, Eb,G}, the ~ being via T’T. 

It is easy to see that these groups are the cyclic groups C, and the dihedral groups D,, 
respectively, and we are thus dealing with what is known as the two-color necklace problems, 
either one-sided or two-sided [3, p. 162]. That is, we have a circular necklace of notes to choose 
from, and we choose the ones in the chord by coloring them one color and the rest another color. 

Polya’s Theorem is the appropriate tool for this situation: a group acting on a domain set D, 
inducing a group action on the set of functions into some range set R. It suffices to let G be a 
group of permutations of D. G then acts on the set of all functions R? in a natural way, inducing 
an equivalence relation. 

To keep track of what’s important in such a problem, we use enumerators, polynomials or 
formal power series whose coefficients are the numbers of objects of specified types. The 
enumerator of the range here can be whatever we like, say L,.- pw(r), where w, the “weight,” is 
any function from R to a commutative ring containing the rationals. The weight of a function 
from D to R is then defined to be the product of the weights of its image values, and the 
enumerator or inventory of a set of functions is the sum of the weights of the individual functions. 
Usually this set of functions is a set of equivalence class representatives when we have an 
equivalence relation on R?. 

In our situation of chords, R = {0,1}, w(0) = 1, w(1) = x, and the weight of a function from 
D to this R is x*, where k is just the size of the subset of D mapped to 1. This is how we 
enumerate subsets or chords. 

The importance of Pdlya’s theorem is that it relates the inventory of equivalence classes of 
functions to the “store enumerator,” the enumerator of the range, via the cycle index P, of the 
group G acting on D, which is defined by 


1 
Po (ty, t,---) ~ 1G dy try ---, 
geG 


where a, = the number of i-cycles in g. 
Polya’s Enumeration Theorem [2, p. 148] states that the enumerator or inventory of equiv- 
alence classes under G of functions in R” is 


2 k 
P,(Lw(r), Dw(r)s-.-, Dw(r)*,...). 
The cycle indices of C, and D, are well known [3, pp. 149-150]: 
1 ee 
Pots taresty) => UOC =— 9, 
j\n 
and 


® + nt, th" 9/7], if n is odd, 


inl 
Py (ths tas-+-st,) 1 No may 3 a 
-;-|¢ tan” (4; + t»)), if n is even. 


Here we use $(/), which is the Euler phi-function, the number of positive integers less than 
which are relatively prime to 7, and ®, which is short for the sum in the first equation. 
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Our enumerator is just 1 +x, and ¢t;=1-+ x/. Hence the number of k-chords is the 
coefficient of x* in P(1 + x,1 + x?,...,1 +x"). For C,, a little rearrangement produces 


1 A 1 
# of k-chords = — | =—-@(k), 
7, D(A) =H 


where ®, (x) is short for the summation. For D,, the additional term gives 


-1)72 
_ ®(k)+n (n ~ 1)/ , if n is odd, 
2n [ k/2] 
J 1 n/2 ; 
# of k-chords = a7 fo, (4) + n( al if n is even and k is even, 
1 (n/2) -1 


). if n is even and k is odd. 


ane +n [k/2] 


The situation in twelve-tone music is n = 12 with dihedral symmetry, and we obtain the 
following table. 


k= O;1;)2); 3) 4) 5) 6) 7} 8} 9/10] 11) 12 
number | 1 | 1 | 6 | 12 | 29 | 38 | 50 | 38 | 29/12; 6| 1] 1 


Thus, there are 50 hexachords in twelve-tone music and only 6 intervals, a fact well known in 
music theory, [1]. 


2. Tone rows. Here our equivalence classes are induced by the group generated by 


transposition T: S, > S,:(a,,...,a,) ? (a, + 1,...,a, + 1) (mod n); 
inversion [:(@,,a5,...,a,) ~ (@,,2a, — a5,...,2a, — a,,) (mod n); 
retrogradation R:(a,,...,a,) > (a@,,...,@). 


This group is not well known; however the entire structure of the group is not really needed. 
We can take a cue from music theorists, who regard transposition as such a basic transformation 
that they don’t work with all permutations, but work with equivalence classes of permutations 
under transposition, those beginning with 0 being regarded as class representatives. Hence our set 
is now the set of (n — 1)! permutations of {1,..., — 1} with a prefix of 0, and our group is 
generated by R and J. On this new set, RI = IR, so we have the Klein four group V acting. 

Burnside’s Lemma is the appropriate tool in this setting. 


Burnside’s Lemma [2, p. 136] states that for a group G acting on D, the number of equivalence 
classes is 


1 


Gj >> (# of elements of D fixed by g). 


geG 


The computation proceeds as follows (see the appendix for details). In order to avoid triviality, we 
assume that n > 3. 


Group Element Number Fixed 
n odd n even 
e (n — 1)! (n—1)! 
I 0 0 
R 0 (n — 2)(n — 4)--- (2) 


IR (n — 1)(n — 3)--- Q) (n/2)(n — 2)(n — 4)--- (2). 
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Thus the number of n-tone rows is 
4{(m — 1)! +(n - 1)(n — 3)--- (2)] if n is odd; 
4[(m — 1)! +(n — 2)(n — 4) ---(2)(1 4+. 1/2)] ifn is even. 


For example, there are 9985920 twelve tone rows, a fact which does not seem to be in the 
literature. 


Appendix. Computation of numbers of fixed elements. 


1. I(0,a5,...,a,) = (0, —a,,..., —a,) ~ (0,a5,...,a,) implies a, = —a, (modn) for i= 
2,...,” Since no transposition is allowed because the first element 0 is fixed. There is at most one 
nonzero solution to x = —x (mod n), but that is not enough to fill out the permutation. 

2. R(a,,...,a,) = (a@,,..-,@,) ~ (a,,...,a,) implies that a ¢ exists such that a, =a, + t, 
a, =a,_,+t,... (modn). If n is odd, the middle element is fixed and no transposition is 
allowed: ¢ = 0. But then a, = a,, a contradiction. If n is even, the first and last congruences 
imply that 2¢ = 0; hence, t = 0 or t = n/2. The first is impossible just as when n is odd, but the 
other gives fixed permutations. Since a, = 0, a, = n/2. For a,, we can choose any of n — 2 
elements, and this determines a,_,. For a,, we have n — 4 choices, etc. 

3. IR(a,,...,a,) = (—4,,..-, —@,) ~ (a,,...,a,) implies that a ¢ exists such that a, + a, = 
t, dy + a,_, =1,.... The last congruence for n odd is 24,,,, 1). = t (mod n). Clearly it is not 
important that we fix the first element as 0; we could fix a,,,43)/. as 0 and obtain the same count. 
Thus we may assume that ¢ = 0, @,,,,1)/2 = 0. This allows n — 1 choices for a,, with a, thus 
determined, n — 3 choices for a,, etc. For n even, we fix a, = 0, a, # t #0. A little thought 
shows that ¢ must be odd in order for us to complete the permutation. If t = 2k, then there is no 
mate for k in the permutation. There are thus n/2 choices for t = a,. For a,, there are n — 2 
choices, with a,_, determined thereby, etc. 


The author thanks Dennis White for helpful comments. 
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BIJECTING EULER’S PARTITIONS-RECURRENCE 


DAVID M. BRESSOUD 
Department of Mathematics, Pennsylvania State University, University Park, PA 16802 


DORON ZEILBERGER 
Department of Mathematics and Computer Science, Drexel University, Philadelphia, PA 19104 


A partition of an integer n is a nonincreasing sequence of positive integers A(1) > A(2) > -:- 
> A(t) > 0, such that A(1) + --- +A(t) = n. The set of partitions of n is denoted Par(n) and its 
cardinality |Par()| is written p(n). For example, 

Par(5) = (5; 4,1; 3,2; 3,1,1; 2,2,1; 2,1,1,1;1,1,1,1,1} and p(5) =7. 
There is no closed form formula for p(n) but Euler ({1], p. 12) gave a very efficient way for 
compiling a table of p(n) by proving the recurrence 


(1) ~ p(n-a(j))= Y p(n-a(j)), where a(j) = (3/7? + j)/2. 


j even j odd 


Euler used generating functions to prove this formula. Garsia and Milne [2] gave a very nice 
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byective proof of (1), utilizing their Involution Principle. We are going to give another bijective 
proof which does not require any iterations and is very simple. Indeed, 


o: U Par(n—a(j)) @ U Par(n-a(j)), 
j even j odd 
given below does the job. 
Let (A) = (AQ),...,A(4)) € Par(n — a(/)). Then define ¢ by 
(t+ 37-—1,A(1) -1,...,A(¢t) —1) © Par(n-—a(j-1)), iff+37>A(), 
o((A)) = ¢ (A(2) +1,...,A(¢4) +1,1,1,...,1) € Par(n-—a(jt+1)), ift+3j <A), 
where there are \(1) — 3j — t — 1 1’s at the end. 


Note that applying ¢ twice yields the identity mapping, thus ¢ = @ ' and ¢ is invertible. 


EXAMPLE. n = 21. 
$(5,5,4, 3, 2) = (7,4,4,3,2,1). 


Here (5,5, 4,3, 2,) € Par(19) = Par(n — a(1)) so j = 1. The number of parts ¢, is 5 and we have 
t+ 37 > A(1), since 5 + 3 > 5. Now consider $(7, 4, 4,3,2,1); here 7 = 0, t = 6, A(1) = 7 and 
6+ 0 <7. Also A\(1) - 37 —-t-1=7-—0-—6 —1=0so0 we do not add any 1’s at the end and 
(7, 4, 4, 3, 2,1) = (5,5, 4, 3, 2). 
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THE TEACHING OF MATHEMATICS 
EDITED BY MARY R. WARDROP AND ROBERT F. WARDROP 


Material for this department should be sent to Professor Robert F. Wardrop, Department of Mathematics, Central 
Michigan University, Mount Pleasant, MI 48859. 


ON THE CONVOLUTION OF CAUCHY DISTRIBUTIONS 


MEYER Dwass 
Department of Mathematics, Northwestern University, Evanston, IL 60201 


The characteristic function of a probability distribution is usually too advanced a topic for a 
first undergraduate course in mathematical statistics and the more limited moment generating 
function is often used instead. In teaching the distribution of sums of independent random 
variables such as normal, gamma, or uniform, I supplement the use of the moment generating 
function with the convolution formula, 


(1) fx a(u) =f" f(x)g(u~ x) ak. 


For sums of independent Cauchy random variables the moment generating function does not 
apply and the use of the convolution formula is difficult. Undoubtedly, it is generally understood 
that if f and g are Cauchy densities, a partial fraction decomposition of the integrand in (1) 
should lead to an explicit evaluation of the convolution integral but I do not find the details 
worked out anywhere. (See comment in Feller, [1], p. 51.) The purpose of this note is to outline 
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these details as I have presented them in the classroom. The algebra is elementary and well within 
the scope of an undergraduate course. 
The calculation is based on the following relationship: 


LEMMA. Fora > 0, let 


f(x, a) — 


— tL eye 

na|1 +(x/a)’| * ” 

Then 

[a+b —-ux/a] f(x,a)+[a+b—-u(u—x)/b] f(u—- x,b) 
2n|(a +b) + u?| 

n [-a+b+ ux/a] f(x,a)+[a-—b+ u(u— x)/b| f(u —- x,d) | 

2n|(a — pb)? + u? | 


f(x,a)f(u-x,b)= 


From the lemma it follows immediately that 
(2) f f(x,a)f(u—x,b) dx =f(u,a+t b), 
since the integral in (2) exists and 

f f(x, a) f(u—-— x,b) dx = lim fo f(a) f(a = x, b) ax 

— 00 T?oY%-—T 
and 

T T 
li ,a)dx = li _ — x, = 0. 
Him fo xf(x a) dx fim fou x) f(u—-—x,b) dx =0 


Proof of Lemma. For a> 0, define 


1 
n(x, a) = 2ma(1 + ix/a)’ 
h(x, —a) i 


~ 2ma(1 — ix/a)’ 


where i=y-1. 


The following steps follow by routine algebra and the details are left to the reader: 
(a) f(x, a) = h(x, a) + h(x, — a), 


(b) f(x, a)f(u — x,b) = h(x, a)h(u — x,b) + h(x, a)h(u — x, -—b) + h(x, —a)h(u — x, b) 
+ h(x, —a)h(u — x, —b), 


h(x,a)+h(u-—x,b 
(c) A(x, a)h(u — x,b) = ee ee) 
—h(x,a)+h(u-x,—-)b) 


A(x, a)h(u — x, 0) = eS in) 


h(x, - h(u-—x,—b 
Me = ayn % —2) = Mee) 


9 


1985] PROBLEMS AND SOLUTIONS 37 


(d) h(x, a)h(u — x,b) + h(x, —a)h(u — x,b)= 
[a+b— ux/a] f(x,a)+[a+b—-u(u—-x)/b] f(u- x 1b) 
2n|(a +b) +u 2| 
h(x,a)h(u — x,—-b)+ h(x, —a)h(u — x,b) = 
[-a+b+ ux/a] f(x,a)+[a-b+u(u—x)/b] f(u- x, b) 
2n| (a —b)’ +u 2| 
The lemma now follows by combining the last two equations. 
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EDITOR EMERITUS: EMORY P. STARKE. COLLABORATING EDITORS: VINCENT BRUNO, FRANK S. CATER, 
GULBANK D. CHAKERIAN, A. M. DAWES, MICHAEL J. DIXON, UNDERWOOD DUDLEY, RICHARD A. GIBBS, 
CLARK GIVENS, RICHARD M. GRASSL, DOUGLAS A. HENSLEY, ISRAEL N. HERSTEIN, ROBERT H. JOHNSON, 
ELGIN H. JOHNSTON, MURRAY S. KLAMKIN, DANIEL J. KLEITMAN, JOSEPH D. E. KONHAUSER, FREDERICK W. 
LUTTMANN, MARVIN MARCUS, LOUISE E. Moser, M. J. PELLING, C. M. REIS, J. O. SHALLIT, B. L. R. 
SHAWYER, EDWARD T. H. WANG, AND ALBERT WILANSKY. 


Send all proposed problems, typed and in duplicate if possible, to Professor D. H. Mugler, Department of 
Mathematics, University of Santa Clara, Santa Clara, CA 95053. Please include solutions, relevant references, 
etc. 

An asterisk (*) indicates that neither the proposer nor the editors supplied a solution. 

Solutions should be sent to the address given at the head of each problem set. 

A publishable solution must, above all, be correct. Given correctness, elegance and conciseness are preferred. 
The answer to the problem should appear right at the beginning. If your method yields a more general result, so 
much the better. If you discover that a MONTHLY problem has already been solved in the literature, you should of 
course tell the editors; include a copy of the solution if you can. 


ELEMENTARY PROBLEMS 


Solutions of these Elementary Problems should be mailed in duplicate to Professor D. H. Mugler, Department 
of Mathematics, University of Santa Clara, Santa Clara, CA 95053, by May 31, 1985. Please place the solver’s 
name and mailing address on each (double-spaced) sheet. Include a self-addressed card or label (for acknowledge- 
ment). 
E 3069. Proposed by Zhang Zaiming, Yuxi Teachers’ College, Yunan, China. 
The m X m determinant J = |a,,| has a,, = fo x* '[F(x)]" "*' dx, where F is nondecreas- 

ing on [0,1]. Prove that J > 0. 

E 3070. Proposed by Gerard Letac, Université Paul Sabatier, Toulouse, France. 


Let a and (t,,),,5, be strictly positive numbers and let the sequence (1,,),,, be bounded. Prove 
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that 
io 6) io 6) 
i+ Vag: ttf > L (a/lat 1))"". 
n=0 


n=1 
E 3071. Proposed by K. Satyanarayana, Hyderabad, India. 


If in triangle ABC, angle C > angle B > angle A, then prove that J lies inside triangle OBH 
where, as usual, O,/7,H denote the circumcenter, incenter and orthocenter, respectively, of 
triangle ABC. 


E 3072. Proposed by Louis Funar, Craiova, Rumania. 
Let P(x) =a) + a,x +--+: +a,x" be a real polynomial of degree n > 2 such that 


[n/2] 1 
O< do <— 2 app aa: 


Prove that P(x) has a real zero r such that |r| < 1. 


SOLUTIONS OF ELEMENTARY PROBLEMS 
A Four-Point Tangency 


E 2911 [1981, 705]. Proposed by Jordi Dou, Barcelona, Spain. 


Let 2 semicircles AC, CB, AC = 3CB, be given (ACB are collinear). Let a, b be the tangents to 
the given semicircles at A, B. Let y be the circle tangent to a,b and to the larger of the given 
semicircles. Prove y, b, and the given semicircles have a common tangent circle. 


Solution by William A. Newcomb, Lawrence Livermore National Laboratory, CA. Let a 
Cartesian coordinate system be introduced such that the points A, C, B are assigned the respective 
coordinates (0, 0), (0, 6), and (0, 8). Also, let us use the notation (a, 8, r) to denote the circle whose 
equation is (x — a)? + (y — B)? =r’. 


B(0, 8) 
(0, 7) 


C(0, 6) 


(0, 3) 


A(0, 0) 


Fic. 1 
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The criterion for two circles, (a, 8, r) and (a’, B’, r’), to be externally tangent is 


l(a —a’) +(B- py |” =rt+r’, 
The two given circles are (0,3,3) and (0,7,1), and the circle y is (4/3 ,4,4). The circle 
(4/3 /3,20/3,4/3) is externally tangent to all three of these circles, and also tangent to the 
straight line y = 8. 


—_ 


~] ~3/5 0 


Fic. 2. After inversion with respect to I. 


Solution by I. J. Schoenberg, University of Wisconsin-Madison. In C we choose a coordinate 
system so that A = +1, C = —1, and therefore B = —5/3. If a < B are reals, we denote by 
(a, 8) the circle of diameter [a, 8]. Performing an inversion with respect to the circle T = (—1,1) 
(see Fig. 2), we find that the tangent b goes over into the circle T’ = (—3/5,0), and that the 
problem is equivalent to the statement: The (excentric) ring between the two circles T,T’ contains 
a (closed) Steiner system of six circles which are pairwise tangent and all tangent to T and YT’. To 
make the ring [', I’ concentric, we perform a real Mébius transformation M such that MT =T 
and MI’ = (—a,a)=I™*. The invariance of the anharmonic ratio shows that we must have 
(—1, —a,a,1) = (—1, —3/5,0,1) whence a = 1/3. Since the circle (1/3,1) is seen from the 
origin 0 under an angle of 60°, the Steiner chain of six circles within the ring between I’ and I’* 
becomes obvious (see Fig. 3). 


Vo 


~] -1/3 0 1/3 


—_ 


Fic. 3. The sixth circle is now obvious. 


Also solved by the proposer. 
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Six Concurrent Circles 


E 2963 [1982, 594]. Proposed by Calin P. Popescu, student, Bucharest, Rumania. 


Let A,A,A,, A{A5 Aj be two equilateral triangles in the plane. Construct circles y, [y,] with 
radii r, [r’] and centers A, [A’], i = 1,2,3 [i = 1,2, 3]. Suppose further that 7, [r’] are geometric 
progressions with ratio a positive integer. When can the six circles be concurrent? 


Solution by L. Kuipers, Switzerland. We show that three circles centered at the vertices of an 
equilateral triangle A,A,A, and with radii in geometric progression with integer ratio m are 
concurrent at a point P only if m = 1, in which case P is the centroid of the triangle. Thus the six 
circles in the problem are concurrent only if the two triangles have a common centroid. 

Any point P such that 

A,P A;P 

A,P  AyP 
must be a point of intersection of the two Appolonian circles determined by the first and second 
equations respectively. It is easy to see that these circles do not intersect if m > 2. Hence m = 1, 
in which case the circles degenerate to two medians intersecting at the centroid P of A,A, 43. 


Also solved by J. Dou (Spain), O. P. Lossers (The Netherlands) and the proposer. 


Editorial note: The proposer wishes to say that he was inspired by the memory of his friend Stephen Ghiviraga. 


ADVANCED PROBLEMS 


Solutions of these Advanced Problems should be mailed in duplicate to Professor D. H. Mugler, Department 
of Mathematics, University of Santa Clara, Santa Clara, CA 95053, by May 31, 1985. The solver’s full 
post-office address should be on each sheet. 


6484. Proposed by A. V. Nabutovskii, Novosibirsk, U.S.S.R. 
Suppose that x = L™_,p,/q, < oo, where the p,’s and gq,,’s are positive integers satisfying the 
inequality 
Pn Pn+1 
—— => ———— 
An In 7 1) In+1 — 1 
Let S be the set of indices n for which the inequality is strict. Prove that x is irrational if and 
only if S is infinite. 


forn =1,2,.... 


6485. Proposed by Alexandru Lupas, Sibiu, Rumania. 
Given a function f € C[0,1], let (B, f)(x) denote the Bernstein polynomial 
n _ k 
¥ (ja - 2)" “4(5). 
ino \* n 
If f € C™ [0,1], prove that, for0 <x <1,n=1,2,..., 
x(1 Zz x) 1 1 ” 2 ve 
(BDC) -BaNi< =? (s-firora) 
6486. Proposed by Liviu I. Nicolaescu (student), University Ai Cuza Iasi, Rumania. 


Let S,, denote the set of all permutations of 1,2,...,”. For 6,7 € S,, define d(o,7) to be the 
number of inversions of the permutation o - +‘. Prove that d is a metric on S,. 
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6487. Proposed by Pei Yuan Wu, National Chiao Tung University, Hsinchu, Taiwan, Republic 
of China. 


Let T be a contraction (i.e., ||7'|| < 1) on a Hilbert space and, for n = 1,2,..., let 


K,=1+ & (1 - Ao )cre + re) 
k=1 ne 

be its “Fejér kernel”. Prove by an elementary method (i.e., without the use of spectral theory or 

dilation theory) that K, > 0 for n =1,2,.... 


SOLUTIONS OF ADVANCED PROBLEMS 
An Exponential Diophantine Equation 


6411 [1982, 788]. Proposed by Leo J. Alex, SUNY, Oneonta. 
Find all solutions in integers a, b,c, d to the equation 1 + 3% = 5° + 3°54. 


Solution by the proposer. We show that the only solutions in integers are (a, b, c,d) = (t, 0, t, 0), 
(2,1,0,1) or (3,2,1,0). The key to proving this is a result which follows from work of R. J. 
Stroeker and R. Tiydeman, [ Diophantine Equations, Computational Methods in Number Theory, 
MC Tract 154(Part I), MC Tract 155(Part ID, (1983), pp. 321-369]: 


LEMMA. If 0 < [3% — 5°| < 3”/?, where u and v are integers, then (u,v) = (3,2). 


We now consider the given equation. If b = 0, then a = c, d= 0; ie., (a, b, c,d) = (1,0, t, 0). 
Thus we may assume that b > 0. Then a > c and so 5” = 1 (mod 3°). Since 5* ¥# 1 (mod 32), 5 is 
a primitive root mod 3°. It follows that 2 - 3‘~'|b when c > 0, and hence that 3° < 3b/2. We 
distinguish the Cases (i) b > d and (ii) b < d. 

In Case (i) 37 = —1 (mod 5“) and so, since 3 is a primitive root mod 5%, 2 - 54~'|a when 
d> 0. Hence, 54 < 34/2 and so 0 < 34 — 5° < 15ab/4. Consequently, b < alog3/log5 and 
0 < 3% — 5° <2 - 6a’. By the lemma we have (3% — 5°| > 3°”? if (a, b) # (3,2). Hence, 2 - 6a’ 
> 34/7, and so a<10,b<6,c <2,d <2. It follows easily that (a,b,c,d) = (2,1,0,1) or 
(3,2, 1,0). 

In Case (ii) we have 3° = —1 (mod 5”), whence 5° < 5a/2. Thus 0 < 3% — 3°54 < 5° < 5a/2, 
and hence 0 < 3° -° — 54 < 5a3~°/2. Suppose first (a — c,d) # (3,2), so that, by the lemma, 
I327—° — 54| > 3(4- 9/2. Then 3°”? < 5a/2, and so a < 4, b=1, d > 2. Since 5/3 + 1, we obtain 
a = 2, a contradiction. Finally, suppose (a — c,d) = (3,2). Then b = 1, c = 0, and thus 3° = 2a, 
again a contradiction. Therefore there are no solutions in Case (ii). 


Also solved by I. A. Sakmar (Canada), and partially solved by Mihaly Bencze (Rumania). 


Partition of Integers 


6428 [1983, 338]. Proposed by J. M. Steele, Princeton University. 


Let e be a separation of {1,2,...,2”} into two disjoint classes a, <a, < --- <a, and 
b, <b, < --- b,. Let M, denote the number of solutions of a;—- b =k,—2n<k < 2n, 
and put M(n) = min,max, M,. Selfridge, Motzkin, and Ralston showed liminfn~'M(n) < .4 
and Moser showed lim sup n~!M(n) > .357. Prove that the limit lim n~'M(n) actually exists. 


Solution by the proposer. Let M* = liminfn™='M(n),6> 0, and choose an n such that 
n-\M(n) < M* + 6. If the optimal separation e for me n is given by (a,), (5;), we conser the 
separation e’ of {1,2,...,2mr} given by (ra; + h)?_,,0 <h <r and (rb, + h’ inp O<h’ <r. 
The number of elements of this separation with distance k is equal to the ‘number of solutions of 
(ra, + h) — (rb, + h’) = k, and this is easily seen to be bounded by rM(n). 
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This proves M(rn) < rM(n); hence limsup, ., ,,(m) 'M(rn) < M* + 6. Since M(mn +h) < 
M(rn) +h we also have limsup n”'M(n) < M* + 6, which suffices to prove the claim. 


Also solved by Tim Keller. 


Differences Between Consecutive Primes 


6429 [1983, 338]. Proposed by B. Powell, Kirkland, WA. 


Prove that for any positive integer M there exists an even positive integer ny) = n)(M) such 
that the number of pairs of consecutive primes (P,, P,,) that differ by exactly ny is greater than 
M. 


Solution by Roy O. Davies, The University of Leicester, England. Suppose the assertion false. 
Then there is a bound M to the number of times any difference between successive primes 
appears. In particular, there are at least (n — 2)/M distinct differences between successive primes 
in the sequence 3 = py, p3,..., P,,, and so 


Py Pr 22444 ++ +2 


But this contradicts the elementary estimate p, = O(n logn). 


Also solved by Richard Beigel, Robert Breusch, Keith Kearnes, O. P. Lossers (Netherlands), Victor Pambuccian 
(Rumania), University of South Alabama Problem Group, and the proposer. 


REVIEWS 


EDITED BY ALLAN L. EDMONDS AND JOHN H. EWING 
COLLABORATING EDITOR FOR FILMS: SEYMOUR SCHUSTER 


Sophie Germain. An Essay in the History of the Theory of Elasticity. By Louis L. Bucciarelli and 
Nancy Dworsky. D. Reidel, Boston, 1980. xii + 143 pp. $31.50 (cloth), $15.75 (paper). 


JOSEPH W. DAUBEN 
Herbert H. Lehman College, CUNY, Bedford Park Boulevard West, Bronx, NY 10468 


This is a book whose title should have been reversed, for accuracy’s sake, to read: An Essay in 
the History of the Theory of Elasticity: the (Minor) Role of Sophie Germain. Reading the authors’ 
preface, which describes the progress of their research and thinking that eventually led to this brief 
history, it becomes clear that elasticity is their real interest. Almost accidentally, however, they 
discovered “that a woman appeared to have made a significant contribution in the field” (page 6), 
and she subsequently aroused their curiosity. Of course, one understandably suspects why both 
the authors and their publisher chose to emphasize Germain in the title, for (apart from 
specialists) who would be likely to read, let alone buy, a history of theories concerning elasticity? 
The additional story, however, of a French woman born in the revolutionary year 1776, the 
self-taught product of France’s own decade of upheaval, who not only took it upon herself to 
study mathematics and its physical applications, but to compete for a prix extraordinaire offered 
by the Institut de France, and then actually win its gold metal in 1816— might well attract many 
more readers, certainly those interested in the social history of women, as well as those interested 
in mathematics and its more esoteric aspects. Unfortunately, in point of fact, Germain was neither 
a very good mathematician, nor very successful in the applications she tried to make of 
mathematical techniques in her various studies of elasticity. Nor does she appear to have been an 
exceptionally interesting personality. Although little in fact is known about her life, what this 
biography does manage to reveal suggests that Germain was possibly the most lackluster of the 
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great or near-great women, not only of her own day—consider Madame de Staél, for example—but 
of the recent history of mathematics as well. Germain’s biography lacks any of the social or 
political interest of Sophie Kovalevskaya’s, and she offers none of the originality or depth of 
mathematical insight that characterize Emmy Noether’s work. But the authors of Sophie Germain 
are not responsible for their subject’s shortcomings, personality, or for the reclusive and rather 
prosaic life she seems to have led. Instead, it is necessary to ask what their book sets out to do, 
and what interests it manages to serve? 

Readers of this MONTHLY will naturally be interested primarily in the mathematical aspects of 
her work, which can be summarized rather briefly. Her most impressive results were in number 
theory. A limited correspondence with Gauss, which survives in the Universitatsbibliothek, 
Gottingen, and in the Bibliotheque Nationale, Paris, shows that he was impressed not only with 
what she had accomplished in this area, but also with the fact that she was a woman. This did not, 
however, affect the first impressions he had of her work, for when she initiated their correspon- 
dence in 1804, she used the nom de plume “Monsieur LeBlanc.” Two years later, however, when 
Napoleonic forces were occupying Prussia, Germain was concerned with Gauss’ safety in 
Brunswick, and used family connections to have a French army officer intercede personally on 
Gauss’ behalf. Later, she wrote to him, explaining that “fearing the ridicule attached to a female 
scientist,” she had “previously taken the name of M. LeBlanc in communicating to you those 
notes that, no doubt, do not deserve the indulgence with which you have responded.” As a result 
of this revelation, however, Gauss was not only indebted to Germain for her concern, but now was 
doubly interested in her mathematics and in the fact that she was a woman. “The taste for the 
abstract sciences in general and, above all, for the mysteries of numbers, is very rare,” he 
responded. “But when a woman, because of her sex, our customs and prejudices, encounters 
infinitely more obstacles than men in familiarizing herself with their knotty problems, yet 
overcomes these fetters and penetrates that which is most hidden, she doubtless has the most 
noble courage, extraordinary talent, and superior genius,” (page 25). This was, easily, the most 
extensive and best-informed praise Germain ever received for any of her scientific work. 

In fact, her accomplishments in number theory have immortalized her eponymously. It was her 
friend and mentor Legendre who published the theorem proving Fermat’s Last Theorem for the 
special cases n = p — 1, where p is a prime of the form 8k + 7, that is today known as Germain’s 
Theorem. As Germain once said of herself, in a letter to Gauss written in 1809, her mind had a 
“great predilection for arithmetic problems” (page 121). 

Unfortunately, she had neither the same innate abilities, nor the opportunities, to perform so 
well in other branches of mathematics, particularly those that were basic for mathematical 
physics, and in fact essential for her studies of elasticity. Here, despite the celebrated prix 
extraordinaire, her work was deeply flawed. Even her biographers describe her results variously as 
“naive and unconvincing” (page 93), “slight and unsuccessful” (page 105), “inadequate and 
trivial” (page 107). But such negative evaluations concern primarily her difficulties with higher 
mathematics, especially the calculus and variational techniques. 

If Germain’s mathematical physics was less than adequate, however, how was she able to have 
won the prix extraordinaire from the French Académie des Sciences? The answer depends upon 
the state of mathematical physics at the turn of the century, and begins with the dramatic public 
lectures delivered in Paris on vibrating plates by E. F. F. Chladni in 1802. A metal disc, caused to 
vibrate, creates a diversity of different geometric patterns, and the challenge (apart from setting 
up the vibrations, which required considerable skill) was of course to account for the nodal lines 
mathematically. (See Fig. 1.1 on p. 66.) Napoleon, who was not only a member of the Institut, but 
in fact of the First Class (for physics and mathematics)—he served as President of both—was so 
impressed by Chladni’s demonstrations that he urged a prize competition be established on the 
subject. In 1809 the competition was announced, with a prize set at a kilogram of gold (about 
3000 francs) for “the development of a mathematical theory of the vibration of elastic surfaces, 
and a comparison of this theory with experiments” (page 35). 

Germain’s first paper on the subject, submitted in 1911, was also the only entry. (G. Libri, 
Germain’s friend and first biographer, conjectured that potential competitors had been discour- 
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Fic. 1.1. Chladni’s vibration modes of circular plates. 


aged from working on the problem by Lagrange’s pronouncement that the subject was extraordin- 
arily difficult and would require an entirely new kind of analysis.) In fact, Lagrange noted the 
major difficulty that Germain took as her starting point (never successfully): ““We do not even 
possess the differential equations of movement for this type of vibration in considering these 
phenomena as nature presents them” (page 43). 

Germain began with Euler’s treatment of horizontal beams, and attempted to generalize 
mathematically to the case of vibrating surfaces. Through correspondence (and perhaps actual 
meetings) with Legendre, she tried to refine this approach, but she was never successful. As 
Legendre wrote after her paper had failed to win any notice from the Académie, her attempts to 
introduce double integrals were “nowhere amenable to the substitutions you have made” (page 
54). 

However, her paper did contain a valuable assumption about the nature of the problem that 
inspired Lagrange to deduce the proper differential equation for vibrating plates. Beginning with 
the assumption that the elasticity at every point of a disc or plate could be represented by 
resistances of the form (1/r) + (1/r’), where r and r’ are the two principal radii of curvature of 
the deformed plate, Germain had produced an erroneous differential equation. 

Lagrange, however, rejected Germain’s faulty deductions, and with his own methods reached 
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the alternative equation: 


_ d*z 4 d*z d*z 
dt? dx ~~ dx? dy? dy*} 
Ly Ly 


Legendre communicated the gist of Lagrange’s conclusions to Germain, who thereafter used 
Lagrange’s correct equation for her own subsequent work on elasticity. The prize contest was 
extended until October of 1813, when she submitted a second entry, again the only one received. 
This time she presented the equation that Lagrange had deduced in response to her first paper, 
but she offered little justification for the result. Again, her analysis was so full of errors that the 
judges were not convinced by her mathematical deductions. As Legendre eventually explained to 
her in December: “I do not understand the analysis you send me at all; there is certainly an error 
in the writing or the reasoning, and I am led to believe that you do not have a very clear idea of 
the operations on double integrals in the calculus of variations” (page 63). 

But the second part of her paper aroused greater interest, and was sufficient to bring her at 
least a “honorable mention.” This was awarded for the connection she established between the 
physical, experimental data and (Lagrange’s) correct differential equation. 

Germain’s confirmation of the applicability of the mathematical interpretation of the phenom- 
ena soon inspired Poisson, apparently, to produce an impressive (but incorrect) mathematical 
theory of elasticity. His work eventually involved both Germain and Navier in a heated polemic 
with Poisson over priorities and approaches, and this constitutes, easily, the most interesting and 
impressive parts of the study produced by Bucciarelli and Dworsky. This aspect of their book, 
reflected in its subtitle, deserves careful reading, for it is filled with information about how the 
theory of elasticity developed in the 19th Century, based upon false assumptions, heated rivalries, 
erroneous metaphysics, and last, but not least, elaborate but false mathematical models of a 
particularly interesting set of physical phenomena. 

Not satisfied with Germain’s second memoir, the Academy extended yet again the prize 
competition on elasticity, setting a deadline of October, 1815. Germain, not satisfied with her 
“honorable mention,” began work on yet another memoir, this time working wholly in isolation, 
with no help (or encouragement, it seems) from Legendre or anyone else. 

Meanwhile, Poisson read a paper to the Institut in August, 1814, attempting a rigorous 
derivation of the differential equation for elasticity based on the “molecular hypothesis” (page 
75). He produced a nonlinear, more general equation than had Lagrange. Germain, impressed by 
Poisson’s reputation and unable to follow the details of his mathematical arguments, showed in 
her third memoir that her own hypothesis led to the same equation Poisson had produced! But 
this result only underscores how ineffectually Germain manipulated, but could hardly justify, the 
results she obtained. 

Germain’s third memoir on elasticity finally was awarded the prix extraordinaire in 1816 
(again it was the sole entry). Although it was “interesting in intent” (faint praise), Bucciarelli and 
Dworsky admit that it was “fundamentally deficient and wrong in execution” (page 80). 
Moreover, before Germain had even submitted her paper, Biot had already decided that Poisson’s 
memoir a year earlier had “resolved” the problem of elasticity. So the problem remains, why, at 
last, in a version hardly better than any of her earlier attempts, should Germain now have been 
awarded the prize? Was it simply a means of ending the matter once and for all? 

Germain, having won the prix extraordinaire of the Academy in 1816 (in absentia, for she 
realized the interest her appearance for the award would create, and wished to avoid any public 
spectacle), went on to publish her memoir in a further, revised form at her own expense in 1821. 
Despite revisions and reworkings, Bucciarelli and Dworsky are not hesitant to say that it still 
revealed “‘a tenuous grasp on relevant physical concepts and a less than full understanding of the 
substance and technique of the variational approach of Lagrange” (page 92). Worse still, 
publication “now displayed her incompetence” publicly (page 93). 

The published memoir also made clear her differences with Poisson. She was openly on record, 
after 1821, of opposing the “molecular mentality” and instead, advanced her own hypothesis for 
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the first principles of elasticity theory (page 94), hoping that “an informed public may weigh 
them.” Her confidant by now, Lagrange, was immediately worried that this would draw Poisson 
into open confrontation, and he hoped Germain would not “find occasion to repent of (her) 
courage,” for he doubted that Poisson would be able “to respond with dignity to this civilized 
assault” (page 96). 

Eventually, Legendre’s pessimism was fully justified. But in the meantime, work in elasticity 
was developing rapidly in the hands of the most competent mathematicians of the day— Fourier, 
Navier, Poisson and Cauchy. As for Navier’s work, Bucciarelli and Dworsky set the stage for 
growing conflict by noting that “Poisson had been upstaged in his own domain” (page 100). 
Navier, by making certain assumptions about forces between molecules, gave the first system of 
differential equations defining the behavior of a three-dimensional solid acted on by external 
forces. Poisson objected, criticizing Navier’s assumptions about molecular forces as a step 
backwards into metaphysics. Poisson also disliked the use of Lagrange’s variational techniques, 
which he felt were inappropriate in dealing with a subject that should be approached in a purely 
mechanical molecular way. 

Navier’s work had been submitted to the Academy for judgment (upon which publication 
would be approved), and Cauchy was assigned the paper. Instead of issuing an evaluation, 
however, Cauchy began to publish his own results on the subject, causing Navier to protest! But 
as Bucciarelli and Dworsky acknowledge, Cauchy’s insights went much further than anyone else’s 
at the time, and included salient definitions of stress, strain, derivations of equilibrium with 
respect to both, and to displacement of each point of an elastic body. In short, Cauchy managed 
to bring together all of the most essential features of today’s theories of elasticity (page 102). 

By now, the trend in elasticity was moving away from Germain on two fronts—Cauchy 
developing his ideas in terms of matter viewed as a continuum, Navier adopting the molecular 
approach of her rival Poisson (page 104). Germain nonetheless continued to study and publish on 
the subject, although her later efforts are also described by Bucciarelli and Dworsky as “slight,” 
and “unsuccessful,” (page 105). Work she submitted to the Académie was never formally 
evaluated because “her detailed analysis simply did not constitute first-rate work” (page 106); put 
more bluntly, the Academy simply could not approve work that was “so inadequate and trivial” 
(page 107). 

In 1828 Poisson published a grand theory meant to treat elasticity in terms of a comprehensive 
molecular vision (page 109). Navier, in response, promptly criticized Poisson for failing to cite his 
work, and also noted Germain’s “ingenious hypothesis” about the nature of elasticity. Poisson lost 
no time in pointing out the great confusion in Germain’s work, and went on to criticize Navier’s 
results, notably his boundary conditions. Navier, in rebuttal and in support of his friend, 
mentioned Lagrange’s use of Germain’s hypothesis to derive the plate equation. Even Germain 
herself felt obliged to respond, and in a general, philosophical commentary on the insufficient 
nature of the molecular hypothesis, she stressed her role in exposing its limitations. Published in 
the Annales de Chimie for 1828, her reply to Poisson struck a genuinely Newtonian note, typical 
of 19th-century positivism. She insisted that the object of mathematics was not the investigation of 
the causes of phenomena. One was restricted to the incontestable facts of elasticity. All she felt 
justified in assuming was the tendency of elastic bodies to return to their original shapes when 
stretched— nothing more need nor should be assumed about how molecules interact (page 110). 

Towards the end of her life, Germain’s circle of acquaintances apparently included Auguste 
Comte—father of Positivism—and even G. W. F. Hegel, when he visited Paris. But there seem to 
have been few contacts with any members of the Académie. Even so, she continued to write. 
Among her last scientific memoirs was one concerning the curvature of surfaces and a second on 
number theory. She wrote to Gauss about both, and the letter she sent sadly reflects the isolation 
of her life in Paris, where she saw few scientific colleagues and never enjoyed easy access to 
publications and journals. The letter, in fact, “was impressionistic and showed a lack of 
understanding of (Gauss’) work” (page 116). It is little wonder that he did not reply. 

Germain also found the energy to produce one last polemic on elasticity which was published 
in Crelle’s Journal in 1830. In part, the uprising in Paris that year may have driven her back to 
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her studies, representing the same sort of refuge she had found in science during the early years of 
the French Revolution, when she was only thirteen. But now she was fifty-four, and there was no 
future for her; she was ill and complained of the discomfort and suffering she was tired of bearing. 
A year later, in 1831, she died of cancer. 

To the extent that Germain was of special interest to her contemporaries, it was because she 
was a natural anomaly. Even Condorcet, the most liberal of the philosophes on the subject of 
women (he wrote a pamphlet On the Admission of Women to the Sufferage, anticipating John 
Stuart Mill’s famous essay, The Subjection of Women, by nearly a century), wrote that women had 
shown no genius in the higher forms of science and philosophy, which required he said, 
extraordinary brain power. Germain showed signs of genius, scientific genius, and this made her 
unique. 

For a time, especially at the beginning when many took Germain’s early efforts in mathematics 
as signs of real promise, she enjoyed a substantial following, including the prominent Legendre, 
and later the young Italian mathematician Libri. Legendre, however, seems to have abandoned her 
as the consensus over her work on elasticity theory became increasingly negative, and Libri may 
well have been attracted to her because, in a sense, they were both foreigners and outsiders. 

Libri, in fact, is an interesting case, for his relationship with Germain reveals something of the 
nature of intellectual life in Paris in the 1830’s and raises some important questions about 
Germain’s status there. Libri apparently knew Germain socially, at the end of her life. Both were 
interested in number theory, and Bucciarelli and Dworsky suggest that Germain became a kindly 
patron of sorts; eventually Libri went on to be her first biographer. 

Libri saw Germain as a very social, urbane and noble conversationalist, but this may only have 
been his way of adulating his biographee. If she could be socially at ease, even adept, with the 
likes of Comte and Hegel, as he reports, why not on similar terms with the Parisian mathemati- 
cians, whose approval she repeatedly sought, and with whom she was willing to correspond 
whenever she wished clarification on theoretical points related to her work? Were they too 
cliquish, an exclusive group within the Academy that looked down on other Frenchmen doing 
important work, let alone others? Was Libri, as a foreigner himself, more at home with Germain 
than Legendre or others of the intellectual establishment interested in number theory? 

What Sophie Germain needed, as her biographers agree, was an orderly education in mathe- 
matics and in judging the nature of mathematical questions. She needed criticism as well as 
encouragement, discipline as well as indulgence. Instead, she appears to have been regarded as a 
curiosity, a marvel. As Libri said of her, she was a “superior woman, who of all who pursued 
mathematical studies the farthest, the only one, to our knowledge, who has made real progress” 
(page 118). But in fact, where her work on elasticity was concerned, she lacked the mathematical 
facility to work with Lagrange’s variational techniques, and as a result, she could not command 
the respect of the French mathematical intelligentsia of her day. Although faint praise was 
delivered, she could not hold her own place among the most powerful minds and members of the 
Académie. As Bucciarelli and Dworsky put it, “with no way of avoiding it, and through no fault 
of her own, she looks like a fool in the world of French science” (page 97). 

As Bucciarelli and Dworsky stress, science then as now was rooted in human lives and the 
interplay between people and institutions. And yet, if anything, Germain’s life illustrates the 
problems that resulted from her failure to interact successfully either with scientists or institutions. 
Although the Académie went so far as to confer a prix extraordinaire upon her, it did so 
reluctantly, and never opened its doors to her officially. Fourier, when he became Permanent 
Secretary of the Academy in 1822 (he was also a personal friend), arranged that she might attend 
any public meetings of the Institute that she wished to, with a reserved place in the center of the 
hall as a demonstration of “all the interest that your mathematical works inspire” (page 89). But 
Fourier’s generosity in helping Germain also indicates the extent to which, formerly, she had been 
excluded from scientific meetings in Paris, despite the reputation she had acquired for herself. 

But perhaps her greatest handicap was her self-imposed isolation, and her failure to have any 
systematic interaction with the scientific practitioners of her day, either directly, or through their 
writings. Had Lagrange been on hand, even periodically, to explain the complexities of double 
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integrals, to respond to her questions, to provide coaching, or discussion of esoteric matters 
related to the problem of elasticity, she might have been able to go much further. Unlike the 
Marquise de Chastelet or Madame de Staél, Sophie Germain seems to have worked alone—per- 
haps she even preferred to work alone. But this meant that she was deprived of suggestive or 
corrective interaction with her contemporaries. Moreover, she does not seem to have read 
carefully—or to have been able to find easy access to material that would have been of immense 
value. Why she was never able to profit from Lagrange’s discussion of double integrals, presented 
in the new edition of his Mécanique Analytique in 1811, remains a mystery. 

What Bucciarelli and Dworsky have written is a very interesting book on the subject of 
elasticity and its development during the early 19th Century. But Sophie Germain, as a person of 
flesh and blood, is virtually absent. As the authors admit, Germain literally “disappears” for lack 
of any detailed biographical information. The rest is rather meagre, and her “biographers”’ are left 
to survey her work, which for the mathematician is unfortunately disappointing. On the other 
hand, Germain’s life, in the words of Bucciarelli and Dworsky, became her work, and this in turn 
“became a part of the working of the scientific community.” It is here, on the level of French 
institutions and rivalries, rather than the biography of Sophie Germain, that their book succeeds 
best. 


Dynamical Systems on Surfaces. By C. Godbillon. Translated from the French by H. G. 
Helfenstein. Springer-Verlag, Berlin- Heidelberg, Germany, 1983. 201 pp. $19.80. 


R. F. WILLIAMS 
Department of Mathematics, Northwestern University, Evanston, IL 60201 


When I first heard this title, my mind traveled back over the last fifteen or so years of 
excitement in dynamical systems. It soon became clear exactly how much of this research has been 
just that: dynamical systems on surfaces. The recent resurgence of rational mappings, as dynamical 
systems, roughly centered around Dennis Sullivan, has three continents throbbing with the latest 
news about these self maps of the Riemann sphere. And not long ago, the “pseudo-Anosov maps” 
of Thurston broke upon us with a beautiful melding of dynamical systems and quasi-conformal 
mappings. They have proved to be an important tool in several fields. 

Let us rebegin at the beginning. A dynamical system herein will be one of two types. First, a 
smooth autonomous ordinary differential equation (ODE) on a manifold usually without boundary. 
This means a vector field on a manifold. By the existence and uniqueness theorem, this is 
equivalent to a flow on the manifold. If one takes a section, i.e., a manifold of one less dimension, 
transverse to the flow, one obtains the Poincaré map of the flow which assigns to each point x in 
the section, the “first return” f(x) of the orbit through x to the section. Thus the Poincaré map is 
a diffeomorphism (f) defined on a manifold of one lower dimension. Iterating the Poincaré map 
corresponds exactly to taking the second-, third-, etc., returns of the flow to the section so that 
flow and map reflect the same properties to a large extent. Thus the second type of system is a 
diffeomorphism; evolution of an ODE corresponds to iterating a diffeomorphism. Abstractly, a 
flow is an action of the real line, R; a diffeomorphism generates an action of the integers, Z. 

A system is structurally stable (1937, Andronov and Pontrjagin) provided that sufficiently small 
perturbations (C” topology) cause no structural change in the evolution of the system. A system 
which completely lacks stability would be a poor model for reality, as reality is always a 
perturbation of what we think it is. Thus some kind of stability is crucial. 

The modern flowering of dynamical systems began when Peixoto proved that structurally 
stable flows are dense on surfaces, about 20 years ago. An earlier version (for the disk) helped 
stimulate Smale who quickly became a central force in dynamical systems. His work on 
““Morse-Smale”’ systems is largely motivated by examples on surfaces. But the “Smale horse shoe”’ 


1985] REVIEWS 71 


is truly 2-dimensional in conception. 

This example is both beautiful and baffling; since I have seen many talented expositors try to 
give a simple explanation, I will give only an outline. 

The basic notion is the stable manifold, and the simplest instance of this is a hyperbolic linear 
mapping of the plane; i.e., one with two real eigenvalues, one inside and one outside the unit 
circle. The stable manifold of such a map is the line in the eigen-direction corresponding to the 
smaller eigenvalue. It is characterized as the set of all points which under iteration of the map tend 
(exponentially) toward the origin. Dually, the unstable manifold consists of those points which 
tend away, under iteration and is the line in the other eigen-direction. All other orbits of points 
can be understood in terms of the unstable and stable manifolds, as they travel along hyperbolas, 
with these lines as asymptotes. 

Now a nonlinear map leaving the origin fixed, and with derivative a hyperbolic map as 
mentioned above, will also have a stable and unstable manifold. At the origin these manifolds are 
tangent to the stable and unstable manifolds of the derivative, but far away they can bend 
depending upon the nonlinear map. Poincaré knew of examples of nonlinear maps in which the 
stable and unstable manifolds cross one another—and he knew that in such cases, all hell breaks 
loose. Such a point of intersection is called a homoclinic point of the fixed point; the orbit of a 
point of intersection is called a homoclinic orbit. Note that any homoclinic orbit tends to the fixed 
point under both forward and backward iterations of the map. 

Poincaré felt that the resulting dynamics was so complicated as to defy analysis. But this is just 
what Smale did with an example on the 2-dimensional sphere now universally called the Smale 
“horseshoe” because it maps a square into a horseshoe shape, extending outside the original 
square. Smale proved that this example 1s structurally stable and codified its dynamical properties 
as being isomorphic to the Bernoulli 2-shift. This last is the probability space usually employed to 
describe the experiment of tossing a coin. 

Smale’s “horseshoe” marks the first step toward understanding “chaotic” or “strange” 
systems. 

At about this time Smale wondered whether every dynamical system must have an “attractor.” 
This word was introduced in about 1966 (Smale or Thom?) to mean a “generalized sink;” that is, 
an invariant set A toward which all points in some neighborhood tend under iteration of the 
system. (An attractor should also have a dense orbit.) Thom pointed out that the linear hyperbolic 


map (2 ‘), when passed down to the torus, has no attractor. This map, so simple as to be linear, 


yet complicated enough to have a global hyperbolic structure, inspired Anosov’s work on “Anosov 
diffeomorphisms,” and later the pseudo-Anosov maps mentioned above. 

In this case, a “global hyperbolic structure,” is easily understood in terms of linear algebra. The 
dynamics are complicated because in essence, all points are homoclinic. Let U denote the family of 
all lines parallel to the unstable eigen-direction in the plane, and let S be those lines parallel to 
the stable eigen-direction. Both of these families foliate the plane and (as families) are invariant 
under the linear map; this map stretches the lines in U and shrinks those in S. Now these families 
of lines pass down to families of lines on the torus because the eigen-directions have irrational 
slopes. Though each line is now densely winding throughout the whole torus, the lines are 
stretched or shrunk by the map on the torus just as before. 

Though the appellation “strange attractor” came later, the first two “strange” attractors were 
discovered at the end of 1966 by Smale. The more famous Smale’s “DA” or “derived from 
Anosov” map is a diffeomorphism of the torus with a 1-dimensional attractor. (It appears that 
until their discovery there had been no need for the concept “attractor.”) Today the phrase 
“strange attractor” is widely used by physicists and engineers as well as by dynamicists. It was 
coined by Ruelle and Takens in the essentially infinite dimensional context of turbulent flow. But 
much of the current excitement concerns—you guessed it—a two dimensional system, the 
“Heénon attractor.” This is a diffeomorphism of the plane in which the two coordinate functions 
are polynomials in x and y, one linear, the other quadratic. 

When I finally had Godbillon’s book in my hands, I was surprised to find that none of these 
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seven topics was mentioned. In all fairness, this is partly because Godbillon’s book is elementary. 
But the rea/ reason is that this is a book about foliations, not dynamical systems. True, titles often 
reflect fashion more than content. But I cannot help regretting the demise of the book this title 
conjured up in my mind. 

Dynamical Systems on Surfaces is in Springer’s “Universitext” series. And this book could be 
used as a text leading up to more advanced material on foliations.* The prospective teacher is 
warned that there is no index and that the exercises are spotty, both in frequency and to a lesser 
extent, relevance. 

The most glaring omission is the stable manifold theorem (see p. 104). In this connection, 
Godbillon apologizes that there is a missing chapter—“which would have been dealing with 
structural stability.” But there is no way to do structural stability without the stable manifold 
theorem, and to include this missing chapter would have meant rewriting the whole book. 

In summary, this is an elementary introduction to such topics as flow boxes, rotation numbers, 
stability criterion near singular points, the index of a vector field, and geometric analysis of 
singular points. Hartman’s theorem, that a smooth hyperbolic map is locally conjugate to its linear 
part near its fixed point is proved but only in the contracting case. Attracting periodic orbits are 
discussed, but not the “in phase” theorem. 

The “in phase” theorem is less widely understood than it should be; it is a beautiful result. A 
periodic orbit A of an ODE is said to be attracting provided it has a neighborhood within which 
all orbits tend toward A as required by our general definition of attractor above. Taking a 
Poincaré section yields a diffeomorphism with fixed point where the section intersects the periodic 
orbit A. If the Poincaré map is a contraction (this is independent of the choice of section) at the 
fixed point, then A is an attracting periodic orbit. Godbillon goes this far. 

But the same hypothesis on the Poincaré map implies much more: each point in this 
neighborhood is “in phase” with some point on the periodic orbit, so that not only do nearby 
orbits tend to look periodic after a while, but in addition, each point in a neighborhood is 
associated with or “in phase” with a certain point on the periodic orbit. The set of all points “in 
phase” with a point x is the (strong) stable manifold of x, and has codimension one. These strong 
stable manifolds foliate the neighborhood, and the flow moves leaves of the foliation along into 
other leaves. 

The Denjoy theorem about C? or better flows on the torus and his counter example for C’, is 
given, partly as a sequence of exercises. But along with these there is much technical material 
designed for the specialist in foliation theory. And much, much more, central to dynamical 
systems, is left out. 


*This reviewer recommends Palis-deMelo, Geometric Theory of Dynamical Systems, Springer, 1982, for a course 
in dynamical systems. 


LETTERS TO THE EDITOR 


Material for this department should be prepared exactly the same way as submitted manuscripts (see the inside 
front cover) and sent to Professor P. R. Halmos, Department of Mathematics, University of Santa Clara, Santa 
Clara, CA 95053. 


Editor: 
Professor Robert C. Thompson’s essay on inflation [5] does not reflect the mathemat- 


ical literature concerning his subject, specifically numerous undergraduate texts, for instance 
Wonnacott and Wonnacott [8, p. 274]. Professor Thompson’s “true growth rate” is in fact the 
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Fisher effect, published in 1896 by the famous economist and mathematician Irving Fisher [1], 
and still read by economics students in his classic reference masterpiece [2, p. 39]. Indeed, with r, 
i, and g denoting the rates of inflation, interest, and growth of purchasing power— Professor 
Thompson’s “true growth rate”—the Fisher effect states precisely that g = (i — r)/(1 +r). 

Professor Thompson also asserts in [5, p. 208] that this formula “isn’t known to a good many 
professionals in finance [...].”’ Evidence to the contrary abounds: introductory economics and 
finance classes learn the Fisher effect from texts such as Wonnacott [7, p. 298] and Van Horne [6, 
p. 125], respectively. Even engineers see this economic model, for example in Newnan’s textbook 
[4, p. 334]. Both Newnan and Wonnacott present numerical examples, and one would have 
illustrated Professor Thompson’s composition quite nicely for students. Actuaries, too, derive the 
relation g = (i — r)/(1 + r) when they study their classic text by Kellison [3, Problem 13, pp. 26 
& 235] to drill for the fourth of their ten professional examinations. Then they apply Fisher’s 
formula to annuities-immediate in geometric progression [3, p. 85, Equation (4.40)], which Professor 
Thompson calls “strings of inflating payments” [5, p. 209]. 
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AUTOMATED REASONING* 


L. WOS 
Mathematics and Computer Science Division, Argonne National Laboratory, 9700 S. Cass Avenue, Argonne, 
IL 60439 


1. Introduction. Two of the principal activities of mathematicians and logicians are the search 
for a proof of some lemma or theorem and the search for some model or counterexample. Until 
recently, both activities were conducted by the researcher alone or conducted with a colleague or 
colleagues with a similar interest. Recent occurrences suggest that another alternative now exists. 
The alternative is that of relying on the assistance of a computer program, an automated 
reasoning program. One such automated reasoning program, AURA (for AUtomated Reasoning 
Assistant) [12], has in fact provided valuable assistance in answering certain previously open 
questions from the fields of ternary Boolean algebra [14], finite semigroups [16], [17], and 
equivalential calculus [15], [21]. The capacity of this program both to find proofs and to generate 
models and counterexamples played a vital role in answering the open questions. AURA is not the 
only automated reasoning program that has been used to obtain new results in mathematics. 
Slightly more than 15 years ago, a reasoning program [3] of a distinctly different design proved a 
lemma in lattice theory that had not been previously known. Just as the computer has been used 
for years to assist in certain numerical aspects of mathematics, it can now be used to assist in 
various reasoning aspects. 

Since an automated reasoning program can provide such valuable assistance in diverse fields of 
mathematics and formal logic, you might wish to know more about it. 


1. What is automated reasoning? 

2. What has been achieved with the assistance of an automated reasoning program? 
3. How much work is required to use such a program? 

4. Is such a program available? 

5. Which types of research are amenable to assistance from such a program? 


By answering these questions, I intend to satisfy the curiosity of some, pique the curiosity of 
others, and, most important, enlist the aid of both mathematicians and logicians in finding 
additional open questions to attack with the assistance of an automated reasoning program. 
Equally, I wish to interest members from various fields to begin actively using a reasoning 
program as a research assistant. 


2. The Elements of Automated Reasoning. Automated reasoning is concerned with the dis- 
covery, formulation, and implementation of concepts and procedures that permit the computer to 
be used as a reasoning assistant. One of the primary objectives is the design and implementation 
of a computer program—often called a theorem-proving program—that automates the process 
known as reasoning. By reasoning, we mean the process of drawing conclusions rigorously, as in 
mathematics and in formal logic. Reasoning here means logical reasoning, not probabilistic or 
common-sense reasoning. When a conclusion is drawn from some given set of assumptions, the 


Lawrence Wos received his Ph.D. in mathematics under Reinhold Baer at the University of Illinois in 1957. Since 
then, he has conducted research in various aspects of automated theorem proving and automated reasoning at 
Argonne National Laboratory, Argonne, Illinois 60439. He introduced the notion of “strategy” into the field. In 
collaboration with one of his colleagues, he answered (previously) open questions in abstract algebra and in formal 
logic. The questions were answered by relying heavily on the automated reasoning program AURA developed jointly 
at Argonne and Northern Illinois University. In recognition of his achievements, Wos (together with his colleague) 
was awarded the Mathematics Prize for the best current research in automated theorem proving. 


*This work was supported by the Applied Mathematical Sciences subprogram of the Office of Energy Research 
of the U.S. Department of Energy under Contract W-31-109-ENG-38. 
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conclusion must follow logically and inevitably. If the conclusion itself represents false informa- 
tion, then one of the assumptions from which it was obtained is false. How does an automated 
reasoning program derive its logical power and rigor? 

A program of the type under discussion reasons by applying well-defined inference rules to 
statements called “clauses”. (The discussion of automated reasoning programs given here is 
influenced by our particular programs. Certain features, for example the use of “clauses” and of 
“weighting”, are often not available, especially in those programs with a quite different design.) 
For the concept of a “clause”, first note that, informally, an “atom” is a statement free of logical 
operators. A “literal” is an atom or an atom preceded by the “not” operator. Finally, a “clause” 
is the “or” of a (possibly empty) finite set of literals. For example, the axiom of left identity in a 
group—the product of e and x is x for all x—can be written as the clause 


PRODUCT(e, x, x), 


which contains a single literal. Where “—” denotes “not” and “|” denotes “or”, the statement 
“the subset H is closed under addition” can be written as the clause 


— H(x)|—H(y) | H(sum(x, y)) 
in which the first two literals are called “negative literals”. This clause, which can be read “if x 
and y are in H, then their sum is in H”’, is obtained by using the fact that “if P, then Q” is 
logically equivalent to “not P or Q”’. 

Each inference rule selects from the available information some number of statements, each of 
which satisfies certain syntactic requirements, and yields a conclusion only if it satisfies certain 
(possibly different) syntactic requirements. For example, one inference rule employed by rea- 
soning programs yields a conclusion only if it contains a single literal. Another inference rule 
requires that the conclusion contain only positive literals, placing no restriction on their number. 
Many inference rules that are employed are generalizations of the familiar rules of modus ponens 
and syllogism. Still others generalize equality substitution. For example, an inference rule exists 
that, when applied to the equations “x + —x = 0” and “y + (—y +z) =z”, yields in a single 
step “y + 0 = —(—y)”. In clause form, from 


EQUAL(sum( x, minus( x)), 0) 
and 
EQUAL(sum( y, sum(minus( y), z)), z) 
the clause 
EQUAL(sum( y,0), minus(minus( y))) 


is obtained. To be acceptable, an inference rule must be sound: the conclusions obtained with it 
must be logical consequences of (the conjunction of) the statements to which it is applied to yield 
them. From the viewpoint of model theory, an inference rule is sound if, when it is applied to a set 
of premisses to yield a conclusion, the conclusion holds in any model that satisfies the premisses 
from which it is obtained. 

When an automated reasoning program draws a conclusion, as in the spirit of high school 
geometry, it cites the specific and immediate ancestors of the conclusion, thus enabling you to 
check its work. All steps of a chain of reasoning are made explicit; you can check any or all steps 
of a deduction simply by executing the same algorithm that was executed by the reasoning 
program. Any proof obtained by a reasoning program can, therefore, be refereed in a straightfor- 
ward though tedious manner. 

But how does a reasoning program recognize that a proof has been obtained? By far the most 
common form of proof obtained by an automated reasoning program is a proof by contradiction. 
As often occurs in mathematics, you begin by stating the theorem under study and assume it to be 
false. If the symbolic form of the theorem is “if P, then Q”, you assume P true and Q false. 
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Should “if P, then Q” be a theorem, making this assumption yields a self-contradictory set of 
statements. Such a set of statements is called an “unsatisfiable’’ set. In the typical case, an 
automated reasoning program is presented with an unsatisfiable set of statements and asked to 
establish the unsatisfiability. Such a program recognizes it has completed a proof when it finds 
two statements that obviously contradict each other. For example, the program might eventually 
deduce that, for some element a, a is not equal to a. Since the reflexivity axiom of equals, x = x, 
is ordinarily included in the submission of a problem to a reasoning program, deducing that a is 
not equal to a immediately signals a contradiction. 

How does a reasoning program search for a proof, given statements that correspond to 
assuming P true and Q false? After all, in mathematics and in logic a random and undirected 
search seldom succeeds. Since the computer is so fast, does a reasoning program simply conduct 
an exhaustive search until the contradiction is discovered, discovered more or less by accident? 
By no means, for an attempt to prove even the most uninteresting theorems with an exhaustive 
search produces a tremendous number of facts, almost all of which are irrelevant. Even though the 
computer is very fast, such a search requires far too much time. What occurs instead, when using 
an automated reasoning program, is the employment of “strategy”. Some strategies, called 
“ordering strategies”, direct a reasoning program’s attack on a problem, employing criteria for 
selecting the information on which to focus next. Some strategies, called “restriction strategies”, 
restrict a reasoning program’s attack, totally prohibiting certain paths from being explored. 

An example of an ordering strategy is “weighting” [9]. You can assign “weights” to the various 
concepts in a problem, enabling a reasoning program to give priorities to the concepts. The 
program selects the information on which to focus next according to these priorities, selecting the 
least complex information for focus. You can assign weights to define “complexity” strictly in 
terms of the number of symbols in an expression. On the other hand, you can choose the opposite 
strategy—define “complexity” so that the more symbols in an expression, the more it is preferred 
for being considered next. As another alternative, you can assign weights so that multiplication is 
preferred over sum, so that inverse is preferred over identity, and so that the cube of an element is 
preferred over its square. Weighting allows you to impose your knowledge and intuition on the 
program’s attack on a problem. 

An example of a restriction strategy is the “set of support strategy” [18]. This strategy is in 
effect often employed by mathematicians and logicians, although not with an official name. The 
strategy attempts to take full advantage of the assumed consistency or satisfiability of the set of 
statements that corresponds to P, where the theorem under consideration has the form “if P, then 
OQ”. That set of statements—call it P’— is satisfiable, for its correspondent P typically consists 
of the axioms for some domain of inquiry together with certain additional hypotheses. For 
example, if you were studying Boolean rings—rings in which, for all x, the product of x with 
itself is x—then the set P would consist of the axioms for a ring together with a statement that 
the ring is Boolean. 

Imagine that you wish to prove that such a ring is commutative, that xy = yx for all x and y. 
You would begin by assuming the theorem false—assuming that there exist elements a and b in 
the ring with ab not equal to ba. You would then have three sets of clauses—A’ which consists of 
the clauses that correspond to the axioms for a ring, S’ which consists of those that correspond 
to the Boolean property, and D’ which consists of those that correspond to assuming such a ring 
is not commutative. The set P’ would then be the union of A’ and S$’. 

The idea behind the set of support strategy is to prevent an automated reasoning program from 
randomly selecting clauses from P’ from which to draw conclusions. Such a random search for 
new information simply explores the theory of Boolean rings, rather than concentrating on the 
theorem to be proved, namely, such rings are commutative. Further, since the objective is to 
obtain a proof by contradiction, and since this forward search from P’ produces additional 
consistent information, the proof will be found only by accident— by accidentally proving 
commutativity. When used as recommended, the set of support strategy restricts the actions of a 
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reasoning program by preventing it from applying an inference rule to a set of premisses all of 
which are in A’, the correspondent of the set of axioms. 

To use the set of support strategy, you choose a subset T of the set S of clauses that represents 
the problem to be studied. In the spirit of the example just discussed, the recommended choices 
for T are either the union of S’ and D’, or D’ alone. For a set of premisses from among the 
elements of S to be considered by an inference rule, the set is required to contain at least one 
element of 7. By imposing the strategy on a.reasoning program’s search for information, the 
program is forced to key (at least recursively) on the elements in the chosen subset T. In the 
example from Boolean rings, the recommended choices for T cause the program (recursively) to 
key on the Boolean property or on the fact that ab is assumed different from ba. All new 
information is therefore traceable to S’ or to D’. If any lemmas from ring theory are supplied as 
part of the problem description, in keeping with the intent of the strategy, they are not included in 
T. The choice of T as the union of S’ and D’ is reminiscent of the practice in mathematics and 
logic, for it causes an automated reasoning program to emphasize the role of the special 
hypothesis of the theorem—to key on those statements that are part of P but are not the axioms 
and lemmas that are supplied. 

An automated reasoning program of the type under discussion relies on two additional 
processes to enhance its effectiveness in finding proofs and in generating models and counterex- 
amples. The first process, known as “subsumption” [10], enables a reasoning program to prefer 
general information to specific information. When presented with two statements such that one is 
captured by the other as a trivial corollary, the program discards the corollary. For example, in 
the ring problem cited earlier, if the program discovers that x +x =x (for all x) and that 
a + a= a, then the second discovery is discarded in favor of the first. (Typically, the lemma that 
x +x =x for all x is in fact found and used in the proof that Boolean rings are commutative.) 
The order of the discoveries is not relevant, only the relative generality of the two statements. By 
means of subsumption, an automated reasoning program performs as a mathematician or logician 
often does, retaining the theorem and discarding the trivial corollaries. 

The other process that enhances effectiveness is “demodulation” [20]. Through demodulation, 
a reasoning program normalizes and simplifies information. For example, with the appropriate 
information present, the program will replace 0 + x by x, —(—x) by x, (xy)z by x (yz), and 
xy +xz by x(y +2). In other words, demodulation enables a reasoning program, if so in- 
structed, to remove the (additive) identity from expressions, to cancel the inverse of the inverse, to 
right associate all expressions, and to use distributivity to write certain expressions in a factored 
form. The information for normalizing and simplifying can be supplied to the program at the 
beginning, and this information can be added to or found by the program as it conducts its search 
for the proof or for the model or counterexample. Despite the existence of a number of as yet 
unanswered questions concerning the use of demodulation, the process is a most valuable and 
often necessary feature for solving problems. 


3. Successes. The attempt to answer open questions with the assistance of an automated 
reasoning program often benefits both fields—the field of mathematics or logic from which the 
problem is selected, and the field of automated reasoning. The various obstacles that are 
encountered, if overcome, lead to enhancements of existing techniques or even to totally new 
techniques for attacking problems. The value of finding open questions that are amenable to 
consideration by a reasoning program cannot be overestimated. In fact, your assistance in the 
search for such open questions is strongly encouraged. Equally, your use of a reasoning program 
in your own research could be beneficial to both mathematics and to automated reasoning itself. 

The first open question that was answered with the aid of the automated reasoning program 
AURA (AUtomated Reasoning Assistant) [12] concerns the possible dependence of certain 
axioms for a ternary Boolean algebra [14]. A ternary Boolean algebra is a nonempty set satisfying 
the following five axioms, where the function f can be thought of as a three-place “product” and 
g as inverse: 
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(1) f(f(v, w, x), vy, f(v, w, Z)) = flv, w, f(x, y, Z)) 
(2) f(y, x,x) =x 

(3) f(x,y, g(y)) = x 

(4) f(x, x,y) =x 

(5) f(g(y), yx) = x 


The precise question to be answered is: Are any of the first three axioms independent of the 
remaining four? (The fourth and fifth were already known to be dependent.) Each of the first 
three was proved independent by finding a model satisfying the remaining four but failing to 
satisfy the axiom in question. It was this study of ternary Boolean algebra that led to the 
technique for generating models and counterexamples [13] using an automated reasoning pro- 
gram. 

The next open question that was answered [16], [17] with the assistance of AURA concerns the 
possible existence of a finite semigroup admitting certain mappings in combination. Specifically, 
does there exist a finite semigroup admitting a non-trivial antiautomorphism but admitting no 
non-trivial involutions? An antiautomorphism is a one-to-one onto mapping h with h(xy) = 
h(y)h(x). An involution is an antiautomorphism whose square is the identity. To be non-trivial, 
the mapping must not be the identity map. Commutative semigroups are of course not of interest 
for, in such a semigroup, an antiautomorphism is simply an automorphism. There does exist a 
non-commutative semigroup with the desired properties, and the first found has order 83 [16]. The 
smallest such semigroup [17] has order 7, and there are four such semigroups. Since there are over 
800,000 semigroups of order 7, finding those four could present serious difficulties were one to 
simply sort through them looking for appropriate semigroups. The open questions of existence 
and minimality were answered by using generators and relations, relying on multiplication tables, 
considering possible isomorphisms, and applying “without loss of generality” arguments. The 
capacity of AURA both to find proofs and to generate models was required. 

Other questions that were answered [15], [21] by relying on AURA concern the classification of 
formulas in the equivalential calculus. The equivalential calculus is that field of formal logic that 
studies the abstraction of “equivalence”. In particular, the formulas 


(1) E(x,x) (reflexivity) 
(2) E( E(x, y), E(y,x)) (symmetry) 
(3) E( E(x, y), E(E(y,z), E(x, z))) (transitivity) 


taken together can serve as a complete set of axioms for the calculus. (The designations of 
reflexivity, symmetry, and transitivity, not ordinarily used in equivalential calculus, are chosen to 
provide an intuitive understanding.) The elements of equivalential calculus are the expressions 
that can be recursively well-formed from the variables, x, y, z, w,..., and the two-place function 
E. In the equivalential calculus, there exist individual formulas that are sufficiently strong that 
they serve as single axioms for the entire calculus. The shortest single axiom for the calculus 
contains 11 symbols, excluding grouping symbols and commas. Of the 630 such formulas of 
length 11, at the time we began our study, eleven were known to be shortest single axioms and 
seven remained unclassified with respect to this property. We decided to consider the correspond- 
ing open questions: Are any of the following seven formulas strong enough to serve as a single 
axiom for the equivalential calculus? 


(1) XJL = E(x, E(y, E(E(E(z, y), x), 2))) 

(2) XKE = E(x, E(y, E( E(x, E(Z, y)), Z))) 

(3) XAK = E(x, E(E(E(E(), z)), x),2),¥)) 
(4) BXO = E(E(E(E(x, E(y,Z)), 2), ¥),*) 
(5) XCB = E(x, E(E( E(x, y), E(Z, y)),2)) 
(6) XHK = E(x, E(E(y, z), E( E(x, Z), y))) 
(7) XHN = E(x, E(E(y, z), E(E(z, x), y))) 
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In particular, it was conjectured that no additional shortest single axioms existed—that the 
remaining unclassified seven formulas are each too weak to serve as a single axiom. The study 
revealed that the first five are each too weak, but, contrary to the conjecture, each of the last two 
is in fact strong enough [15] to be a single axiom. To obtain these results, the model generation 
technique used successfully to answer the preceding questions was deemed unsuitable. The 
approach we chose was to seek a complete characterization of all deducible theorems from each 
formula under study. The characterizations were obtained by devising a means for an automated 
reasoning program to employ schemata [21]. Again the study of an open question led to a new 
technique for using an automated reasoning program. The proof that XHK is a single axiom 
consists of 87 steps, including steps containing 71 symbols not counting grouping symbols and 
commas. The proof that XHN is a single axiom consists of 162 steps, including steps containing 
103 symbols. An indication of the difficulty of completing these proofs is provided by the fact that 
there are more than 100,000,000,000 formulas containing 27 symbols. 

The program AURA, as well as various other automated reasoning programs, has also been 
used in various studies only distantly related to mathematics and formal logic. The program has 
been used to design logic circuits, some superior to those already known. It has been used to 
validate the design of existing circuits also, and to experiment with the operation of control 
systems. Reasoning programs have been used to prove given claims for computer programs. 
Finally, the possible use of such a program for chemical synthesis is currently being investigated. 


4. Use and Availability. The biggest obstacle to using an automated reasoning program is 
language. As with various fields of mathematics and of logic, automated reasoning has its own 
notation. The most common notation relies on the use of “clauses” [19]. The clause language is 
very closely related to the first-order predicate calculus. The language treats variables as meaning 
“for all’, so it is trivial to say that “the square of every element is the identity”, for example. 
When you wish to express that something “exists”, depending on the nature of the statement, you 
employ a function of the appropriate number of variables or employ a constant. In addition, the 
conjunctive normal form is used— all statements are implicitly assumed to be separated by “and”, 
while all literals of a statement are separated by “or”. The only other operator that is acceptable is 
“not”. Nevertheless, the language is more than sufficient for attacking a wide variety of problems. 
In fact, the language of clauses is extremely powerful, as we discovered in the past few years. 
Relationships involving “if then” and “equivalent” present no problem, for they are simply 
transformed into statements relying on “or”, “not”, and “and” having the same meaning. Despite 
the power and richness of the language, the presentation of a problem to an automated reasoning 
program is an art. 

In addition to the representation problem, there are closely related problems of choosing the 
appropriate rules of reasoning and of choosing the strategies to control the rules [19]. Good 
choices of representation, inference rule, and strategy are extremely interconnected. The choices 
must not be made separately. Then there is the problem of providing the appropriate normaliza- 
tion and simplification information. Even with these obstacles, as evidenced earlier, an automated 
reasoning program can be successfully used to solve problems, some of which are quite difficult. 
The value of such an automated assistant can be seen by noting that the researchers who solved 
the open questions knew remarkably little about the areas from which the problems were selected. 

You need not be an expert in automated reasoning to use such a program. The studies in 
circuit design, circuit validation, operation of control systems, and chemical synthesis are being 
conducted by people with little knowledge of automated reasoning. The people in question were 
able to initiate these studies by relying on manuals and by consulting with experts in automated 
reasoning. Even with the various successes and the current activities, we must again say that the 
use of an automated reasoning program is at present an art. 

Until recently, the available written material consisted of manuals and formal texts on the 
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subject. The book “Symbolic Logic and Mechanical Theorem Proving” by Chang and Lee [2] is a 
very readable and rather formal introductory text. It covers well the topics of representation, 
inference rule, strategy, demodulation, and subsumption. The book “Logic for Problem Solving” 
by Kowalski [4] is somewhat less formal, and is particularly valuable for questions of representa- 
tion. It covers applications such as state-space problems, and introduces the topic of logic 
programming with specific attention to the programming language of Prolog. The book “Auto- 
mated Theorem Proving: A Logical Basis” by Loveland [5] presents a complete and thorough 
treatment and in a very formal manner. It contains much material on the logical foundations of 
the subject, including various completeness theorems. The book “Automation of Reasoning, 
Volumes 1 and 2, Classical Papers on Computational Logic”, edited by Siekmann and Wrightson 
[11], contains the important research papers on the subject written before 1971. The editors have 
provided a valuable guide to this collection by designating which of the papers deserve special 
attention. The book “Automated Theorem Proving: After 25 Years”, edited by Bledsoe and 
Loveland [1], contains a number of papers presented on the subject at the winter AMS meeting in 
1983. It includes Loveland’s paper [6], which reviews the previous 25 years of automated theorem 
proving—the field from which automated reasoning evolved—and the two papers given by the 
respective winners of the prize for milestones and the prize for current achievement in automated 
theorem proving. 

With the intention of reaching a much wider audience than that aimed at by each of the 
preceding books, and to provide material not available elsewhere, a fifth book has just been 
completed. The book, “Automated Reasoning: Introduction and Applications” by Wos, Over- 
beek, Lusk, and Boyle [19], assumes no background. It covers various applications— mathemati- 
cal, logical, and others—of the type discussed in this paper and, moreover, provides some 
guidance for using an automated reasoning program. 

Very powerful and effective portable reasoning programs are now available [7], [8]. These 
programs, written in Pascal, can be run on relatively inexpensive computers, provided an 
appropriate Pascal compiler is accessible. They offer the researcher an arsenal of inference rules, 
strategies, and related processes from which to choose. Those who wish to experiment with and 
conduct research with an automated reasoning program can now do so. 


5. Amenable Questions. An open question is amenable to attack with the assistance of an 
automated reasoning program if it is representable in the language of clauses. Being representable 
in first-order predicate calculus is sufficient, for an algorithm exists for mapping from that 
language to the language of clauses. Even better, an open question is amenable to such an attack if 
it can be represented by a finite set of equations. Specifically, if the structures under study are 
finitely based varieties—structures such as semigroups, groups, and rings—an automated rea- 
soning program might well provide valuable assistance. Mappings of various kinds are perfectly 
acceptable. So also are finite sets of generators and relations. On the other hand, the statement 
“the structure is finite’ cannot be represented in the language of clauses, and the fact that it is 
infinite can be represented but not in a useful fashion. To talk about “all mappings from one 
structure to another” is not possible, and to talk about “there exists a mapping” requires a trick. 
The trick is similar to one used in mathematics; we simply name the mapping in question. 
Problems in differential equations, for example, are not well suited to consideration. In short, 
problems similar to the ones discussed earlier are the best bet. 


6. Summary. You are not required to be an expert in automated reasoning to use such a 
program. Its use is difficult, especially from the linguistic viewpoint. In fact, “the art of automated 
reasoning” captures the flavor of the present state of the discipline. Nevertheless, a number of 
activities are now benefiting from reliance on a reasoning program. They include research in 
mathematics and logic, the design and validation of logic circuits, and proving that computer 
programs fulfill certain claims made for them. 
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What is needed is additional open questions to attack with the assistance of a reasoning 
program. Such questions will lead to enhancements of existing techniques and the development of 
totally new techniques for problem solving. What is also needed is the use by mathematicians and 
logicians of automated reasoning programs. If open questions can be solved by researchers who 
know so little about the respective fields from which the problems were taken, how much more 
might be accomplished by experts in various fields with the assistance of such a program! 

A reasoning program provides an explicit treatment of each proof and each model or 
counterexample it obtains. It offers a wide variety of inference rules, strategies, and related 
processes from which to choose when attacking a given problem. In view of the successes and 
properties cited here, perhaps mathematicians and logicians might strongly consider enlisting the 
aid of a different type of colleague—a computer program that functions as an automated 
reasoning assistant. 
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138. MISCELLANEA 


Just as surely. as our understanding of Nature is really valid only to the extent that it is 
mathematical, so also our understanding of higher domains must be based on mathematical 
models. 


—R. Steiner (1861-1925), quoted in Mathematical Reviews 82k : 51001. 


Each natural science contains genuine Science only to the extent that it contains 
Mathematics... . It may be that a formal philosophy of Nature in general, that is, a philosophy 
that deals only with general concepts, is possible without Mathematics, but a formal natural 
science dealing with definite objects (whether Physics or Psychology) is possible only with the use 
of Mathematics, and since each natural science contairis only as much genuine Science as it 
contains a priori knowledge, it follows that a natural science is a genuine Science only to the 
extent that Mathematics can be applied to it. 


—Immanuel Kant, Metaphysische Anfangsgriinde der Naturwissenschaft, 1876. 


Perhaps we should sometimes listen to the other side. Goethe, for example, did not like 
Mathematics. 


Mathematicians are amazing people. In virtue of their accomplishments, they have set 
themselves up as a universal guild and will acknowledge only what suits their circle, what their 
method of organization can produce. A prominent mathematician once said, when someone 
strongly recommended a topic in Physics, “But can’t it be reduced to calculation?” 

—Goethe, Maximen und Reflexionen, no. 1277 

It surely does not follow that the hunter who kills the game must also cook it. A cook might go 
hunting and shoot well; but he would be badly mistaken if he claimed that only a cook can be a 
good shot. It seems to me that this is the situation of mathematicians who claim that nobody can 
understand or discover physical phenomena without being a mathematician, since they should be 
pleased enough if the meat is brought to their kitchen for them to lard it with formulas and dress 
it as they like. 

— Goethe, Maximen und Reflexionen, no. 1280 


ANSWER TO PHOTO ON PAGE 93 


The photo is of Louis de Branges, who proved in 1984 the conjecture Bieberbach made in 1916, 
by establishing a strong inequality proposed by the Russian function-theorist I. M. Milin. 


SOLVABILITY OF INFINITE SYSTEMS OF POLYNOMIAL EQUATIONS 
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The purpose of this paper is to show that if every finite subsystem of a system of less than 
continuum many polynomial equations over the field of complex numbers has a solution, then the 
entire system has a solution. This is not the case over the field of real numbers even if the system 
is only countable, and it is not the case over the field of complex numbers if the system has 
continuum many equations. 

It is known [1] that if every finite subsystem of an infinite (countable or uncountable) system L 
of linear equations over a field F has a solution in F, then L has a solution in F. For systems of 
polynomial equations of higher degree the facts are different. For instance, over the field R of real 
numbers, let us consider the countably infinite system of polynomial equations in the unknowns 


Xs Vis Vas Varree! 
(x-l-yp=0 
(x — 2) — yy =0 
(1) (x —3)-—yZ =0 


(x — m) — yn, = 0 
Every finite subsystem of (1) has a solution in R. To prove this, it is enough to show that, for 
every positive integer m, the finite subsystem of (1) consisting of the first m equations of (1) has a 
solution, and that is so because 


1/2 1/2 1/2 
x=mM, y=(m-1)”, vy =(m—- 2)”, yy =(m-3)”,..., Ym = 9 


gives a solution of the first m equations of (1). The entire system (1), however, has no solution in 
R. Reason: if (1) had a solution in R yielding, say, x = r, then for a positive integer n > r the 
nth equation (x — n) — y’ = 0 could not have a solution in R since the square of a real number 
cannot be negative. This situation could not have happened over the field C of complex numbers. 
Indeed, 


x=0, y, =i, yy = 277 i,..., y, =n? i 


is a solution in C of the entire system (1). 
On the other hand, over the field C of complex numbers let us consider the system of 
continuum many polynomial equations given by: 


(2) (z-—c)y,-1=0 with c € C. 


Not only every finite subsystem of (2) but every proper subsystem of (2) has a solution in C. 
Indeed, if P is a proper subsystem of (2) such that, say, for some a in C the equation 


Alexander Abian: I received my M.S. from the University of Chicago in 1954 and my Ph.D. from the University 
of Cincinnati in 1956 under I. A. Barnett. I taught briefly at the University of Tennessee and at Queens College, NY, 
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(z — a)y, — 1 = 0 does not appear in P, then a solution of P is given by z = a and y, = (a — 
c)~', which exists since a # c. Nevertheless, the entire system (2) has no solution in C. Reason: if 
(2) had a solution in C yielding, say, z = u, then the equation (z — u) y, — 1 = 0 would have no 
solution in C. 

In contrast to the case of a countably infinite system of polynomial equations over the field of 
real numbers as well as a system of continuum many polynomial equations over the field of 
complex numbers, this paper shows that if a system of less than continuum many polynomial 
equations (each with finitely many unknowns, of course) such as for instance 


a,x? + ayy?z + a4,x°2? + agy’z? +--+ +a,x°y?z7 +¢ =0 
(3) bz + b,y*z + b32z7v? +--+ +b, yz%v* +c, = 0 
e,xu’ + ex? uw + e,yz*u* + egx?vt +--+ +e,u°v4 +c, = 0 


over the field C of complex numbers is such that every finite subsystem of (3) has a solution in C, 
then the entire system (3) has a solution in C. 

As expected and as (3) shows, by a system of polynomial equations we mean a system of 
equations where each equation is a polynomial in finitely many unknowns. The system may 
involve altogether an infinite number of unknowns. 

In what follows C stands for the field of complex numbers, all polynomials are over C, and all 
solutions are meant to be in C. 

We need the following definitions: 

A system S of (polynomial) equations is called consistent if and only if every finite subsystem 
of S has a solution. 

Let S be a system of (polynomial) equations and z an unknown appearing in S, and let 
c © C. The ordered pair (z,c) is called an assignment of z for the system S if and only if the 
system obtained from S by substituting z = c consists of equations which are satisfied or which 
form a consistent system. Following tradition, instead of (z,c) we shall refer to z = c as an 
assignment of z. 

Similarly, let x, y,z,...be unknowns which appear in a system S of (polynomial) equations 
and let p,q,c,...be complex numbers. The set {(x, p),(y, q),(Z,¢),--. } is called a system of 
assignments for S if and only if the system obtained from S by the substitutions x = p, y = q,z 
=c,... consists of equations which are satisfied or which form a consistent system. 

Below we prove a nonobvious lemma of crucial importance; it is the key to what follows. It is 
not valid over the field of real numbers. 

In what follows when we say that “an unknown appears in a polynomial equation P(...) = 0” 
we mean that the coefficient of at least one term in which a nonzero power of that unknown 
occurs is not identically zero in P(...). 

Also, in what follows by “almost every” we mean “all but a finite number.” 


LEMMA 1. If P(x,...,2) = 0 is a polynomial equation in which the unknown z appears, then 
either z has only a finite number of assignments or else z = c is an assignment of z for almost every 
complex number c. 


Proof. If z is the only unknown which appears in P(x,...,z) = 0, then P(x,...,z) reduces 
to a nonconstant polynomial P(z) in z which (by the Fundamental Theorem of Algebra) has a 


finite number of zeros c,,...,¢,. Obviously, the finitely many assignments z = c,,...,Z = c, are 
the only assignments of z. Hence in this case z has only a finite number of assignments. 
If besides z, finitely many other unknowns x, y,...,u also appear in P(x,...,z), then we 


rewrite P(x, y,...,u,Z) aS a polynomial in the descending powers of x, say, as: 


(4) x*P)(...) + xk IP...) te +P...) = 0, 
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where the P.(...)’s are polynomials in which some of the unknowns, y,..., 4,2 may appear, and 
where P)(...) is not identically zero. 
In connection with (4) one and only one of the following two cases must occur: 


(5) in P)(...) the unknown z does not appear 
or 
(6) in P,(...) the unknown z appears. 

Let (5) occur. Then, without loss of generality, we may assume that P,(...) is a polynomial 
Po(y;..-, 4) in which the unknowns y,...,u appear. Since Po(y,...,u) is not identically zero, 
there exist complex numbers p,...,q such that 
(7) Po(p,.--,q) #9. 

Assertion: z = c is an assignment of z for every complex number c. Indeed, since z does not 
appear in P)(y,...,u4), we see that upon substituting y = p,...,u=q and z=c for every 
complex number c in P(x,...,2Z) aS given by (4), we get an equation that has a solution. This 


follows readily from (7) and from the Fundamental Theorem of Algebra. Hence in this case z = c 
is an assignment of z for every complex number c. 
Let (6) occur. Again, without loss of generality, we may assume that P)(...) is a polynomial 


Po(y;.--,u4,Z) in which the unknowns y,...,u,z appear. Again, since P)(y,...,u,Z) is not 
identically zero, there exist complex numbers s,...,¢ such that 
(8) Py(s,...,t,2) = Po(z) # 0. 


Now, as a nonconstant polynomial in z, by the Fundamental Theorem of Algebra, P)(z) as given 
by (8) has finitely many zeros h,,...,h,,. Thus 


P)(s,...,t,¢) # 0 for every c except c =h,,...,h 


m 


and consequently 
(9) P)(s,...,¢,2) # 0 after the substitution z = c, for almost every c in C. 


We claim that z = c is an assignment of z for almost every complex number c. Indeed, we see 
that upon substituting y=s,...,4=t and z=c for almost every complex number c in 
P(x,...,2Z) as given by (4), we get a system that has a solution. This follows readily from (9) and 
from the Fundamental Theorem of Algebra. Hence in this case z = c is an assignment of z for 
almost every complex number c. 

This completes the proof of Lemma 1. 


REMARK 1. We observe that Lemma 1 is not valid over the field R of real numbers. For 
instance, for the polynomial z — y* = 0 over R there are continuum many assignments z = r 
with r > 0 of z, but there are also continuum many nonassignments z = r with r < 0 of z. 


Now, we generalize Lemma 1 to the case of a finite system F of consistent polynomial 
equations (i.e., to the case where the finite system F is solvable). Our proof is based on Lemma 1 
and on the existence of a resultant system of a finite system of polynomial equations [2, 3]. 


LEMMA 2. Let F be a consistent finite system of polynomial equations in which the unknown z 
appears. Then, for F, either z has only a finite number of assignments or else z = c is an assignment 
of z for almost every complex number c. 


Proof. To the finite system F we apply, say, Bezout’s process of elimination of the unknowns 
appearing in F. We continue the process until it yields a finite system L of resultant polynomial 
equations L, = 0,...,L, = 0. We recall [2], [3] that F is consistent (1.e., has a solution) if and 
only if L is consistent (i.e., has a solution). Moreover, every solution of L can be extended to a 
solution of F. Furthermore, no two distinct L, = 0,..., 2, = 0 have any unknown in common. 
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One and only one of the following two cases must occur: 


(10) in L the unknown z does not appear 
or 
(11) in L the unknown z appears. 


Let (10) occur. Since by the hypothesis of the Lemma F is consistent, so is L and since z does 
not appear in L, obviously, z = c is an assignment of z (for L as well as for F’) for every complex 
number c. 

Let (11) occur. Then the unknown z appears in one and only one of the resultant polynomial 
equations, say, L,(x,...,z) = 0. But then, by Lemma 1, (for L as well as for F) the unknown z 
has a finite number of assignments or else z = c is an assignment of z for almost every complex 
number c. 

We extend Lemma 1 to the case of a consistent system D of less than continuum many 
polynomial equations. Thus, every finite subsystem of D has a solution and 


(12) D<2%=C 
(where D is the cardinal number of the set D). 


LEMMA 3. Let D, with D < C, be a consistent system of polynomial equations in which the 
unknown z appears. Then for D, either z has a finite number of assignments or else z = c is an 
assignment of z for every complex number c except for less than continuum many c’s. 


Proof. By Lemma 2, one and only one of the following two cases must occur: 
(13) D hasa finite subsystem F for which z has only a finite number of assignments 
or 


for every finite subsystem of D it is the case that z = c is an assignment of z for almost 
(14) .; . 
every complex number c (depending on the finite subsystem). 


Let (13) occur and let z = ¢,,...,z =, be the finitely many assignments of z for F. Let us 
assume to the contrary that none of these assignments is an assignment of z for D. Thus, upon 
substituting z = c, (with i=1,...,n) in D we get a finite subsystem F of D which has no 


solution with z = c,;. Hence, the finite subsystem F, U--- UF, U {F} of D has no solution 
whatsoever, and this contradicts the consistency of D. Thus, some of the finitely many z = 
C,,...,Z = c, must be assignments of z for D and since these are the only assignments of z for F, 
we see that in this case z has only a finite number of assignments for D. 

Let (14) occur. Then D < C by (12). Thus, D has less than continuum many finite subsystems. 
We recall that less than continuum many finite subsets of complex numbers contains less than 
continuum many complex numbers. However, since there are continuum many complex numbers, 
from (14) it follows that z = c is an assignment of z for D for every complex number c except for 
less than continuum many c’s. 

This completes the proof of Lemma 3. 


REMARK 2. We observe that “for every except for less than continuum many” which appears in 
Lemma 3 cannot be replaced by “for almost every” which appears in Lemmas 1 and 2. We note 
also that Lemma 3 ceases to be valid if in it D has continuum many polynomial equations (cf. our 
example (2)). 


COROLLARY 1. If z is an unknown appearing in a consistent system D of less than continuum 
many polynomial equations, then z has an assignment for D. 


We recall our earlier definition of a system of assignments for a system of (polynomial) 
equations. Accordingly, in Corollary 1, if (z,c) is an assignment, then {(z,c)} is a system of 
assignment for D. Thus, we may rephrase Corollary 1 as follows. 
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COROLLARY 2. The set of all the systems of assignments of a consistent system of less than 
continuum many polynomial equations is nonempty. 


Based on Corollary 2, we prove our main result: 


THEOREM. Let D be a system of less than continuum many polynomial equations over the complex 
numbers C such that every finite subsystem of D has a solution in C. Then D has a solution in C (and 
conversely). 


Proof. Let x, y,u,...,2Z,... be the unknowns which appear in D. By Corollary 2, the set A of 
all the systems of assignments for D is nonempty. Let 


A= {{(x,4)},{(%5),(y,¢)}, (C4 ),095 1), (4 D)} (45 8), ( gs fo Fe 


We partially order A by the set theoretical inclusion C . It can be readily verified that every 
simply ordered subset of (A, C) has a supremum. Thus, by Zorn’s lemma, (A, C) has a maximal 
element M = {(x, b),(y, 2), (u, Pp), (U, q),--- }. To prove the theorem, it is enough to show that 
every unknown which appears in D also appears in M. Let us assume that z does not appear in 
M. Then upon substituting the elements of M,1e, x = b,y =t,u=p,v =q,... in D, we geta 
system D’, with D’ < C, which is a consistent system of polynomial equations in which the 
unknown z appears. Thus, by Lemma 3 or Corollary 2, there exists an assignment z = c of z for 
D’. Clearly, M U {(z,c)} would contradict the maximality of M and therefore every unknown 
appearing in D also appears in M. Obviously, M yields a solution of D. (The converse is trivial.) 


REMARK 3. Throughout this paper the field of complex numbers C can be replaced by any 
algebraically closed field A, in which case C must be replaced by A. 


This work is partly supported by Iowa State University SHRI. 
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137. MISCELLANEA 


Two kinds of symbol must surely be distinguished. The algebraical symbol comes naked into 
the world of mathematics and is clothed with value by its masters. A poetic symbol—like the 
Rose, for Love, in Guillaume de Lorris—comes trailing clouds of glory from the real world, 
clouds whose shape and colour largely determine and explain its poetic use. In an equation, x and 
y will do as well as a and 5; but the Romance of the Rose could not, without loss, be re-written as 
the Romance of the Onion, and if a man did not see why, we could only send him back to the real 
world to study roses, onions, and love, all of them still untouched by poetry, still raw. 


—E. M. W. Tillyard and C. S. Lewis, 
The Personal Heresy. A Controversy, 
Oxford University Press, London, 
1939. (The excerpt above appears 
in a chapter by C. S. Lewis.) 


THE LOGARITHMIC MEAN IN nr VARIABLES 


A. O. PITTENGER 
Department of Mathematics and Computer Science, University of Maryland Baltimore County, Catonsville, 
MD 21228 


1. Introduction. In some recent notes in this MONTHLY, several authors have discussed 
properties of the logarithmic mean and variations thereof ((2], [8], [10], [15]). This little known 
“average” has been used to define a mean temperature between two points at different tempera- 
tures [11] and, somewhat surprisingly, in an analysis of index numbers in economics [1]. Further 
applications, however, require a “natural” generalization to a logarithmic mean of n variables, 
and the purpose of this note is to discuss such an extension and to derive its basic properties. 

As it happens, a suitable definition and some basic results are available in earlier papers of 
Carlson ([3], [4]). In Section 2 we present this definition and mention some of its properties. In 
Section 3 we let Jensen’s inequality run rampant and establish in particular that the logarithmic 
mean in n variables is bounded above and below by the arithmetic and geometric means, 
respectively. In fact we define an interpolating family L, of logarithmic means such that L_,, is 
the geometric mean of x,,...,x,, L_, is the logarithmic mean, L, is the arithmetic mean and L, 
is increasing in r for fixed x,,...,x,,. 

Applications may well require an explicit formula for the logarithmic mean, and that form is 
established in Section 4 for distinct x,,...,x,. Other applications and, indeed, one of the 
motivations for considering this problem involve the asymptotic behavior of the logarithmic mean. 
It can be shown that under mild restrictions on the x’s, convergence of the arithmetic mean to a 
limit value implies the convergence of L, to that same value for all fixed r. In the context of 
probability theory, this becomes a strong law for L,. Proofs of these more technical results will 
appear elsewhere. 


2. Definitions. For positive x and y the logarithmic mean in two variables is defined as 


log(y) — log(x) \~" 
(2.1) L(x,y) = y-x » FY; 
XxX; X=Yy); 


and some of the interest in this function stems from its relationship to the arithmetic and 
geometric means: 


1/2 
(2.2) (x+y) <L(x,y) < A(x, y) =(x+y)/2. 
In fact, if we define the pth arithmetic mean as 
(xy), p=0, 
(2.3) A (x,y) = Pp p\1/p 
Pp (= | ; P pa 0, 


then it is possible to show that the inequality 
(2.4) A,(x,y) < L(x, y) < A,(x,y) 


A. O. Pittenger: I received my Ph.D. from Stanford in 1967, under the direction of K. L. Chung, and then held 
visiting or faculty positions at Moscow State University, the Rockefeller University, and the University of Michigan 
before settling down in Maryland in 1972. My primary area of research is in probability theory, but every now and 
then I am lured into problems involving elementary inequalities, a fact I have heretofore concealed from my 
probabilistic colleagues. My second profession is being the only non-musician in our family of five and fulfilling our 
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holds for all positive x and y if and only if p < 0 and1/3 < gq. (These results were first obtained 
by Lin [10]. See also [13].) 

The definition given in (2.1) does not suggest a reasonable generalization, although some 
authors have sought to find one based on that form [5]. However, following Carlson, we can write 
log( y) — log(x) as f? dt/t and make a change of variable to obtain the following felicitous form: 


—1 
: 1 dv 
2.5 L(x, y) = sae 
2%) = ([Saaeos| 
Thus L involves the reciprocal of points in the convex hull of x and y. 
The representation (2.5) appears in Carlson [2] and is a special case of his multivariable 
definition which we now give. Let A,_, represent the simplex 


n—-1 
(2.6) Aya = ((O1eestha)i0€ 0, <1 Eo, <1}, 


and let its volume be denoted by m,_,. An easy calculation gives m,_, = ((n — 1)!)7'. (As a 
point of interest, the volume is a special case of Dirichlet’s integral. The reader unacquainted with 
that integral will find a nice presentation in [16, p. 258]. 

Letting dv denote the differential of volume in A,_,, we define dp,_, as the probability 
measure (n — 1)!dv and 


(2.7) L(x) = L(x,,...,X,) = / (x0) "depa) 


n-1 
where 
n 
(2.8) x-v= )) x,v,, 
i=] 
and v, = 1 —v, — ::: —v,_,. In calculations below we will move from x to (x,,...,x,,) without 


comment and will delete subscripts when no confusion will result. 
Let us note that L as defined in (2.7) does have some of the basic properties of a mean. It is 
obvious that 


min(x,,...,x,) < L(x) < max(x,...,x,), 


and thus that if x = L(x,,...,x,), then x = L(x,...,x). (However, unlike the multivariable 
arithmetic means, it is not true that L(x,,...,x,) remains unchanged if x,,...,x, are replaced 
by L(x,,...,x,), for k < n.) Furthermore, L is increasing with increasing x. Indeed if 0 < x; < y,, 
then 


0 < L(y) ~ Lx) = La) L(y) f P= -% 


eC») ( max(»,)) 


2 ° max( y, ~ x,). 


( min (x; ) 
3. Inequalities. In Section 4 we compute an explicit form of L which generalizes (2.1), but it is 


the integral representation that is more useful in establishing the theoretical properties of L. In 
fact, if we write 


(3.1) g(t)=r", 


the definition can be restated as 
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(3.2) L(x) = "| [6(x-v) do), 


a special case of Carlson’s function M, [4, p. 33]. The form of (3.2) suggests the theory of means 
as presented in Hardy, Littlewood, and Polya [7, p. 75] and we use their techniques below. 
Since @ is convex and p is a probability, Jensen’s inequality implies that 


fol: 0) dp > o{ [(x-») dp). 


Because @ is decreasing 


6 fo(x-0) do) < f(x-v)do= Dae, 
where c, = fv,;dp. By symmetry the c, must all equal n7! 
inequality with the arithmetic mean: 


(3.3) L(x1,..-5%,) < A,(x1,---,X,) = dx,/n. 


To obtain the geometric mean (IIx,)'”” as a lower bound, we expand the scope of the 
discussion. Recall that as p increases from zero to one, the p-arithmetic mean A, increases from 
the geometric mean A, to the arithmetic mean A,. Such an interpolating family L, is defined by 
Carlson, and a further study is given in Stolarsky [14] for the case of two positive variables. In fact 
the problem settled for L in (2.4) can be posed for L,: find sharp upper and lower p-arithmetic 
mean bounds for L,(x, y). (That problem was settled in [12] and independently in [15]. See [8] for 
an overview.) 

The definition of L, for two variables is 


, and we have thus established the 


(3.4) L,(x,y) = "| [o(ox +(1 - v)») do), 
where 
(3.5) #9 = {foa(0), a, 


The generalization is again immediate: 


(3.6) L(x) = #"| [o(x- 0) do). 


Thus L_,(x) = L(x) and L,(x) is the arithmetic mean of x,,...,x,. The geometric mean can 
also be expressed in this notation. 
(3.7) LemMa. (IIx,)'/7" = L_,(x,,...,X,): 


Proof. This follows by induction. For n = 2 it is easy to integrate (3.4) and obtain (xy)'”* = 
L_ (x, y). For the induction step we compute the inner-most integral in 


1 l—v l-v —n-1 
_ 1 _ 1 
LBs Maa =f nido, [ dv, of dv, (¥° x,v;) 
where v =v, + ++: +u,_,. If x, # X,4,, that computation gives 


1 1 7 1 


n(x, — Xn41) s v,x, +(1 - Ones s vx, +(L — "x 


n 


(If all the x, are equal, there’s nothing to prove.) Using that expression in the remaining 
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(n — 1)-fold integral and the induction hypothesis, we have 


- af _ 7 
LoTR (415-6. Xa) = (%, — Xn 41) (xn — x, ') 


completing the proof. 


We now have all the preliminary results for the following theorem, which is a special case of 
results given in [3] and [4]. 


(3.8) THEOREM. If r <s, L,(x) < L,(x). AS a Special case, 
(3.9) (IIx,)'”" < L(x) < A,(x), 
and equality holds if and only if the x; have a common value. 


Proof. The assertion is essentially a consequence of a discussion in Hardy, Littlewood, and 
Polya [7, p. 75]. Assume r < 5 and define 


(1) = 9,067). 


Thus 
7 r < s, both nonzero, 
®(t) =< e™, r=0<s, 
r_‘log(t), r<O=s, 


It is easy to check that ® is concave if r < s < 0 and is convex in all other cases. Hence, Jensen’s 
inequality gives for suitable positive f 


| {fap} > [O(s) ao, r<s<0, 


and 
| [fae] < fos) dp, otherwise. 
Using f(v) = $,(x - v) and the structure of ®, we obtain 


e,(L,) > [(x- 0) dp, ifr<s <0, 


and 


@,(L,) < foe -v) dp, otherwise. 


If s < 0, ¢, * is decreasing, so that application of ¢,! reverses the inequality. In all other cases, 
o, ' is increasing. Thus L, < L, in all cases, and the assertion about strict inequality is a 
consequence of Jensen’s inequality. 


4. An explicit form for L. To use the logarithmic mean in applications, it would be nice to have 
an explicit formula for L(x) which does not involve integration. Moreover, since we have 
identified the geometric mean of x,,...,x, as L_,(x), it is natural to attempt to find an explicit 
representation for L_, with r an integer. In this section we give that representation for distinct x;. 
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By the continuity of L(x)—and, analogously, of L_,(x)—it is possible to obtain explicit 
formulae when two or more of the x, coincide. 
First note that for x, # x, 


_ log(x,) , log( x) 
LN 2) = ty 
2 1 1 2 


Next, for distinct values of x,, x,, x3, easy calculations give 


3 
| 
L~*(x1,%5, 3) = 2 > _ x;log(x;) | 
i=1 II (x, — x;) 
J#i 
and 
3 
_ log( x;) 
L~3( x1, X53) = —-2)) aT 
i=1 PaAGe — x;) 


It is these results which lead us to the general form. 


(4.1) PRoposITION. Let p(i,1,n) = I_) 4,(x%; — x;). Then for distinct positive x; and integral r, 
O<r<n, 


xr log( x,) 
Li (x,,.--. =(-1)""° (771) ya ane eee 
(X1s-0-5%,) = (-1) X= pin) 
In particular, when r = 1 we have 
"* log( x,) 


L"! _ —1 
(x, x,) =(n ye p(i,1,n) 
In the proof of (4.1) we shall have occasion to use p(i,1,n) for different variables. When that 
occurs,the letter of the variables will appear. As an example, we have the following lemma whose 
proof we postpone. 


(4.2) LEMMA. Let a,,...,a, bes distinct, nonzero real numbers. ForO <k<s — 2, 
k 


Loan 0. 


| p(i, 1,5; a) 


Proof of (4.1). The proof involves two inductions—the first on r with n =r-+1 and the 
second on n > r+ 1 with r fixed. Both inductions tend to be notationally involved, and we omit 
many of the details. 

For the first induction observe that for r = 1 and n = 2 the formula given by (4.1) coincides 
with the definition of L~‘(x,,x,). For the induction step the trick is to integrate the innermost 
integral of L_,_,, and that leads to 


r+1 
ay en ae ; L a ; 
(X19 - X41) { “T(x1,.. ’ X41) 7 “1 (x4,. » X, X,+2)}. 


We now use the induction hypothesis to replace each of the terms inside the brackets by a suitable 
summation, obtaining for the right-hand side of the foregoing the expression 


(-1)"" CAD LY tox) { 1 XT X41 


Xp — Xp p(i,l,r+1) p(i,l,r+2 


Loti (%1,---5% 42) = 


4 log( Net 1) log( X42) 
p(rt+1,1,r) p(r+2,1,r) 


104 A. O. PITTENGER 


Some easy algebra allows this last expression to be written in the desired form. 

Now for the second induction. For fixed r we have just established (4.1) for n = r+ 1. The 
trick needed here is to make a change of variables in the inner integrals by setting v; = (1 — v,)u, 
and y, = v,x, + (1 — v,)x; for2 <i < n+ 1. Converting to the new variables leads to 


—r 1 n-ly_p+ 
Loy (Xy50565 Xn 41) -f n(1 a v,) L=7(Ya5+++s Yat) dv) , 


and use of the induction hypothesis allows the right-hand side to be written as 
n+1 
1 


n(-1)"* ("> 1 » 


1 n—r-1 
See iay flair + a) Nox a0, + x) doy 
i=2 - ‘ 


where a; = x, — X;. 
It is easy to evaluate the definite integrals above, but when one tries to write the results in the 
form of (4.1) it becomes necessary to show that 
nt+1 1 —] 
(4.3) XL p(i,l,n+1) p(i,l,n+1) 
and 


n+1 n-r —r 


xp °-— x; 
4.4 tt __ = 
44) 2X p(i,l,n+t 1) 
Here’s where we need Lemma (4.2): with s = n + 1 and k chosen first as 0 and then as n — r, we 
can verify both (4.3) and (4.4). That step completes the second induction and the proof of (4.1). 


Proof of (4.2). We have isolated (4.2) not only because.it’s required in the preceding proof, but 
also because it’s a nice example of the technique of partial fractions. Under the hypotheses, we 
can write 

x A, 
~*~ ep 


k+1 s 
=] (x — a;) 


II(x-a;) ! 
i=1 


It is then trivial to verify that A, = a**!/p(i,1, s; a) and that (4.2) results from setting x equal to 
Zero. 
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SYSTEMS OF NUMERATION 


AVIEZRI S. FRAENKEL! 
Department of Mathematics and Statistics, University of Calgary, Calgary, Alberta, Canada T2N IN4 


1. Introduction. There are many ways of representing an integer uniquely! The best known 
method is the decimal system. Whereas the Maya Indians used base 20 (using the 10 fingers and 
10 toes), some of the human race became recently more primitive using the binary system instead, 
being influenced by the computer race which, for electronic reasons, is zealously addicted to the 
binary system. It may be of interest to electronic computers to know that there are actually 
infinitely many binary systems! 

Somewhat less known systems of numeration include mixed radix, factorial representation, and 
exotic systems based on recurrence relations, a special case of which is the Fibonacci system of 
numeration. So there are many ways of representing an integer uniquely; many ways, that is, in 
each of which an integer can be represented uniquely. 

These and other systems of numeration normally hide in various unexpected places, where they 
are applied for varied purposes. Typically, when the need for a numeration system arises, it is 
defined and an ad hoc proof of its capability to represent integers uniquely is given. The purpose 
of this article is to unify these results and show how they can be derived simply and uniformly. 

In Theorem 1 we present a very simple yet general system of numeration. It may be used to 
derive all the numeration systems we intend to present, but some repetitive argumentation is 
involved. We prefer instead to use Theorem 1 to derive a general numeration system based on 
recursively defined basis elements. This is done in Theorem 2, which sheds more light on the 
nature of numeration systems than Theorem 1. We then derive our numeration systems from 
Theorem 2—all but one. The exceptional system is based on a recurrence relation with a negative 
coefficient, whereas the recurrence relations of Theorem 2 contain only positive coefficients. The 
exceptional system is therefore derived directly from Theorem 1. 

Theorems 1 and 2 are given in Section 2. The derivation of the numeration systems from 
Theorem 2 is carried out in Section 3 in a rather neater way than Theorem 1 would permit. The 
exceptional system is derived in the final Section 4. Applications and uses of the numeration 
systems are briefly indicated. These include the ranking of permutations, of permutations with 
repetitions and of Cayley-permutations; polyphase sorting and merging of large data files, 
irregularities of distribution of sequences, the Chinese Remainder Theorem, various games, and a 
class of binary search trees called cedar trees. New applications motivated by this work include 
coding theory and the compression of sparse binary strings. 

It should be pointed out that not all the known numeration systems can be derived from 
Theorem 1. An example is the combinatorial numeration system 


a, an- a, a . 
wa ()o(aro BB) Gea caso en 


(see, e.g., Lehmer [12]). There is a way of generalizing Theorem 1 (from numbers to infinite sets) 
so aS to include also the combinatorial representation, but we prefer at this stage to keep our 
results as simple as possible. (For a fast algorithm for computing the “digits” a,, see [7, Sect. 2].) 

To keep the discussion simple, we state and prove our results for nonnegative integers only. At 
the end it will become clear that they hold, in fact, for any integer. 


2. Two basic numeration systems. Let 1 = uy < u, < u, < --- bea finite or infinite sequence 
of integers. Let N be any nonnegative integer, and suppose that wu, is the largest number in the 
sequence not exceeding N (except that we let n = 0 if N = 0). Dividing N by u, and iterating 
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gives 
N =d,u, + Mrs O<r, <4, 
t= dy {Uy + M1 O< 4-1 < Uy-1 
Ty—1 = 4y—2Un—2 + My —25 O< 7,2 < U,-2 


ri,=du,t+n, O<r<4; 
r=dut+n, O<n<4y, 
aT = duo. 


Collecting terms we get 
N=d,u,+4,_\U,-1+ °°: tdou, (d,2>0,i> 0). 
This is the representation of N in the numeration system S = { uo, u, Uy,... }. The process above 
shows that every nonnegative integer can be represented in the system S. Note that 
4, = 4,u,t+d,_\u;-, + °°: tdouyp <u,,, (i> 0). 
Conversely, if N = L?_9d;u,, where 
(1) d,u;+d;_\u,_; +++: +doty <u,4, (i> 0), 
then 1)” _)d,u; is the unique representation of N by S, as shown in the following theorem: 


THEOREM 1. Let 1 = uy < uw, <u, < --- be any finite or infinite sequence of integers. Any 
nonnegative integer N has precisely one representation in the system S = {uo,U,,U>,...} of the 
form N = X"_)d,u,, where the d; are nonnegative integers satisfying (1). 

Proof. It remains only to establish uniqueness. Suppose that N has two representations: 

N = C4, + ++. + Co Uo = d,,u,, + es +doug; 
where the digits c, and d, are nonnegative integers and satisfy (1). Let i be the largest integer 
such that c,,, # d;41, Say ¢;,1 > d;4,. Then 
(C41 i” Dis) Ur41 _ (d, a c,) 4, Foe +(do a Cy) Uo 
du, + oes +douo, 


Ui41 S 
<< 
contradicting (1). 


The proof is complete by noting that the iterated division algorithm above produces a 
representation satisfying (1) for every nonnegative integer N. @ 


REMARKS. (i) The existence of the representation has been shown in Yaglom and Yaglom [19, 
Chap. 8], where it is also stated that every N has only one representation satisfying (1). 

(ii) Since every nonnegative integer N has a representation satisfying (1), it follows that if N 
has a representation violating (1), then it has two representations. In this sense (1) is a necessary 
and sufficient condition for uniqueness. 

(iii) Note that (1) implies 


IN 

Q 

A 
~ 
+ 


(2) 0<d, — (i > 0). 

Now sometimes (2) implies (1) and sometimes it does not. When the uw, are defined recursively, 
the situation depends on the length of the recurrence relation! If the recurrence relation contains 
only one term (u, = b'u,_,), then (2) does imply (1). Therefore for the more conventional 
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numeration systems such as decimal, binary, mixed radix, and factorial systems, (2) is a necessary 
and sufficient condition for uniqueness. But for systems in which the recurrence relation contains 
more than one term, (2) is only a necessary but not a sufficient condition. This will become clear 
from the deeper Theorem 2 below. 


For m > 1, let b, = b'”, b,,...,b,, be integers satisfying 
1<bh<-:- <b, <b” 
for all n>1. Note that b,,...,5, are constants, but b, = b{”) may depend on n. Let 
U_m+1>U—m+29+++)U_, be fixed nonnegative integers, and let 
(3) uy =1,u, = DB” u,_, + bu,» +--+ +b,u,_m (n> 1), 


be an increasing sequence. 

Suppose that N = £"_,d,u, is the unique representation satisfying (1) of an integer N in the 
system S = {uo,u,,U>,... }, where the u,; are defined by (3). Then the digits d, have to satisfy 
(2) by Remark (iii) above. If m = 1, (2) implies d, < u,,,/u, = b¢*» (i> 0). If m> 1, then 


1 
d; < (b+ u; + by u,_, Fores + Dy Uy — m1) / Uj 


< DEY +(bMu,_, + byu;_g +++ + Byte m) J, = OTP 41° (i>). 


m~i—m 


By (3) (with n = 1), dy < u, = 2", b,u,_,, where b, = b{. Thus 


(4) O0<d, <b) (i211), OK<d<bP+ Lidbu_, (m>1)), 
jH=2 
O0<d,<bit) (i20) (m=1), 


is a necessary condition for uniqueness. It is not in general sufficient, however. Here is the full—if 
slightly strange-looking—story. 


THEOREM 2. Let S = {u;} be a sequence of the form (3). Any nonnegative integer N has 
precisely one representation in S of the form N = L_,d,u; if the digits d, are nonnegative integers 
satisfying the following (two-fold) condition: 

(i) Letk > m—1. For any j satisfying 0 <j < m— 2, if 


(5) (dys dys. 0e5 dy ja1) = (BET, by... 8), 


then d,_, < 6,4; and if (5) holds with j = m — 1, then dyn. < Om. 
(ii) Let 0 < k <m-—1. If (5) holds for any j satisfying 0<j < k —1, then d,_, < b,.,; and 
if (5) holds with j = k, then dy <u 4 1b Up 44 _;- 


TABLE l 
1 N 5 1 236 65 18 5 1 N 
1 1 2 2 
2 2 2 3 
3 3 2 4 
4 4 3 0 
0 5 3 1 
1 6 3 2 
2 7 0 0 
3 8 0 1 
4 9 
0 10 1 3 1 34 
1 11 1 3 2 35 


EXAMPLE. Let us examine the numeration system defined by m = 3, u_, = 0, u_, = uy = 1, 
u, = 3u,_, + 2u,-, + u,-; (n> 1). Thus 5, = 3, b = 2, b = 1, and S = 
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{1, 5,18, 65, 236,...}. By (4), the digit bounds are 0 <d, <3 (i> 1) and 0<d, <4. Table 1 
exhibits the representation by S of a few integers. We point out, in particular, that by 
subcondition (i) the representation of 65 is 1000 rather than 321 as might be expected at first, and 
the representation of 236 is 10000 rather than 3210. By subcondition (ii) the representation of 18 
is 100 rather than 33, and the representation of 36 is 200 rather than 133. 

Note that subconditions (i) and (ii) are both concerned with blocks of consecutive digits. They 
differ only in the location of these blocks: in (i) the right-hand digit d,_,,,, of a block of 
maximal size m is in some position k — m + 1 > 0. Such a block may appear anywhere within 
the digit sequence. If it is “shifted right” until its mghtmost digit goes beyond position 0, we get a 
truncated block: the right-hand digit of the truncated block (of smaller maximal size k + 1 < m) 
occupies position 0. This is the situation considered in (ii). 

Further note that (4) is implied by the condition of the theorem: the first inequality of the case 
m > 1 of (4) is obtained by putting 7 = 0 in the first part of (i); the second by putting 7 = k = 0 
in the second part of (ii). For m = 1, (i) with j = m — 1 gives0 < d, < b{**) (k > O) and (ii) is 
empty. Also the first part of (i) is empty. So for m = 1 the condition is equivalent to (4). 

To motivate further the somewhat curious condition of Theorem 2, consider the violation of (i) 
in the special case when (5) holds for j;=m-—1 but d,_,,,, > 5,. Then the integer M = 
L*_,_,,+14,u,; has two representations: 

k-1 
Ma BY) + DY by it, + gm Uk—ma 1 
i=k—m+2 
and M = u,,, + (dy_ms+1 — Om) Ue—m+ 1 Lhe latter, which violates (1), follows from the former 
representation using the recurrence (3). (After Lemma 1 below, it may be seen that also the other 
violations of the condition of Theorem 2 boil down to the case of a sequence of digits sufficiently 
large, so that a number has an additional representation, violating (1), obtained by employing the 
recurrence (3).) 

We finally remark that Theorem 2 does not consider the most general case (for example, some 
negative coefficients could be permitted in the recurrence relation), but it suffices for deriving in a 
simple manner all but one of the numeration systems of interest to us. 


Proof of Theorem 2. We first show that a representation satisfying the condition of the theorem 
always exists. Let N be any nonnegative integer and let N = L”_,d,u, be its representation as 
produced by the iterated division algorithm preceding Theorem 1. Assume that (i) does not hold. 
Suppose first that there is some j satisfying 1 <j < m — 2 for which (5) holds but d,_, > b4,. 
Then 


k k j+l 

(k+1) 
» d,u, 2 » d,u, z bi uy + » Bugsi—, + up_, 
i=0 i=k—j i=2 


J+1 m 

_ 2(k+1) k—-y) 

= bi u, + » By Ug41—i + bys U,—,-1 + » buy), 
i=2 i=2 


m 
(k+1) _ 
> bj uy + DOU = Meads 
i=2 


violating (1). Secondly suppose that (5) holds with 7 = m — 1 but d,_,,,, > 6,,. Then 


k k m 

(k+1) _ 
> d,u, > » d,u, > bj uy + yy Ups iin = Ua 
i=0 i=k+l—-—m i=2 


again violating (1). If we assume that (ii) does not hold, then the same arguments show again that 
(1) is violated. 
It follows that the representation produced by the iterated division algorithm satisfies the 
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condition of Theorem 2—in addition to (1). 
For demonstrating that there is only one representation satisfying the condition, we first prove 
an auxiliary result which sheds a light of its own on the nature of general numeration systems. 


LEMMA 1. (i) For any nonnegative integer t, write t+ 1=qm+r,0<r< _m. Consider the 
following representation of a nonnegative integer f(t) in the system (3): 


f(t) = byt) u, + by Uu,_\ bores + by, Up42—m +(8,, _ 1) U;+1—m 


1- 
+ bt’ "bom + by U;_1—m tos + by, 1442-2 +(8,, i 1) Ur41—2m 
+... 


+1—-(q-1 

+ bi! (4 OW gym + by U1 ~(¢g—1)m tose: t+ By — {Ut 42~ qm + (By, i” 1) Ur+1—qm 
+1— 

+ bi! bom + by U,—-1— gm Fosse +b, Up 42pm + doug, 


wheredg=b, —1 (ifr=0); dog =U Gb..,u_,-—1 (ifr > 0). Then f(t) = u,,, — 1. 
(ii) Let N, = X"_od,u, and N, = "_,,,d,u; + U/_oc,u,, where c, < d,, l any integer in [0, n], 
and the c,,d, are arbitrary digits satisfying the condition of Theorem 2. Then N, < N,. 


Part (i) of the lemma states that f(t) is the analog of 99...9 (¢ + 1 9’s) in decimal. Part (ii) 
states that if two representations differ in the /th position being identical to the left of it, then the 
representation with the larger digit in position / corresponds to the larger number. This is of 
course trivial for, say, the decimal system. In the present more general case, however, this result is 
not a priori clear: Suppose for example that—for m > 3—three consecutive digits of some 
representation of some number satisfy (d,,d,_,,d,_>).= (b\**?, b), b,). By the condition of 
Theorem 2 these digits are maximal, in the sense that none of them can be replaced by a larger 
one. However, replacing, say, d, _, = b, by b, — 1 enables replacing d,_, = b, by d,_, = B{*~”, 
which may be considerably larger than b,. Part (ii) states that such a replacement yields a smaller, 
not a larger number as might be expected. 


Proof of Lemma 1. It is straightforward to verify the following two facts: 

(a) The representation of f(t) as given in (i) satisfies the condition of Theorem 2. 

(b) Increasing any digit of this representation while leaving the digits to its left unchanged 
results in a representation violating the condition of Theorem 2. 

For the proof of (i), note that (3) implies directly 


f(2) _ (ups — Ur+1—m) + (Up 41—m _ U,+1—-2m) = 
+ (Ura. —(g-tym ~ Mrsi—gm) +(tra1—gm — 1) = M41, — 1. 


We prove (ii) by induction on the “digital position” /. The result is clear for / = 0. Suppose the 
result holds for any two representations satisfying the condition of Theorem 2 whose leftmost 
differing digits are in some position </. We have to show that, given any representation 
N, = &"_9d,;u;, we have N, < N,, where Nj =2"_),,d,u,+Uitbeu,, g 4, <d)4,, and the 
representations of N, and N, satisfy the condition of Theorem 2. Subtracting, we get 

/ / 
MN, — Ny = (dpa. — S141) Mar + LL (4; — 8.) 4, > M41 - Le BM, 
1=0 i=0 
Let g(/) = Lj _og,u,. If the representations of f(/) = X/_)f,u, as given in (i) of the lemma (with 
replaced by /) and g(/) disagree, then the leftmost disagreement occurs in position k for some 
k € [0,/], that is, f, # g,, but f = g, for k <i</. Since the representation g(/) = L/_)g,u, 
satisfies the condition of Theorem 2, fact (b) implies g, < f,. By our induction hypothesis we thus 
have g(/) < f(/). Since by (i) f(/) = u,,, — 1, we thus have N, — N, > 1; that is N; < N,. 


We now resume the proof of Theorem 2. Let N = L"_,d,u, be a representation satisfying the 
condition of Theorem 2. If the representations of f(n) and WN are different, then fact (b) implies 


110 AVIEZRI S. FRAENKEL [February 


d,<f, for some k € [0,n]. By (ii) and (i) of Lemma 1, we thus have N < f(n) < u,.,, So 
inequality (1) is satisfied. Hence the representation L’_,)d,;u; of N is unique by Theorem 1. 


We note that the proof above demonstrated the equivalence of (1) and the condition of 
Theorem 2 for the system {u;} of the form (3). In particular, if N has a representation violating 
the condition, then N has two representations, since it also has a representation satisfying the 
condition. Analogously to remark (ii) above we can thus say that the condition of Theorem 2 is a 
necessary and sufficient condition for uniqueness. In the applications below we will see the form 
this condition assumes for each case. 


3. A spectrum of numeration systems. We shall now use Theorem 2 to derive several families of 
useful numeration systems. Existence of these families is evident from the iterated division 
algorithm just preceding Theorem 1. It will therefore suffice to demonstrate the digit bounds 
required for uniqueness. Recall that uw) = 1 for all systems. 


Polynomial systems. Let b > 1 be a fixed integer and let u, = b”, that is, u,,, = bu, (n > 0). 
Let N be any nonnegative integer. Since the recurrence for the u; has length 1, Theorem 2 implies 
that the representation N = ¥7_,d,b' is unique if and only if 0 < d, < b (i > 0). This gives the 
most commonly used numeration systems, such as the decimal (b = 10) and the binary (b = 2) 
system. 


Mixed radix. Let 1 = ay, a,,4,,... be any sequence of integers with a, > 1 (i > 1), and let 
u, = aj)a,...a,, that is, u,,, =4,,,uU, (n> 0). By the argument above, the representation 
N = "_»d;a,...a; is unique if and only if 0 < d, < a,,, (i > 0). The mixed radix representa- 
tion has been used for a constructive proof of the generalized Chinese Remainder Theorem (see [3] 
and Knuth [10, Sect. 4.3.2]), and, in conjunction with other numeration systems, for ranking 
permutations with repetitions and Cayley-permutations [15]. The latter method has been applied 
for compressing and partitioning large dictionaries in’ order to enable the storage of their 
“information bearing” parts in high-speed memory [7]. 


Factorial representation. This is the special case of the mixed radix representation where 
a, =n-+1, leading to u, =(n +1)! (n> 0). Thus the representation N = L!_,d,(i + 1)! is 
unique if and only if d; <i+1 (i> 0). The factorial representation has been used for ranking 
permutations; see Lehmer [12] and Even [2, Chap. 1]. 


Reflected factorial representation. To represent a nonnegative integer N, select h with h! > N, 
and let u, = h!/(h —n)!, that is, u,,, = u,(h —n) (n = 0). Since again the recurrence has 
length 1 only, the representation N = L"~¢d,h'/(h — i)! is unique if and only if0 <d,<h-—i 
(0 <i<h— 2). The reflected factorial representation has also been used for ranking permuta- 
tions [2, Chap. 1]. 

Up to this point all systems used only a one-term recurrence relation for the u, (the case 
m = 1 in Theorem 2). This produced the better known numeration systems. The more exotic 
systems are obtained for m> 1. In these cases requirement (4) does not suffice to insure 
uniqueness, and the condition of Theorem 2 is needed to guarantee it. We start with an example 
illustrating the case m = 2. 


Continued fraction representation. Let a be an irrational number satisfying 1 < a < 2. Then a 
has a unique simple continued fraction expansion of the form 


a = 1 + — ——— = [1,@,,4,43,...], 


where the a; are positive integers. Its convergents p,/q,, = [1,4,,a),...,4,| Satisfy the recursion 
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1), 
1). 


P-i = 1, Po=1, Pn = 4n Pn-1 + Py 2 (n 
q-j = 0, q =1, Qn = 4nIn-1 + Qn-2 (n 
See, e.g., Hardy and Wright [8, Chap. 10], Olds [16], or Perron [17]. 


2 
2 


THEOREM 3. Every nonnegative integer has precisely one representation of the form 
k 
(6) N = 2) S;Dj.9 < 8; < G443 $)41 = A442 > 5, = 0 (i > 0), 
i=0 


and also precisely one representation of the form 


l 
(7) N= Digi 9< I< 4,0 <4 <4,415 4 = 44, > 4) =0 (i > 1). 
i=0 


Proof. The requirements (4) imply the bounds on the digits s; and ¢,; and the condition of 
Theorem 2 implies that if one of these digits attains its maximal value, then its right-hand-side 
neighbor must vanish. 


We remark again that if any of the digit bounds in (6) or (7) is violated, then the condition of 
Theorem 2 is violated. Since N has also a representation satisfying the condition, N has two 
representation in this case. Thus (6) and (7) are necessary and sufficient for uniqueness in the two 
numeration systems. 

If a, =1(i> 1), then a = [i] = (1 + /5)/2 is the golden ratio (where x denotes the infinite 
concatenation of x with itself). In this case the system (6) becomes the Fibonacci numeration 
system which is a binary system (digits 0 and 1 only), with the proviso that two adjacent 1’s never 
occur. See Zeckendorf [20]. This system lies behind the Fibonacci search (see Knuth [11, Sect. 
6.2.1]); it has also been used by Whinihan [18], [9, Sect. 1.2.8] for giving the winning strategy of a 
game on a pile of tokens. 

Numeration systems of the form (6) and (7) can also be defined for rational a. An interesting 
relationship exists between n expressed in the system (7) and | na| and || in the system (6), 
where a! + B-! =1 [6]. This relationship is particularly interesting for the special case 
a =[1,da], where a is any positive integer. It can be utilized for giving a winning strategy for 
generalized Wythoff games both in normal play [4] and in misére play [5]. The class of cedar trees 
consolidates the winning strategies of these games. The case a = 1 gives a strategy for the classical 
Wythoff game [19], which is a two-player game played with two piles of tokens. Each player may 
remove any number of tokens from a single pile or the same number of tokens from both piles. In 
normal play, the player first unable to move is the loser, his opponent the winner. This outcome 
convention is reversed for misére play. 

An interesting class of numeration systems containing those considered in Theorem 3 is 
induced by expanding a in a not necessarily simple continued fraction: 
a=1+ Pi 

a, + 
a, +- 
a, + 4 


b; 


We shall not explore this family here any further, however. 
We consider next an example of an arbitrary length recurrence relation. 


Higher order Fibonacci systems. Fibonacci numbers of order m (m > 2) are defined by 
U_m+1 = U_ma42 = 00° = U_, = 0, u_, = uy = 1, 


u,=U, 1 +U, > + °°: +U,_m (n> 1). 
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(For simplicity we write u, instead of u‘”).) This definition gives the ordinary Fibonacci numbers 
for m = 2. 

It follows directly from Theorem 2 that any nonnegative integer N has precisely one binary 
representation of the form 


n 
N=) du, (0<d 
i=0 

such that there is no run of m consecutive 1’s. Because if there is no such run, then the condition 
of Theorem 2 is satisfied and the representation is therefore unique; if the representation of N 
contains a run of m consecutive 1’s, then the condition of Theorem 2 is violated. By Theorem 2, 
N has also a representation without a run of m consecutive 1’s, so N has two representations in 
this case. 

Since an m-order Fibonacci system exists for every m > 2, there are infinitely many binary 
systems as claimed at the beginning of the article. In a similar way we can in fact construct 
infinitely many b-ary systems for every integer b > 2. 

Fibonacci numbers of order m have been used by Knuth [11, Sect. 5.4.2] for polyphase merge 
of data runs stored on magnetic tape transports: we like to sort-merge data runs stored on m + 1 
magnetic tape transports such that at each stage m tapes merge into one tape and so that the 
tapes run continuously, to save time. This can be done if at each stage the number of runs on one 
of the tapes is an mth order Fibonacci number, and a simple function thereof on the other tapes. 
The same numeration system but with different values for u_,,41,U_m+25---»U_, has been 
applied by Lynch [13] to polyphase sorting. 


4. Another continued fraction system. Let a = [1,a,,a,a3,a,...], that is a,, =a (n>1), 
where a is any positive integer. Further, let wu, stand for either p, or qg,, the understanding being 
that in each formula involving u,, either all u; stand for p, or all stand for qg,. We shall develop 
two numeration systems based on the numerators and denominators of the even convergents of a. 
Let us start with two auxiliary results on the even convergents. 


0, ifu,=q;; 


Throughout we let ¢ = | (i> —-1). 


1, ifu, =p, 


LEMMA 2. If we define u_, to be 1 — ea (and uy = 1), then the even convergents of a satisfy 


Uon — (aa,,,-4 + 2) Uon—-2 —~ Uan-4 (n 2 1). 


Proof. For n > 1 we have, 


Uy, = AU, 1 + UgQy-2 = A(z, 1Uan—2 + Ur,—3) + Uyy 2 


n 
= G07, 1Urn-2 +(uUy,_9 — Uy,-4) + Uy, 


(aa, + 2) Urn—2 — Uo,-4- 80 


LEMMA 3. Let 0 < k < I. Then 
Uy )47 = A( Ag) 4 Uo, + Ay) {Maya + 07+ FAgg az Mrgs2 + Arp 4142x) 
+ Ug, + Ug, — Un, 2- 
Proof. By Lemma 2, 
Uyi42 = (4z)41 + 2) Ug) — Uy)» 
Uy, = (Gz,_, + 2) Ug). — Uyy~4 


Uo;-2 = (aay)_3 + 2) Us;_4 — Ug/-6 
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UrK+6 (da, 45 + 2) Uo~44 — Uop4+2 
Un. 4.4 = (AAg~ 43 + 2) Uap 4. — Udy 
Uo~K42 = (a5 444 + 2) Uo, — Urp-2:- 


Adding, we get the claimed result. @ 
We are now ready to present the last family of numeration systems considered here. 


THEOREM 4. Every nonnegative integer has precisely one representation of the form N = 
"9 d>,Uy;, where the digits d,,; are nonnegative integers satisfying 


(8) 0<d,, <aa,;,,+1 (i> 1), 0O<d)<a(a,+e) 


and the additional condition: If for some 0 <k <I<n, d,, and d,, attain their maximal values, 
then there exists j satisfying k < j < | (so actually | — k > 2) such that dy; < aay ,;,. 


Proof. Let n be any nonnegative integer. To show that N has a representation of the required 
form, apply the iterated division algorithm to produce the representation N = L"_)d,u;. This 
representation satisfies (1) and (2). By (2), 

Upi42 _ (aay, + 2) uy; — Ua;-> 


_ 2 | 
dy; < = 4A7;,,; + 2- (i > 0), 
Uy; Udi uy 


which implies the bounds (8). Suppose the additional condition does not hold. If k > 0, then 


[ [ 
» dy,Uz; > » dy,Uy; > A( Az)4 Uy, Fores + Ay 4 41U2,) 
i=0 i=k 


Fug, + Ugg = Ugi4g TF Ugg 2 > Ui 42> 


by Lemma 3. This violates (1). If A = 0, then 
/ 
Ye dy My, > A(Gy/41Ma) + +++ +azu, + aU) 
i=0 


+ U5, + EQU, = Ur/42 + u_5 +(ea —_ 1) Uo = Unj42. 


again violating (1). It follows that any representation satisfying (1) also satisfies (8) and the 
additional condition. 

Now suppose that the representation T = Lj_,)d>;u>, satisfies (8) and the additional condition. 
It is not hard to see that then 7 is maximal for / = t, that is, d,, = aa,,,, +1. Now d,, 
maximal implies d,; < aa,,,, for some k<j <1. Since u,;>u,;_, (i> 0), T does not 
decrease if we let d,, = aa,,,, for all k <j </, put k = 0 and decrease the maximal value of 
d,, = d, by 1. Thus 


T < A( Gy, 41M, + A, 1Uy, 2 + +++ +a;u, + au) 
+ Uy, + EQUy a 1= Uni42 + u_5 + (ea —_ 1) up —_ 1 = Uy142 —_ 1 (t > 0) 
by Lemma 3. Thus condition (1) is satisfied and so the result follows by Theorem 1. @ 


In the “Fibonacci case”, that is the special case where a = [1], the p,-system of Theorem 4 
becomes a rather curious ternary numeration system, since (8) now implies 0 < d,,< 2 (i> 0). 
In this case the condition of Theorem 4 states that between any two digits 2 there must be a digit 
0. This special case has been used by Chung and Graham [1] to investigate irregularities of 
distribution of sequences. It and several variations can also be used for efficient data compression 
of sparse binary vectors, where the lengths of the 0-runs are expressed in the selected numeration 
system, and the basis elements are Huffman-coded. (For Huffman coding see [9, Sect. 2.3.4.5.].) 
Numeration systems based on Fibonacci systems of order m and other variations can also be used 
to design error-insensitive codes for data transmission. The details of these developments will be 
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elaborated on elsewhere. 

Recapping the main results of this article, note that the simple-minded iterated division 
algorithm is the tool which produces the representation for any nonnegative integer N in any 
system S = {up,U,,Uz,...: 1 = Uy < uy < uy < ++: } in every one of the four theorems and the 
other applications considered here. This representation satisfies (1), and there is only one 
representation satisfying (1). The various systems considered here differ only in the definition of 
S, precipitating for each case a condition of its own equivalent to (1). This condition is very simple 
for systems in which the recurrence for the u, consists of one term only, but may become rather 
strange-looking for higher order recurrences. 

The main upshot of the applications is that—analogously to the case of data structures—the 
proper choice of a system of numeration may lead to considerably more efficient algorithms. 

We finally remark that for representing a negative integer N in any of the above numeration 
systems, represent |N| and then reverse the signs of all the digits. Note that for representing any 
integer in this way, its digits are either all nonnegative or all nonpositive. For representations with 
both positive and negative digits, see Matula [14], who considered only the case where the basis 
elements are powers of some integer b, however. Are there any interesting numeration systems 
using both positive and negative digits, with more general basis elements than b'—such as those 
considered above? 
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AN ELEMENTARY PROOF OF THE CAUCHY-KOWALEVSKY THEOREM 


WOLFGANG WALTER 
Universitat Karlsruhe, Mathematisches Institut I, Kaiserstr. 12, D-7500 Karlsruhe, West Germany 


Dedicated to Professor Johannes Weissinger on his seventieth birthday * 


0. Introduction. It is the main aim of this article to present an elementary proof of the 
Cauchy-Kowalevsky (= C — K) theorem based on the contraction principle. The theorem deals 
with a nonlinear partial differential equation 


(1) DP u = F(t, x15... %,5U,..-, D'D2u,...) (k+ lal<p;k<p) 


b] n? 


or with a system of such equations, where D, = 0/dt, D& = d!*!/dxM --- dx%, and p > 1. It 
states that if for t= 0 initial data D‘u =o, (0 < k < p) are prescribed, where the ¢, are 
analytic functions of x = (x,,...,x,,) in a neighborhood of x) = (x?,...,x®), and if the function 
F is analytic in a neighborhood of the point (0,x ,..., Df%,(0,x,),...), then there exists a 
unique solution u(t, x) in a neighborhood of (0, x,), analytic in ¢ and x and satisfying the initial 
conditions. The problem is called Cauchy’s problem, the initial conditions Cauchy data. It was 
first solved by Cauchy [2], and, in a more general and simplified way, by Sophie v. Kowalevsky 
[9]. It represents the analogue for partial differential equations of the basic initial value problem 
for an ordinary differential equation u‘”)(t) = f(t,u,u’,...,u’~») with given initial data 
u(0),..., u“?~ (0). Obviously, this analogy motivated Cauchy to consider the problem. 

As a preliminary step of the proof found, e.g., in [3; p. 39-56], the equation (or system) is first 
converted into a more convenient quasilinear first order system 


(2) u,= ), B(t,x,u) u, + c(t, x,u). 
j=l 


In the vector notation used here, x = (x,,...,%,), U= (u,...,U,,), the B; are m by m matrices, 
and u, c are 1 by m matrices, i.e., column vectors. Some textbooks, e.g. [10] [18], consider only 
the linear case 


(3) u,=A(t,x)ut Y B(t.x)uy, + (1,2), 


which is much easier to handle. In both cases the initial condition is given by u(0, x) = (x). 
We show in the important special case of a nonlinear second order problem (with n = 1, for 
simplicity) 


(1’) Uy = f(t,%, Uy Up, Uys Ux Uxx)s u(0, x) = $y ( x), u,(0, x) = o,(x), 


Wolfgang Walter: I was born in Schwabisch Gmiind/Germany. Since my early age, mathematics and music have 
been my main interests. Keeping the latter field for leisure, I studied mathematics and physics at the Universitat 
Tubingen and received a Doctorate in 1956 under E. Kamke. In 1958 I followed an invitation by A. Weinstein and 
spent a year at the University of Maryland. Since 1963 I am Professor of Mathematics at the Universitat Karlsruhe, 
spending a considerable part of my professional life at various American universities. My main interest is in ordinary 
and partial differential equations with a special emphasis in differential inequalities. I have written the monograph 
Differential and Integral Inequalities (Ergebnisse der Mathematik, Vol. 55) and several mathematical textbooks. The 
main ideas of the present article emerged in 1982/83 while I was a Visiting Professor at the University of Florida 
and the Georgia Institute of Technology: 


*As early as 1952 Professor Weissinger suggested to use contraction proofs in the classroom. His article Zur 
Theorie und Anwendung des Iterationsverfahrens (Math. Nachr., 8, 193—212, 1952) contains the generalized contrac- 
tion principle and numerous applications from different branches of pure and applied analysis. 
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how this transformation is accomplished. Setting u, = v and u, = w, a first order problem 


u, =U, u(0, x) = $o(x), 
(1) v, = f(t, x, u,v,W,v,,W,), v(0,x) = $,(x), 
WwW, = Uys w(0, x) = $(x), 


is obtained. It is easily seen that u solves (1’) if and only if (uv, v, w) solves (1”) (passing from the 
latter equation to the former one requires the equation w = u,, which follows from u,, = w, by 
integration in the ¢ direction). Now, (1”) or, more generally, any first order system of the form 


u, = g(t, x,u,u,) with wu = (u,,...,u,,), Z = (213--+5 Bm) 


transforms by differentiation with respect to x into a quasilinear system for (u,v) = (u, u,) 


(2’) i = g(t, x,u,v), 


U, = 8, + 8,0 + 8yV,5 


the argument being (t, x, u,v) in all three terms. The reasoning above is easily adapted to the case 
where x = (X,...,X,,) and/or u = (u,,...,u,,) in (1’). 

In the classical method of proof, used by Cauchy [2] and Sophie v. Kowalevsky! [9] and 
presented in many textbooks, the solution is sought as a power series in ¢ and x, whose 
coefficients are determined by recursion formulas using the corresponding expansions of B, and c. 
The centerpiece of the approach is the convergence proof for this series by the method of 
majorants. It consists of considering an auxiliary problem of the same structure (with the same 
recursion formulas) which majorizes the given problem in the sense that all coefficients involved 
are positive and larger than the absolute values of those in the given problem. Cauchy invented 
and applied the method in many instances. He called it, not very suggestively, calcul des limites. 

A new proof based on entirely different ideas was discovered in 1941 by M. Nagumo [11]. 
Nagumo considers the quasilinear system (2) with zero initial values (no loss of generality) and 
transforms it into an equivalent equation for v = u, 


n 
v(t,z)= > B(t,z,1v)Iv, + c(t,z, Iv), 
j=l 


t ° . . . 
where ([v)(t, Z) = i u(1t,z) dt (= u). This equation is treated as an operator equation v = O(v) 
0 


in an appropriate Banach space E, and a solution is obtained as a fixed point of ®, using the 
Schauder fixed point theorem. An essential tool is a lemma, reproduced with proof in Section 2, 
which provides an estimate of the derivative of a holomorphic function in terms of the function 
itself (in [11] it is given for the case where Q is a ball |z| < R). This lemma has proved to be of 
importance in many modern developments of the Cauchy problem and is called here Nagumo’s 
lemma. As a consequence, the variables x; are now complex by necessity, and for this reason we 
have written z instead of x. On the other hand, the variable ¢ is assumed to be real, and the 
solution is shown to exist in a conical region 0 < ¢ < /, |z| < R — Lt. The functions B; and c are 
assumed to be continuous in all variables and holomorphic in z and u, and v is continuous in 
(t,z) and holomorphic in z. Strictly speaking, Nagumo’s theorem does not cover the C-K 
theorem. But its proof is also valid in the case where ft is a complex variable and the functions 
involved are holomorphic in all variables, a fact which Nagumo does not mention. In short, 
Nagumo obtained a new proof and, by allowing ¢ to be a real variable, an important generaliza- 
tion of the C-K theorem. But this approach has also a drawback: uniqueness does not follow from 
the Schauder fixed point theorem and has to be proved separately. Another point of comparison 


‘Different ways of spelling the name have appeared in the more recent literature (Kowalewska, 
Kowalewskaya,...). The author, who has neither the historic nor the linguistic expertise to make a reasonable 
choice, has been led by the doctoral dissertation [9] which was written by “Frau Sophie von Kowalevsky’’. 


1985] AN ELEMENTARY PROOF OF THE CAUCHY-KOWALEVSKY THEOREM 117 


regards the “real’’ version of the C-K theorem, which states that the solution is real-valued for real 
values of the variables if the coefficients in the differential equation have this property. Nagumo 
does not prove that fact, yet it is easily obtained in the framework of his proof. If one restricts 
either the whole Banach space or the convex, compact subset which is mapped into itself by ®, to 
functions which are real-valued for real z,, then his proof gives the desired result. Recently, Keller 
and Schneider [8] published a proof of the classical C-K theorem by reduction to Schauder’s fixed 
point theorem, which bears a likeness to Nagumo’s proof (the latter is not cited). 

Another approach which gives rise to various generalized versions of the C-K theorem was 
developed in the sixties. The basic notion is that of a scale of Banach spaces, that is a collection 
(B,), > Of Banach spaces with the property that 0 < p < o implies B, C B, and |lul|, < ||u||, for 
u © B,. One considers bounded linear operators L from B, into B,, ||Lull, < ||L\|p\lul||, for any 
0 < p < a, which satisfy an estimate 

C 

o—p. 
These notions are already used in Vol. III of Gelfand and Silov’s treatise [4], A. 2.1-2.3, in 
connection with the linear Cauchy problem for constant coefficients and analytic initial values. In 
1965, Ovsjannikov [14] proved an existence and uniqueness theorem, often quoted as Ousjannikov’s 
theorem, for the Cauchy problem 


du _ 

dt 
in this abstract framework, where (L,) is a family of linear operators with the properties stated 
above for L. Further research along these lines by Treves [20] [21], Nirenberg [12], Ovsjannikov 
[15] [16], Nishida [13], Pate [17] and others (see also the bibliography of these articles for further 
reading) led to linear and nonlinear “abstract” Cauchy-Kowalevsky theorems. The connection 
with the classical theorem is established by the example 


LIl, < 


L,u, u(0) = uy € B, 


B,= {u: u(z) is holomorphic for max |z,| < p, ||u||, = sup |u(z)| < co} 
IzZi<p 
and the operators L = 0/dz,: B, > B, which satisfy ||d/dz,||} < 1/(o — p) for0 < p<o. An 
easily accessible exposition of the linear case is given in Chapter 17 of Treves’ book [22]. This 
theory shares with Nagumo’s proof the property that ¢ can be a real or complex variable. 

The proof presented in this paper is elementary inasmuch as it uses Banach’s contraction 
principle as the only tool from functional analysis. Basic properties of holomorphic functions u(¢) 
depending on a single complex variable § such as Cauchy’s integral formula are used. As far as 
holomorphic functions of several complex variables are concerned, it is sufficient to know the 
definition (a function f: G — C, where G is an open subset of C”, is holomorphic in G, if f and 
the derivatives df/dz, (i =1,...,) are continuous in G) and the fact that f(u,(§),..., u,(¢)) is 
holomorphic in §, if f and the u; are holomorphic. Power series expansions, which can be used to 
give an equivalent definition of holomorphy, are not used. The exposition is written with an eye to 
classroom use. For this reason, the linear case is treated first, where the proof is remarkably simple 
and short. Apart from didactic advantages, the proof has an important mathematical conse- 
quence: it gives not only existence and uniqueness, but also estimates between the solution and an 
approximate solution, in particular, continuous dependence on the initial values and the right 
hand side of the differential equation. In short: the proof by contraction shows that the analytic 
Cauchy problem is a well-posed (or correctly set) problem. 

At first glance this statement seems to contradict well-known facts about second order partial 
differential equations of mathematical physics. In explaining the situation, we consider the 
simplest prototypes, (a) the hyperbolic (vibrating string) equation u,, = u,,, (b) the elliptic 
(potential) equation u,, + u,, = 0, and (c) the parabolic (heat) equation u,=u,,. In the 
hyperbolic case, the problem models a simple physical situation. It is correctly set, and it has a 
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solution not only for analytic, but also for much more general (e.g., continuous) initial values. In 
contrast, the classical, well-posed elliptic problems are not Cauchy problems, but boundary value 
problems where only one function (e.g., u) is prescribed on the whole boundary. Consider, for 
example, problem (b’) with initial values u(0, x) = 0, u,(0, x) = $,(x). If a solution exists, it is a 
harmonic and hence (real) analytic function for t > 0. Furthermore, the solution allows an 
analytic continuation for t < 0 given by u(—t, x) = —u(t, x). Consequently, ¢, is analytic. Thus 
problem (b’), while possessing a solution for analytic $,, by virtue of the C-K theorem, never has 
a solution for non-analytic ¢,. A more detailed account is given in Hadamard’s classical Lectures 
on Cauchy’s problem [5, Chapter II]. Here one finds also the famous example (p. 33): 


u,+u,,=0, u(0,x)=0, u,(0,x) = asin kx, 


with the solution u(t, x) = (a/k) (sin kx)(sinh kt). If a > 0 is small, then |u(0, x)| < a is small, 
but for any fixed ¢ # 0 the solution can be made arbitrarily large by choosing k large. Hence the 
zero solution (a = 0) does not depend continuously on the initial values. These considerations 
give the impression that the applicability of the C-K theorem to elliptic problems is a strange 
curiosity: the solution exists, but it behaves oddly. 

But the odd behavior appears only as long as we consider analytic functions for real variables 
(one might add that there are other cases of odd behavior in this setting). If we put holomorphic 
functions in their natural habitat, which is C or C”, the oddities vanish. Theorem 2 will make this 
point more precise. It 1s followed by a discussion of Hadamard’s example. 

The heat equation of problem (c) is not of the type of equation (1); in this equation we have 
p = 1, but |a| = 2 in violation of the condition |a| < p. The C-K theorem becomes false for the 
heat equation. The following counterexample is already found in [9, p. 22]. The problem u, = u,.,, 
u(0,x) = 1/(1 — x) has a unique formal power series solution 


< (2i)!t = 6(2i)! 
u(t,x) = )) ———— = YO eo’, 
iso M1 —x)"** geo FIA! 


but the series diverges for any t ¥ 0. 

In this connection we remark that well-posedness, while being a standard subject with regard to 
initial and boundary value problems for elliptic, hyperbolic,... partial differential equations, is 
only rarely discussed in the Cauchy problem with analytic data. Also, the notion of well-posedness 
is often defined in a narrow sense, meaning continuous dependence on initial values only, while 
the physical meaning of this notion requires that continuous dependence on the right hand side of 
the differential equation be included. 

Let us finally note that there is a more general Cauchy problem for equation (1), where Cauchy 
data (i.e., initial values) are given on a noncharacteristic hypersurface Q defined by an equation 
(ft, 21,-..,Z,) = 0. This case can be reduced—at least locally—to the case treated here where Q 
is lying in the hyperplane t = 0. The notion of noncharacteristic surface and the reduction 
procedure are described in many textbooks, e.g., in [7; Chapter 3] and [18; § 3]. 

The author has looked for a proof of the C-K theorem by contraction for some time. His 
conviction that such a proof must exist was based on Bessaga’s noteworthy theorem [1]: 


If M is a set, a © (0,1) and T: M > Ma map with the property that T" has one and only one 
fixed point for n =1,2,..., then there exists a metric d on M which makes-M a complete metric 
space and for which d(Tx, Ty) < ad(x, y). 


The Cauchy problem in question can be written as a fixed point equation u = Tu (see (9) for 
the linear case and (18) for the nonlinear case), and the C-K theorem states that T has a unique 
fixed point. Moreover, it is not difficult to show that T” also has a unique fixed point for any 
n > 1. Bessaga’s theorem now implies that T is a contraction in a suitable metric space. The 
problem was to find the metric. 


1. Geometrical preliminaries. Since we want to construct a solution in the large, some 
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geometrical considerations are necessary. Let Q be an open set in C” with a nonempty boundary 
I = dQ, and let d(z) = dist(z,T) be the distance from z € Q to I, measured in the maximum 
norm |z| = max;_, __,,|Z,|. The set G, a subset of R xX C” (real case) or €”*' (complex case), 
consists of all points (¢,z) with z € Q and |t| < nd(z), where ny > 0 will be specified below. Let 
Q, be the set of all z © Q such that (7, z) © G or, equivalently, d(z) > |t|/n. It is easily seen that 


(4) d(1,2) = d(z) ~ MI >0 (zEQ,) 


is the distance from z to the boundary IT, = 0Q,. In geometrical terms, G is the double cone with 
base Q and slope y, and Q, is the base of that part of G which lies above t (t > 0) or below tf 
(t < 0). This description applies to the real case. The following property of d(t, z) will be needed 
later: 


(5) zE€Q,,|2-—2'|=r<d(t,z) =z’ €Q, and d(t,z’) > d(t,z) -r. 


This follows readily from the triangle inequality d(z) < |z — z’| + d(z’). 

If one is interested only in a local solution, it suffices to consider the case where {2 equals 
Br(Zo), the open ball in C” with center at z) and radius R. In this case d(z) = R — |z — Zol, 
Q, = B.(Z)) with r = R — |t|/n and d(t,z) = R — |z — Z| — |t|/n. 


2. The linear case. The linear C-K theorem deals with the following initial value problem 


(6) u,=A(t,z)ut YS B(t,2)u, + (tz) in G, 
j=l 
(7) u(0,z) = $(z) inQ, 
which can be stated as an equivalent integral equation of Volterra type 
(8) u(t,z) =g(t,z) + [ A(7,z)u(7t,z) + Y Br), (72) dt, 
j= 
where 


g(t,z) =$(z) + ['c(r,2) dr. 


Here, u = (u,,...,u,,) has values in C”, A and B; are complex-valued m by m matrices, and c 
and f are complex-valued m-vectors. In our matrix notation, u, u, = du/dt, u, = du/dz;, > 
and c are considered column vectors. A solution of (8) is, by definition, continuous in G and 
holomorphic in z for fixed ¢ (real case) or holomorphic in ¢ and z (complex case). In the complex 
case it is understood that integration is performed along the straight line from 0 to ¢. In both 
cases, a solution of (8) is, under proper assumptions regarding A,...,, of class C in G (complex 
derivatives with respect to the complex variables) and a solution of the initial value problem (6) 
(7), and vice versa. 

To avoid ambiguity, we use the maximum norm |u| = max|u,| (1 <j < m) and the corre- 
sponding operator norm for matrices A = (a,,), |A| = max, ,|a,,|, but any other pair of norms 
with the property |Au| < |A||u| would do as well. 

The following lemma, which goes back to Nagumo [11], gives a bound for df/dz, in terms of f 
and is essential for the proof. 


NAGUMO’S LEMMA. Let f: 2 — C”™ be holomorphic and p > 0. Then 


If(Z)I< Pa) If, (2) < AGG)’ 


P 
where C, = (1 + p(t + *] <e(pt+1),G=1. 
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Proof. For a function y of a single complex variable §, which is holomorphic in the circle 
\f — §’| <r, the estimate 


/ 1 / 
W(S)1< = max y(¢’)| 
boar 
holds. It follows immediately from the Cauchy integral representation formula 
: / 2 / 
Wd) = fu(ey/fani(g’ — 8)" a’, 


in which the integral extends over the circle |{ — §’| = r. Applying this result with ¢ = z,, we 
obtain the inequalities 


1 c 1 c 
'(z)| <= max z’)| < —~max——~ < ————__ 
(2) < pmax 1/2] <fmaxr <a 
Here, 0 < r< d=d(z) and d(z’) > d— r because of (2). The choice r = d/( p + 1) gives the 
(optimal) value stated in the lemma. O 


To be specific, the following theorem is formulated for the real case. 


THEOREM 1. Assume that 

(1) the functions A(t,z), B:(t,z), c(t, 2) are continuous in G and holomorphic in z for fixed t, the 
function f(z) is holomorphic in z; 

(ii) there exist positive constants a, B;, y, 8 and p such that 


|A(t,z)| < dz) IB(t,z)1<B, 
Y 5 
M(bIS Gaga? WIS Gacy mG 


(ii) a/p + (1+ (1/p))? “YB, < 1/ (this can always be achieved by diminishing 1, if neces- 


sary ). 
Then Equation (8) has a unique solution u in G, and it satisfies |u(t,z)| < C/d?(t,z) in G. 


Proof. Let E be the Banach space of all functions u € C°(G,C”), which are holomorphic in 
z and have a finite norm 


l|u||: = sup|u(t,z)|d?(t,z). 
G 


Note that convergence in the norm implies uniform convergence in compact subsets of G; hence 
the limit is holomorphic in z and E is complete. We write Equation (8) in the form 


u=g+ Tu, 


where T is a linear operator given by 

(9) (Tu)(t,z) = []AG.)u2 + 1 B(1,z) u,(7,z)| dt. 
j 

It follows easily from the inequality 


(10) 


f' dt f" ds n 

_— = < —$$ 
0 d?**(1,z) 0 (d(z)—s/n)?™’ pa?(t,z) 
that g © E. According to the definition of the norm, we have 


(11) lu(1,z)| < Pina’ 
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Nagumo’s lemma, applied to the region 2, with distance function d(t, z) instead of Q and d(z), 
gives the esimtate 


II 
12 u,(t,z)| << C—— 

It follows from the assumption (ii) together with (11) and (12) that 

||| 


| II 
———— ,|Bu,|< C 


aul < ~ da? (t,2) 7?’ 


hence, with B = XB,, 


(Tu) (t,z)| < [lula + BG) 


f' dt 

0 d?**(7,z) 
ll 

d?(t,z)- 


In the last step, inequality (10) was used. Going back to the definition of the norm, we arrive at 
the final estimate 


< (a + BC,)n 


2 oe 


(13) [| Ful] < q|lz|], where q = 


pt+l 
“+ p(1 +=] Jn<1. 


according to assertion (iii). Hence the contraction principle applies to the equation u = g + Tu, 
and it follows that this equation has exactly one solution in E. This proves the theorem except for 
the possibility (which will be excluded below) that there are other solutions which do not belong 


to E. oO 
REMARK 1. It is seen from (iii) that the constant », which describes the region of existence, of 

P 
the solution, depends only on B = L’_,8;. Indeed, for p large, a/p is small and | 1 + — is 


close to e (note that a and B do not depend on p). Hence (iii) is satisfied (for large p) if 
n < 1/(eB). Very simple examples show that this result is, except for the factor e, optimal. 
Let u, = bu, and u(0,z) = o(z) (n =1, 5 constant). Then the solution is given by u(t, z) = 
(bt + z). If @ is holomorphic, say, in the circle |z| < 1, then the solution exists for |z + bt| < 1, 
and this is best possible when ¢ cannot be continued analytically beyond the unit circle. If we 
vary b, keeping |b| = B fixed, then the largest region common to all those regions of existence is 
the circular cone B|¢| < 1 — |z|,1e., 7 = 1/8 is best possible. 


REMARK 2. In order to understand the nature of the assumptions (ii), let us assume that n = 1 
and that z, is an isolated boundary point of Q,ie., that z € Q and d(z) = |z — z,| for z near Zp. 
Then A is allowed to have a pole of the first order, while f and c may have a pole of any order at 
Zy. Now let @ be unbounded (this case is usually not considered in the context of the C-K 
theorem). Then d(t, z) — 00 (perhaps) as |z| — 00, which implies that A, c and $ must vanish at 
infinity. If, on the other hand, assumption (ii) is satisfied, then the region of existence G extends 
to infinity in the ¢ direction. By a slight change in the proof one can handle the case where A, c 
and ¢ do not vanish at infinity at the price of a smaller region of existence which is bounded in ¢. 
The change concerns the definition of d(z). Instead of the distance to I. we may take 
d(z) = min(dist(z,I°),d)), where dy is a positive constant, or more generally, any function with 
the properties 


(14) 0<d(z) <dist(z,T) and |d(z) —d(z’)|<|z-z’'| forz,z’€Q. 


If G, Q, and d(t, z) are defined as before, using the new function d(z), then property (5) and its 
one-line proof remain valid, and 0 < d(t,z) < dist(z,T,) for z € Q,. As a consequence, the 
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lemma remains true, and the proof of the theorem goes through. If we choose a bounded function 
d(z), then the inequalities in assumption (ii) can be satisfied as long as A, c and f remain 
bounded for |z| > oo. 


REMARK 3. In the complex case it is assumed that the functions A, B; and c are holomorphic 
in t and z. Otherwise the assumptions are unchanged. Theorem 1 remains true, and the proof goes 
through with the obvious change that E is now the Banach space of all functions u which are 
holomorphic in ¢ and z and have a finite norm. Note that (10) remains true as it stands. 


REMARK 4. In order to exclude the possibility that there are other solutions not belonging to E, 
let us assume that u* is a solution to (6) (7) in some open domain D C G. Let (0, z,) € D, let 
Q* = B.(z,) be a ball in C” with small radius r, and let G* be the corresponding set in R xX C 
defined by the inequalities |t| < nd*(z), where now d*(z) = r — |z — 2Z,|. Applying our theorem 
to the set Q* we see that there is exactly one solution in G* belonging to the corresponding 
Banach space E*. But since our original solution u as well as the solution u* (strictly speaking, 
their restrictions to G*) belong to E*, we arrive at u = u* in G*. This argument, together with 
the fact that solutions are holomorphic in z, leads to u = u* at least in a small strip, say, for 
(t,z) € D and |t| <a. But now the reasoning can be repeated with initial values given for 
t = +a instead of t = 0. In this way one obtains u = u* in D. In the complex case the last step 
is not necessary, since u = u* in the small implies equality in the large. 


REMARK 5. Assume that the functions in Theorem 1 have the property that their components 
are real-valued for real values of z; and ¢. Then the solution wu has the same property. This 
well-known fact follows immediately from our proof. Indeed, the function g(t,z) = $(z) + 
{oc(s, z) ds has the property in question. If v has the property, then also Tv. Hence, all members 
of the sequence (u,) defined by up = g, u,,, = g + Tu, (k = 0,1,...) have the property, and 
so does the solution vu = limy,. 


The fixed point of a contractive operator S is not only unique; it also depends continuously on 
S. Our next theorem deals with this important consequence of the contraction principle. 


THEOREM 2. Under the assumptions of Theorem 1, the solution u of the Cauchy problem (6) (7) 
depends continuously on the given functions A, B,, c and f: For any given e > 0, there exists 6 > 0 
such that the solution u* of a corresponding problem (6) (7) with A, B,, c and $ replaced by 
corresponding starred quantities which satisfy the inequalities 


A(t,2) — AM(t2)1< 


(1,2) 


6 “eye 
P45)" lo(z) — (Zz) I< P(a)’ 


belongs to the e-neighborhood of u (in E), 


|IB(t,z) — B*(t,z)| <6, 


|c(t,z) — c*(t,z)|< 


— & 
d?(t,z)_ 
Proof. The assertion follows easily from a standard estimate for contractive mappings. Let S 
be any contraction, ||Su — Sv|| < q||u — v||, and let uw be the fixed point of §. Then 
||4 — vo] < [JS — Sol] + ||So — o]] < g|lu — ol] + [So — of, 


lju(t,z) —u*(t,z)|< 


which implies 

1 
l-q 
As in the proof of Theorem 1, we write u = g + Tu and analogously u* = g* + T*u*. Then 
(15) reads 


(15) ||“ — oll < lv — Soll. 
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1 
1-q 
It follows easily from our assumptions and (10) that ||zg — g*|| < 6C, C = 1 + 7/p. The estimate 
(13), applied to T — T* (which means a = B = 8) shows that ||T — T*|| < 6C,. It also follows 
from (13) that for 0 < 6 < 65, where 6, is small, ||T*|| < g* < 1 with g* independent of 6. 
Hence u* is bounded independent of 6 (take, for example, (15) with u, v, q and S replaced by 
u*, 0, q* and g* + T*). Summing up, we have obtained an estimate ||u — u*|| < 8C, with C, 
independent of 6, which proves the theorem. oO 


(16) Il“ — u*|| < 


{Ilg— g*l+ I(T — T*) u* |}. 


REMARK. We come back to Hadamard’s example (a = 1) 
u, + Uu,, = 9, u(0,x) =0, u,(0,x) = sin kx, 
or, in the form of a first order system (v = u,,w = u,), 
u, =U, Uv, = —W,, 


u(0,x) = w(0,x) = 0, v(0,x) = sin kx. 


We replace x by z, choose the ball |z| < p for Q, and consider this example with regard to 
Theorems 1 and 2. It is easily seen that in Theorem 1 we have |A| = 1, |B| = 1, a = B = 1; hence 
G is defined by |t| < n(p — |z[) with n < 1/e. The solution is given by 


u(t,z) = : - sin kz - sinh kt. 


The maximum of |sin kz| or |cos kz| for |z| < o is approximately }e*°, the maximum of |sinh kt| 
or |cosh kt| for |t| <7 is approximately }e*", and in G we have |t| + |z| < p, which implies 
|¢| + |z| < p. Hence for k — oo the solution (u, u,, u,) does not grow faster in G than on (. 


3. The quasilinear case. As pointed out in the introduction, the case of a general nonlinear 
system reduces to that of a quasilinear system of the first order 


n 
(17) u= >» B(t,z,u)u, + c(t,z,u). 
j=1 

As before, the B, are m by m matrices, and c is a column vector of order m. This system is dealt 
with in our next theorem. As in the linear case, we describe a region of ascertained existence for 
the solution in terms of the given functions. But it is not our aim to give an optimal region, which 
would be somewhat cumbersome. The domain G = G, is defined as before by the inquality 
|t| < nd(z), and Bp is the open ball |z| < R in C”. In order to avoid complications as described 
in Remark 2, it is assumed from the beginning that the “distance function” d(z) is bounded and, 
of course, satisfies (14) (for example, 2 bounded). If the initial condition is given by u(0, z) = $(z) 
in {2, we write the solution in the form u = ¢ + v. This gives a new differential equation for v, 
which is of the same type as (17), and reduces the initial values to zero. We shall therefore restrict 
consideration to the latter case. 


THEOREM 3. Assume that the functions B,(t,z,u) and c(t,Zz,u) are continuous in G, X Br and 
holomorphic with respect to z and u (real case), or holomorphic in G, X Br with respect to all 
variables (complex case), and that the following estimates hold there: 


Y 
d(t,z) 
|B < B,, ya(t,z)|B(t,z,u) — B(t,z,v)|< Blu — | 
(j=1,...,n). If » > 0 is such that 
2nldo(Bty)<R,  n(3V3(B+y7) +28) <1 and nef’ <1 


lc| < , d(t,z)|c(t,z,u) — c(t,z,v)| < y’'Ju— vo], 
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with dy = supd(z) < 0, B =LB and B’ = LB’, then the quasilinear system (17) has a unique 
solution u satisfying the initial condition u(O,z) = 0 for z € Q and existing in G,. 


Proof. As in the linear case we consider the corresponding integral equation 
(18) w(t2) = [] EB (zu (n2) + er 2.0) a 
j 


and treat it as a fixed point equation u = Su to which the contraction principle will be applied. 
The underlying Banach space E is the same as in the proof of Theorem 1, and the nonlinear 
operator S is given by the right hand side of (18). In contrast to the linear case the operator S is 
not defined on all of E. In the first part of the proof we shall describe a proper closed subset F of 
E which is mapped into itself by S. We define F as the set of all functions uv € E which satisfy 
the inequalities 


tt 
/d(t,z) 
where p = 2n/dy (B + y)< R. Let u © F and v = Su. Then 

Bt y 
jd(t,z) 


ju(t,z)| <p and |u,,(1,z)|< 


lu.(t,2)|< LIBllu.| + lel < 
J 


Using the estimate 
[¢| 


< 2n/dy, 
0 


I| ds _ 7) 3 
I aa = —2n \/d(z) 7 


integration of v, with respect to ¢ yields 
lv(t,z)| <(B + y)2n/d, =p. 


Now we estimate the derivatives of v, using the notation c(u) = c(t, z, u(t, z)) and similarly for 
B,. Nagumo’s lemma implies 


yC C 
ew <1, ls 
and 
a B; 
92, PAM) | SG 


where d= d(t,z) and C=C,, = 373 /2. Using the product rule of differentiation, we obtain 


<a BC + B + 10) 


IY, 2,1 < 


and, after integration with respect to t (see (10)) 

27 
BC+Bt+yC)< 

(a 


The estimations above show that u © F implies v = Su &€ F. 


[v,,,| < 


— 
vA 


In the second part of the proof the difference Su — Sv will be estimated for u,v © F. In an 
abbreviated notation, we get 


(Su — Sv), = L(B(4) — B(v))u, + UB(v)(u, — v,,) + e(u) — e(v) 


j 
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u—U ’ 
\( Su — Sv),| < > 4 + LB luz, _ v, | + a — vi. 
J J 


Since |u — v| < ||u — vl//d? and |u, — v, | < Gllu - v||/d?*', the estimates 


u—vD , , 
(Su — Sv),|< Mu = 0] pel l(p +BG+y ) 
and, after integration, 
u—U , , 
|Su — Sv| < lu ~ ohh 7 (8 + C,B +7’) 


are obtained. We multiply by d?, take the supremum with respect to G,, and arrive at the final 
estimate 


\\Su — Sv|| < 7 AB + CB +y’)Ilu— oll 


This inequality shows that (for large p) S is a contraction and hence concludes the proof. O 


REMARK 1. Remark 4 of Section 2 carries over to the quasilinear case. The possibility that 
there are other solutions to the Cauchy problem not belonging to E is carried ad absurdum by 
considering subregions of G,. 


REMARK 2. In a similar way, Remark 5 carries over. If B; and c are real-valued for real values 
of t, z and uw, then the solution uw is real-valued for real t and z. This is shown by successive 
approximation u,,, = Su,, starting with uy = 0. 


REMARK 3. As in the linear case, the solution depends continuously on the initial value and on 
the nght hand side of the differential equation. The proof is based on the estimate (15) and 
follows the same lines as before. 


REMARK 4. The subset F of E constructed in the first part of the proof is easily seen to be 
compact. Hence the existence of a solution follows already from the first part of the proof, if 
Schauder’s fixed point theorem is called upon. This proof would have some similarity with those 
in [11] and [8]. 


Let us note finally that the method used in this article is not confined to the classical C-K 
theorem. It can easily be adapted to abstract versions and, more generally, to functional 
differential equations of the Cauchy-Kowalevsky type, for example to delay-differential equations 
with delay in the ¢-direction (real case). In the case where the set 2 is the whole space C”, one 
may consider the space of entire holomorphic functions with finite norm 


fll = suplf(z)je"" — (z € C",a > 0). 


In this space partial differentiation is a bounded operation, ||df/0z;|| < ae||f||. Using this fact, we 
find that the Cauchy problem can be treated in a very simple way; see Redheffer and Walter [19] 
for more details. 
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THE GAMMA FUNCTION AND THE HURWITZ ZETA-FUNCTION 


BRUCE C. BERNDT* 
Department of Mathematics, University of Illinois, Urbana, IL 61801 


The gamma function I(x) may be defined by 


= lim n\(n+1)* 
(1) P(x) = Tim x(x +1)---(x4+n)’ 


where x is any complex number, while the Hurwitz zeta-function ¢{(s, x) is defined by 


(2) (sa) =D (kx) 


*Research partially supported by National Science Foundation Grant No. MCS 7903359. 
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where o = Res > 1 and x is any complex number. (Normally, it is assumed that 0 < x < 1 in 
the definition of §(s, x).) Observe that ((s,1) = §(s), where §(s) denotes the Riemann zeta-func- 
tion. The function {(s, x) can be analytically continued into the entire complex s-plane and is 
holomorphic except for a simple pole at s = 1. 

In 1894, Lerch [9, p. 13] established the following beautiful formula relating I(x) and §(s, x). 


THEOREM. If the prime ’ denotes differentiation with respect to s, then 


(3) Log I(x) = (0, x) — §’(0). 

This formula is not particularly well known. However, it has achieved some prominence in 
recent research. If L(s, x) denotes the Dirichlet L-function associated with the character x, then 
there is a classical formula for L’(0, x) that depends upon (3). Analogues of this formula have 
been established for other L-functions, e.g., p-adic L-functions, and analogues of (3) naturally 
arise. (See, e.g., [2], [3], [10], and [11].) 

The main purpose of this note is to present a new, short proof of (3) that is more elementary 
than previously known proofs [9], [13, pp. 59-60], [14, p. 271]. A second purpose is to show how 
(3) along with other properties of the Hurwitz zeta-function can be used to give short proofs of 
Gauss’s multiplication theorem, Kummer’s formula for LogI'(x), and the reflection theorem. 
These proofs are new, but we do not claim that they are better or shorter than other proofs; we 
primarily wish to emphasize the little noticed connection between the two classical functions in 
the title of this paper. 


Proof of the Theorem. For o >1 and x > 0, a straightforward application of the Euler- 
Maclaurin summation formula [12, p. 13] yields 


t—[r] i 
l-s —s co’ ~ * 
(4) {(s,x) = +5 - sf 2 


(t 4 xo"? 
By analytic continuation, (4), in fact, is valid for o > —1. Differentiating both sides of (4) and 
then setting s = 0, we find that 


i-[1] -= 
€’(0, x) i re 
‘ 2 0 t{+x 


1 
= xLogx —x—-=zLogx+ hm 


poe + tim (~ [E23 
eles \ 
sion fintte +(n+ x +5 )Log(n + x)]. 


mn 00 k=0 


J 


(5) 


4 = Kp 


In particular, 


(6) &(0) = (0,1) = sim [- L—n—¥ Log(k +1) 4 AG +5 )Lo(n + 1)}. 


k=0 


Thus, (5) and (6) give 


1-—x+ y {Log( kA + 1) — Log(k + x)} 


(1) §"(0,x) ~ £0) = im 


AG + x + 5 }Loa(n + x) -(x +5 )Loa(n + 1) 
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1-—x+ > {Log(& + 1) — Log(k + x)} 


= lim 
n-—- co 


n+x 
n+1 


weds ) — Log(n + x) + xLog( + 1) 


x {Log(k + 1) — Log(k + x)} 


n— © 


—Log(n + x) + xLog(n + 1)]. 


On the other hand, by (1), 
(8) LogI'(x) = lim | >) Logk + xLog(n +1) — )) Log(k + x)]. 
nO \k=1 k=0 


A comparison of (7) and (8) completes the proof for x > 0. By analytic continuation, (3) holds for 
all complex x. 

In the sequel, we shall need one fact about ['(x) and a few properties of [(s, x). Recall that 
Euler’s constant y may be defined by 


Y= im { 7 ~ ben}, 


nO \k=1 


Then, from (1), it is an easy exercise to show that 


(9) M1) = - 
Recall that [14, p. 271] 

(10) (0.x) =5- 
and 

(11) §’(0) = - SLog(2n). 


Lastly, we shall need the most important property of §(s, x), Hurwitz’s formula [12, p. 37], [14, p. 
269], 


(12) §(s,x) =2T0 - sin 15/2) y cos(2mkx) + cos( 1s/2) x se |, 


k=1 (20k) k=1 (20k) 
where 0 < x < 1 and o < 1. (A simple proof of (12) can also be found in [1].) 


GAUSS’S MULTIPLICATION THEOREM ([14, p. 240]). For any complex number x and positive 
integer n, 


Proof. For x > 0 and o > 1, by (2), 


=| = (20)? ni -* T(x). 


n—-1 o Logl j + xtk 
=) = "Cs, ren thy | 


n 


(14) {(s,x) - Le(s 5 x + 


1985] NOTES 129 


= (5.x Log{(r+x)/n} 
( +E {(r+x)/n}- 
= (1 — n*)f’(s, x) — n’Log nf(s,x), 


where in the penultimate line we set r = jn + k. By analytic continuation, the extremal sides of 
(14) are equal for all s. Setting s = 0, employing (3), and using (10) and (11), we find that, for 
x > 0, 


Log I'(x) - > Log r( 4 + =| = [x - 5) Loe -~F(n — 1)Log(27), 


which is clearly equivalent to (13). By analytic continuation, (13) holds for all complex x. 
The following elegant representation is due to Kummer [7], [8, pp. 325—328]. Hardy [4], [5], [6, 


pp. 348-350, 428-432] has given two proofs, and references to other proofs may be found in [4]. 
See also [14, p. 250]. 


KUMMER’S FORMULA. If 0 < x < 1, then 


3 (y + Log(27k))sin(27kx) | 


(15) Log I(x) = 5 Log( meso mx)) + ak 


k=1 


where y denotes Euler’s constant. 


Proof. Differentiate both sides of Hurwitz’s formula (12) with respect to s, set s = 0, and 
employ (3) to find that 


Log I'(x) = ¢’(0,x) — £’(0) 
(16) =r Ee 
+ x Leg 2nh)sin20h) _ ¢(0). 


k=1 


The second series on the right side of (16) is well known to equal 5Log($csc(x)), 0 < x < 1 [14, 
p. 190]. Using also (9) and (11), we readily find that (16) reduces to (15). 


REFLECTION FORMULA ((14, p. 239]). If x is any complex number, then 
(17) T(x)TQ — x) = mese( rx). 
Proof. Suppose first that 0 < x < 1. Then by Kummer’s formula (15), 
Log I(x) + Log T(1 — x) = Log( mcsc(7x)), 
which is equivalent to (17). By analytic continuation, (17) holds for all x. 
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138. MISCELLANEA 


Just as surely. as our understanding of Nature is really valid only to the extent that it is 
mathematical, so also our understanding of higher domains must be based on mathematical 
models. 


—R. Steiner (1861-1925), quoted in Mathematical Reviews 82k : 51001. 


Each natural science contains genuine Science only to the extent that it contains 
Mathematics... . It may be that a formal philosophy of Nature in general, that is, a philosophy 
that deals only with general concepts, is possible without Mathematics, but a formal natural 
science dealing with definite objects (whether Physics or Psychology) is possible only with the use 
of Mathematics, and since each natural science contairis only as much genuine Science as it 
contains a priori knowledge, it follows that a natural science is a genuine Science only to the 
extent that Mathematics can be applied to it. 


—Immanuel Kant, Metaphysische Anfangsgrinde der Naturwissenschaft, 1876. 


Perhaps we should sometimes listen to the other side. Goethe, for example, did not like 
Mathematics. 


Mathematicians are amazing people. In virtue of their accomplishments, they have set 
themselves up as a universal guild and will acknowledge only what suits their circle, what their 
method of organization can produce. A prominent mathematician once said, when someone 
strongly recommended a topic in Physics, “But can’t it be reduced to calculation?” 

—Goethe, Maximen und Reflexionen, no. 1277 

It surely does not follow that the hunter who kills the game must also cook it. A cook might go 
hunting and shoot well; but he would be badly mistaken if he claimed that only a cook can be a 
good shot. It seems to me that this is the situation of mathematicians who claim that nobody can 
understand or discover physical phenomena without being a mathematician, since they should be 
pleased enough if the meat is brought to their kitchen for them to lard it with formulas and dress 
it as they like. 


—Goethe, Maximen und Reflexionen, no. 1280 


ANSWER TO PHOTO ON PAGE 93 


The photo is of Louis de Branges, who proved in 1984 the conjecture Bieberbach made in 1916, 
by establishing a strong inequality proposed by the Russian function-theorist I. M. Milin. 
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General, $(13-16), L. Problemes avec et sans...Problemes! Florentin Smarandache. Somipress, 1983, 
175 pp, (P). Contains 140 original problems at the level of the Olympiad contests. Subject areas 
include arithmetic, trigonometry, geometry, analysis, algebra, and general logical analysis. LCL 


General, T(16-17), S, P, L. Thinking, Problem Solving, Cognition. Richard E. Mayer. WH Freeman, 
1983, xi + 426 pp, $16.95 (PB). [ISBN: 0-7167-1441-8] A reorganized and updated version of Thinking 
and Problem Solving (TR, April 1978). This introduction to the psychology of cognition includes 
three additional chapters: "Intelligence," "Creativity Training," and "Mathematical Problem Solv- 
ing." LCL 


General, S(13), L. Unusual Mathematical Puzzles, Tricks, and Oddities. Matthew Mandl. Prentice- 
Hall, 1984, vi + 122 pp, $15.95. [ISBN: 0-13~938150-3] Mathematical and computer entertainments. 
Much familiar material mixed with some original. Three of the eight chapters are computer-related. 
On the light side. Uneven in content and in treatment. No special prerequisite for readers. Price 
is a bit steep. JK 


General, 8*(13-15), L*. The Lighter Side of Gravity. Jayant V. Narlikar. WH Freeman, 1982, viii + 
194 pp, $10.95 (P). [ISBN: 0-7167-1344-6] A wonderfully clear popular exposition of gravitational 
theory from Aristotle to Hawking, enlivened with anecdotes and unobtrusive wry humor. Reminiscent 
of Gamow’s classic 1,2,3,...Infinity, Narlikar’s discourse--slightly mistitled~-takes the lay reader 
on a revealing tour of contemporary cosmology, explaining in clear, simple language how black holes 
can radiate and binary starts can produce gravity waves. LAS 


Elementary, T*(13: 1), S. Using Algebra. Ethan D. Bolker. Little, Brown & Co, 1983, xi + 298 pp, 
$22.95. [ISBN: 0-316-10114-1] Applied intermediate algebra, written to teach students to think about 
problems. Virtually all the examples and exercises are word problems (!): algebra is used to model 
and draw inferences about everyday real-world phenomena. The pace is leisurely and the problems are 
carefully stated and developed. The book teaches the use of the calculator in parallel with the 
mathematics. LCL 


Elementary, T(13: 1), S. Elementary Algebra. C. Lee Welch, Gilbert M. ‘Peter. Scott, Foresman, 
1985, 390 pp, $23.95. [ISBN: 0-673~16576-0}] Intended as a first course in algebra at the college 
level--contains a broad range of topics including a brief introduction to functions and complex 
numbers. Has a very straightforward approach and a high reading level. 4-6 well-chosen examples 
per section, Contains many applications and challenge problems--but thrust of text is the acquisi- 
tion of skills. TR 


Education, T(14-15: 1, 2). A Problem Solving Approach to Mathematics for Elementary School Teachers, 
Second Edition. Rick Billstein, Shlomo Libeskind, Johnny W. Lott. Benjamin/Cummings, 1984, xvi + 
723 pp, $26.95. [ISBN: 0-8053-0856-3] Heuristic approach to problem solving. Changes from the First 
Edition (TR, November 1981; Extended Review, January 1982) include: inclusion of calculator and com- 
puter exercises; both LOGO and BASIC covered in chapter on computers. JRG 


History, P, L*. Writings of Charles §. Peirce: A Chronological Edition, Volume 2, 1867-1871. 
Charles S. Peirce. Indiana U Pr, 1982, xiviii + 649 pp, $35. [ISBN: 0-253-37202-X] Continuation of 
a monumental work begun in Volume 1 (TR, February 1983); a landmark edition including numerous 
manuscripts never before published. Contains extensive scholarly and editorial notes, three intro- 
ductions, and a complete list of Peirce manuscripts, LAS 


Foundations, $(18), P. Mathematical Applications of Category Theory. Ed: J.W. Gray. Contemporary 
Math., V. 30. AMS, 1984, vii + 307 pp, $28 (P). [ISBN: 0-8218-5032-6] Introduction and ten papers 
dealing with some aspect of category theory ranging from logic to differential geometry to topos 
theory. Separate sets of references. JC 
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Combinatorics, T(17: 1), S, P, L. The Theory of Uniform Distribution. Edmund Hlawka. Transl: Henry 
Orde. AB Academic, 1984, x + 141 pp. [ISBN: 0-907360-02-5] An appealing introduction to the subject 
in three parts. The first gives the general foundations, the second introduces the abstract theory, 
and the third applications to numerical analysis, analysis and number theory. Originally published 
in German in 1979. Good list of references. No exercises. CEC 


Number Theory, T*(14-15: 1, 2), S, P, L*. Topics from the Theory of Numbers, Second Edition. Emil 
Grosswald. Birkhauser Boston, 1984, xv + 333 pp, $29.95. [ISBN: 3-7643-3044-9] The first half of 
this book is an outstanding introduction to elementary number theory which is essentially the same 
as the First Edition (TR, May 1976; Extended Review, May 1968). The second half consists of topics 
from analytic and algebraic number theory and has been thoroughly redone for this new edition. 
Interesting selection of topics and good exercises. CEC 


Number Theory, S(17), P*, L. Lecture Notes on Primality Testing and Factoring: A Short Course at 
Kent State University. Carl Pomerance. MAA, 1984, 34 pp, $3.50 (P). [ISBN: 0-88385-054-0] A brief 
monograph which is a superb introduction to state-of-the-art factoring and primality testing algo- 


rithms. Includes a list of references. CEC 


Linear Algebra, T(16-17: 2), S. Lineare Algebra und Geometrie. W. Klingenberg. Hochschultext. 
Springer-Verlag, 1984, xi + 313 pp, $12.60 (P). [ISBN: 0-387-13427-1] A fairly elementary but 
comprehensive introduction to linear algebra over an arbitrary commutative field, together with a 
development of the elements of affine, Euclidean, projective and non-Euclidean geometry based on 
linear algebra. No exercises. JD-B 


Linear Algebra, T(13-15: 1), S, L. Linear Algebra, Second Edition. Larry Smith. Undergrad. Texts 
in Math. Springer-Verlag, 1984, vii + 362 pp, $29.95. [ISBN: 0-387-96015-5] The most significant 
change from the First Edition (TR, December 1978) is the inclusion of a new chapter on the Jordan 
canonical form. JS 


Linear Algebra, T(14-17: 1), L. Linear Algebra: An Introductory Approach. Charles W. Curtis. 
Undergrad. Texts in Math. Springer-Verlag, 1984, x + 337 pp, $24. [ISBN: 0-387-90992-3] A Fourth 
Edition with no major changes from the Third Edition published by Allyn and Bacon. (1968 Second 
Edition, TR, October 1969; Extended Review, November 1972; 1974 Third Edition, TR, October 1974.) 
JS 


Group Theory, T(17-18: 1, 2), S. Lie Groups, Lie Algebras, and Their Representations. V.S. Vara- 
darajan. Grad. Texts in Math., No. 102. Springer-Verlag, 1984, xiii + 430 pp, $29.80. [ISBN: 0- 
387-90969-9] Relatively self-contained treatment of the algebraic and analytic aspects of Lie group 
theory, presuming only some knowledge of differentiable manifolds and topological groups. Cul- 
minates in detailed development of the structure and representation theory of semisimple Lie groups 
and Lie algebras with both Harish-Chandra’s algebraic methods and Weyl’s transcendental methods via 
compact groups. Many nice exercises develop aspects of the theory not covered in the text. RM 


Group Theory, S, P*, L. The Classification of Finite Simple Groups: Volume 1: Groups of Noncharac- 
teristic 2 Type. Daniel Gorenstein. Univ. Ser. in Math. Plenum Pr, 1983, x + 487 pp, $59.50. 
[ISBN: 0-306-41305-1] A comprehensive outline of the classification of simple groups of noncharac- 
teristic 2 type. Gorenstein is a guide and companion along the way, providing direction, overview, 
intuition, and purpose. The mathematical community is indeed fortunate to receive this invaluable 
contribution to the literature, LCL 


Group Theory, T(16-18: 1, 2). An Introduction to the Theory of Groups, Third Edition. Joseph J. 
Rotman. Allyn & Bacon, 1984, x + 422 pp, $40.71. [ISBN: 0-205-07963-6] An updated and reorganized 
edition that represents some significant new material and changes in older material. New material 
is found on permutations, Mathieu groups, automorphisms of 86, the fundamental theorem of projective 
geometry and much more. The treatment of amalgams and HNN extensions is now based on pushouts and 
the Seifert-van Kampen theorem. (Second Edition, TR, March 1974.) JAS 


Group Theory, P. FC-groups. M.J. Tomkinson. Research Notes in Math., No. 96. Pitman, 1984, 171 
pp, $19.95 (PP). [ISBN: 0-273-08566-2] A group G is an FC-group if every conjugacy class is finite. 
This volume gives an account of results of Hall and Goréakov about embedability of certain classes 
of FC-groups into products of finite groups, and surveys advanced techniques. Twenty-five unsolved 
questions; sizable bibliography. RB 


Algebra, P. Classgroups of Group Rings. Martin Taylor. London Math. Soc. Lect. Note Ser., No. 91. 
Cambridge U Pr, 1984, xiii + 119 pp, $17.95 (P). [ISBN: 0-521-27870-8] A number of new approaches 
and techniques for investigating modules which are locally free over an integral group ring, and the 
consequent problem of determining whether or not the module is globally free. LCL 


Algebra, P. Characters of Reductive Groups Over a Finite Field. George Lusztig. Princeton U Pr, 
1984, xxi + 384 pp, $50; $19.50 (P). [ISBN: 0-691-08350-9] The main result of this work is an expli- 
cit formula for the multiplicities with which the various irreducible representations of G(Fq) 

appear in the virtual representation of G(Fq) where G is a reductive algebraic group with connected 


center defined over a finite field, Fq. Extensive bibliography. CEC 


Algebra, S(17), P. The Umbral Calculus. Steven Roman. Pure & Appl. Math., V. Jl11. Academic Pr, 
1984, x + 193 pp, $35. [ISBN: 0-12-594380-6] Elementary introduction to modern umbral calculus with 
some applications and a short chapter on nonclassical umbral calculi. Clear, friendly writing style 
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provides excellent overview of subject as it is developed. No problems. BH 


Algebra, $(17), P. Stone Spaces. Peter T. Johnstone. Stud. in Adv. Math., No. 3. Cambridge U Pr, 
1982, xxi + 370 pp, $59.50. [ISBN: 0-521-23893-5] Presents Stone’s representation theorem for 
Boolean algebras and some of its consequences in topology, ring theory, sheaf theory, and lattice 
theory. Includes an introduction with a description of the historical development of the theorem 
and its consequences. Historical notes at the end of each chapter and an extensive bibliography. 
BH 


Algebra, S(18), P. Relative Finiteness in Module Theory. Toma Albu, Constantin NXstasescu. Pure & 


Algebra, P. FPF Ring Theory: Faithful Modules and Generators of mod-R. Carl Faith, Stanley Page. 
London Math. Soc. Lect. Note Ser., No. 88. Cambridge U Pr, 1984, 168 pp, $19.95 (P). [ISBN: 0-521- 
27738-8] Based on lecture notes by Faith, it is a systematic study of the known results concerning 
finitely pseudo-Frobenius (FPF) rings. Essentially self-contained; basics lead to discussion of 
semi-perfect, semi~prime, non-singular, Goldie, and self-injective FPF rings. Open questions; 
bibliography; index. JS 


Calculus, T(13: 1). A Short Calculus: An Applied Approach, Fourth Edition. Daniel Saltz. Scott, 
Foresman, 1985, 616 pp, $26.95. [ISBN: 0-673-16625-2] Contains a good general overview of algebra. 
Many straightforward examples and applications in business, economics, and life, physical, and 

social sciences. Very broad-based text without much background material. Stresses recognition of 
many types of notation within each topic. (First Edition, TR, June-July 1973; Second Edition, TR, 
October 1977; Third Edition, TR, February 1981.) TR 


Calculus, T*(14-16: 1, 2), S*, L*. Multivariable Analysis. G. Baley Price. Springer-Verlag, 1984, 
xiv + 655 pp, $39. [ISBN: 0-387-90934-6] A new approach (triggered many years ago by a penetrating 
question by a bright student in a course in analytic geometry to the theory of differentiation for 
functions which map sets in Rn into Rm. Reduces to the standard theory in the one-dimensional case. 
Includes Riemann integration and related topics in analysis. Methods are based on oriented simpli- 
cial complexes and determinants. Numerous examples and exercises. Recommmended for a different 
kind of course in multivariable analysis. Schools with honors courses should have a look. For the 
student "with a first course in calculus and the ability to read and understand mathematical defini- 
tions, theorems and proofs." But beware: even the author cautions "it is not always easy." JK 


Calculus, S(13). Problem Book for First Year Calculus. George W. Bluman. Problem Books in Math. 
Springer-Verlag, 1984, xv + 385 pp, $39. [ISBN: 0-387-90920-6] What should the speed limit be to 
maximize traffic flow during rush hour? 1000 problems from single variable calculus, emphasizing 
applications but covering as well standard techniques and theory. Each section opens with a brief 
synopsis of terms, followed first by solved problems and then by supplementary problems for which 


only answers are provided. Problems are drawn from university exams throughout Canada. LAS 


Real Analysis, S(18), P. Intertwining Functions on Compact Lie Groups. B. Hoogenboom. CWI Tract 
No. 5. Math Centrum, 1984, 86 pp, Dfl. 13,20 (P). [ISBN: 90-6196-271-4] Based on work by 
Koornwinder, James, Constantine, and Vretare, this book explores some of the relationships between 
orthogonal polynomials and intertwining functions on a compact Lie group. References, index. JS 


Complex Analysis, S(17), P*. The Cauchy Method of Residues: Theory and Applications. Dragoslav S. 
Mitrinovié, Jovan D. Ketkié. Math. & Its Applic. D Reidel, 1984, xiv + 361 pp, $67. [ISBN: 90- 
277-1623-4] Devoted entirely to residues in one complex variable, but briefly mentions several com- 
plex variables. Contains a wealth of applications to ordinary differential equations, partial dif- 
ferential equations, finite differences, special functions, and an impressive collection of contour 
integrals. Many bibliographic items, but few are recent. Pleasant biography of Cauchy’s life and 
work. No exercises, but great supplementary reading or reference. YN 


Complex Analysis, T(17: 1), P. Funktionentheorie I. Reinhold Remmert. Grund. Math., No. 5. 
Springer-Verlag, 1984, xiii + 324 pp, $16.10 (P). [ISBN: 0-387-12782-8] A modern introduction to 
complex analysis, with many historical notes but almost no exercises. JD-B 


Differential Equations, T*(16-17), S, L. Introduction to Partial Differential Equations with Appli- 
cations. Mark A. Pinsky. McGraw-Hill, 1984, x + 326 pp, $32.95. [ISBN: 0-07-050117-3] For students 
who have had a first course in ordinary differential equations. Stress on systematic solution algo- 
rithms based on separation of variables and Fourier series and integrals. Includes classical tech- 
niques, plus chapters on asymptotic solutions and numerical solutions. Numerous worked examples, 
good exercises, physical motivation and applications. LCL 


Differential Equations, P. Lectures on Stochastic Differential Equations and Malliavin Calculus. 5S. 
Watanabe. Springer-Verlag, 1984, 111 pp, $7.30 (P). [ISBN: 0-387-12897-2] Mathematical disquisition 
on stochastic differential equations arising from parabolic differential operators. Concise intro- 
duction followed by one hundred pages of definitions, lemmas, theorems, and computations. YN 


Partial Differential Equations, T(17: 1), P. Lectures on Three-Dimensional Elasticity. P.G. Ciar- 
let. Springer~Verlag, 1983, 149 pp, $7.90 (P). In the first chapter a nonlinear system of partial 
differential equations is established as a mathematical model of elasticity. In the second (and 
last), questions of existence of solutions are studied. Tools are the implicit function theorem and 
the theory of J.M. Ball. Some exercises. References. Some open problems. JK 
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Numerical Analysis, T?(17), S. Numerical Solution of Differential Equations, Second Edition. M.K. 
Jain, Halsted Pr, 1984, xix + 698 pp, $34.95. [ISBN: 0-470-27389-5] Encyclopedia of numerical 
methods for approximating solutions to differential equations. Presumes some knowledge of advanced 
calculus and elementary numerical analysis. Added problems and enlarged chapter on the finite ele- 
ment method distinguish text from its First Edition (IR, October 1980). Large number of problems 
with solutions in back. Although text is rather dry, the bibliographic references after each 
chapter and extensive bibliography make it a useful reference. BH 


Functional Analysis, T(18: 1, 2), S, P. Applied Nonlinear Analysis. Jean-Pierre Aubin, Ivar Eke- 
land. Wiley, 1984, xi + 518 pp, $47.50. [ISBN: 0-471-05998-6] Topics in nonlinear functional 
analysis-~solving nonlinear equations and inclusion relations, variational problems, non-smooth 
analysis--are studied together with applications, e.g., to game theory, mathematical economics, con- 
vex optimization. Authors assume only the most elementary functional analysis; they prove many 
classica] results, especially of convex analysis. Informal, readable--directed at variety of consu- 
mers of mathematics. PZ 


Functional Analysis, P. The Analogue of the Group Algebra for Topological Semigroups. H.A.M. Dzino- 
tyiweyi. Research Notes in Math., No. 98. Pitman, 1984, 196 pp, $19.95 (P). [ISBN: 0-273~-08610-3] 
"A great deal of abstract harmonic analysis on a locally compact topological group is pivoted at the 
object L1(G)--the so-called group algebra of G. The purpose of this book is to present analogues of 
L1(G) for topological semigroups..." For researchers and students in the specialty, and possibly 
theoretical probabilists, functional analysts. RB 


Functional Analysis, T(18: 1), S, P. Harmonic Analysis on Semigroups: Theory of Positive Definite 
and Related Functions. Christian Berg, Jens Peter Reus Christensen, Paul Ressel. Grad. Texts in 
Math., No. 100. Springer-Verlag, 1984, x + 289 pp, $39, [ISBN: 0-387-90925-7] Centered on the gen- 
eral theme of positive definite functions defined as Abelian semigroups with involution. Treatment 
is largely self-contained, proceeding through preliminary measure theory to such topics as 
Schoenberg”s theorem, moment functions, and Hoeffding’s inequality. Notes and remarks; bibliogra- 
phy; index. JS 

Functional Analysis, P. Spaces of Measures. Corneliu Constantinescu. Stud. in Math., No. 4. 
Walter de Gruyter, 1984, 444 pp, DM 128. [ISBN: 3-11-008784-7] An ‘attempt to present a systematic 
approach to the theory of spaces of measures which has grown out of initial work by Nikodym, Orlicz, 
Pettis, Vitali, Hahn, Saks, and Phillips. Applications to locally convex vector lattices. Refer- 
ences, index, JS 


Analysis, P. Operator Commutation Relations. Palle E.T. Jdrgensen, Robert T. Moore. Math. & Its 
Applic. D Reidel, 1984, xviii + 493 pp, $69. [ISBN: 90-277-1710-9] The authors use certain opera- 
tor commutation relations to unify and extend recent results in the theory of Lie algebras. In par- 
ticular, they study families of noncommuting unbounded operators on Banach spaces by techniques that 
generalize the classical Lie theory of matrices and Lie transformation groups. LCL 
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David Mumford. Birkhauser Boston, 1984, xiv + 272 pp, $22. [ISBN: 0-8176-3110-0] The second in a 
series of three volumes which survey the theory of theta functions. Presents an explicit elementary 
construction of hyperelliptic Jacobian varieties and a self-contained introduction to the theory of 
Jacobians. CEC 


Algebraic Geometry, P. Topics in Transcendental Algebraic Geometry. Ed: Phillip Griffiths. Annals 
of Math. Stud., No. 106. Princeton U Pr, 1984, viii + 316 pp, $45; $12.50 (P). [ISBN: 0-691-08335- 
5; 0-691-08339-8] The proceedings of a seminar (1981-82, Institute for Advanced Study) to explore 
ways formal Hodge theory might be applied in algebraic geometry. Seventeen lectures aim to give an 
intuitive picture of what is known and of some of the available techniques. LCL 


Son, 1983, viii + 276 pp, $35. [ISBN: 9971-950-98-7] 26 research papers in differential geometry, 
reprinted from the Journal of Differential Geometry and the Chinese Annals of Mathematics, on the 
occasion of the 80th birthday of Su Buchin, President of Fudan University in Shanghai. LAS 


Differential Geometry, P. Mathematical Essays in Honor of Su Buchin. Ed: C.C. Hsiung. Heyden & 


Geometry, S(13), P, L*. Orderly Tangles: Cloverleafs, Gordian Knots, and Regular Polylinks. Alan 
Holden. Columbia U Pr, 1983, x + 97 pp, $19.95. [ISBN: 0-231-05544-7] A descriptive work which 
includes photographs of remarkable models of various highway interchanges, knots, cat’s cradles, 
chained polylinks and regular polylinks. Informative and aesthetically pleasing. CEC 


Geometry, S*(13-16), L**. Geometry in Architecture. William Blackwell. Wiley, 1984, xii + 185 pp, 
$34.95. [ISBN: 0-471-09683-0] Reflects the author’s "lifelong fascination with the endless relation- 
ships of geometric forms and the continual discovery of their applications to architecture and 

design." No theorems but lots of illustrations. Excellent drawings and magnificant (B & W) photo- 
graphs with cogent comments. The reader is led from regular polygons in the plane to the classical 
solids in three dimensions, A_ book to be savored by all. No special knowledge of geometry is 

required, Read--or just browse--and enjoy this delightful book. JK 


ed 


tente RYume. Peter J. Hilton. Springer-Verlag, 1984, v + 219 pp, $11 (P). [ISBN: 0-387-12910-3] 


1985 Genter Section Cl7 


Algebraic Topology, P. Topological Topics. I.M. James. London Math. Soc. Lect. Note Ser., No. 86. 
Cambridge U Pr, 1983, 184 pp, $21.95 (P). [ISBN: 0-521-27581-4] “Articles on algebra and topology 
presented to P.J. Hilton in celebration of his sixtieth birthday." Includes a list of publications, 
two survey articles, and a number of papers dedicated to Hilton. JAS 


Algebraic Topology, P. Lecture Notes in Mathematics-1069: Bordism of Diffeomorphisms and Related 
Topics. Matthias Kreck. Springer-Verlag, 1984, 144 pp, $7.50 (P). [ISBN: 0-387-13362-3] The bor- 
dism relation is defined on the class of all diffeomorphisms of all manifolds of fixed dimension mm, 
and is based on the notion of bordism of m-manifolds. Bordism groups of diffeomorphisms, whose ele- 
ments are bordism classes, are computed and applications are given in this research monograph. RB 


Statistics, 8(17). Statistical Methods and the Improvement of Data Quality. Ed: Tommy Wright. 
Academic Pr, 1983, xvii + 357 pp, $25. [ISBN: 0-12~765480-1] A set of papers presented at the Small 
Conference on the Improvement of the Quality of Data in November, 1982 at Oak Ridge, Tennessee. 
Most of the papers deal with large data collection systems and are concerned with survey sampling 
topics. MT 


Statistics, T*(15: 2). Elementary Statistics for Business and Economics. Cari-Louis Sandblom. 
Walter de Gruyter, 1983, 352 pp, $21.80, [ISBN: 3~11~008302-7] This text assumes knowledge of dif- 
ferential and integral calculus and is intended for basic statistics courses in business administra- 
tion and economics, It is a relatively brief text but its topics still include summary statistics, 
probability, random variables, sampling, estimation and hypothesis testing, analysis of variance, 
experimental design, regression, and time series. MT 


Statistics, P. Percentage Points of the Distribution of the Correlation Coefficient. Dolun Uksoy, 
Leo A. Aroian. Inst. of Admin. & Manag. (Union Coll. & Univ., Schenectady, NY 12308), 1982, vi + 
128 pp, (P). Tables of the percentage points of the distribution of r for sample sizes from three 
to 500 and population correlations from zero to one. MT 


Statistics, P. Tables of the Distribution of the Correlation Coefficient. Dolun Oksoy, Leo A. 

Aroian. Inst. of Admin. & Manag. (Union Coll. & Univ., Schenectady, N¥ 12308), 1982, xxxiii + 132 
pp, (P). Tables of the cumtlative distribution function for r, the Pearson product moment coeffi- 
cient of correlation, for sample sizes varying from three to 500 and for population correlations 


varying from zero to one. MT 


Statistics, P. Probability, Statistics and Analysis. Ed: J.F.C. Kingman, G.E.H. Reuter. London 
Math. Soc. Lect. Note Ser., No. 79. Cambridge U Pr, 1983, viii + 286 pp, $29.95 (P). [ISBN: 0-521- 
28590-9] A collection of research papers dedicated to David Kendall on his 65th birthday and  span- 
ning his interests in mathematical statistics, probability, and analysis. LCL 


Statistics, S*(15-18), P*, L. S: An Interactive Environment for Data Analysis and Graphics. Richard 
A. Becker, John M. Chambers. Stat./Prob. Ser. Wadsworth, 1984, xii + 550 pp, $23.95 (P). [ISBN: 
0~-534-03313-X] An appealing introduction to Bell Labs” UNIX-based statistical package S, an interac- 
tive system emphasizing graphical and exploratory methods. Simple one- or two-line commands yield 
dramatic visualizations of data, including scatterplots, starplots, boxplots, grid plots, cluster 
plots, and contour plots. Half of the book illustrates the use of S with numerous real data sets; 


the other half provides reference details on al] S commands. LAS 


Statistics, P. Statistics and Probability. Ed: J. Mogyorédi, I. Vineze, W. Wertz. D Reidel, 1984, 
x + 415 pp, $45.75. [ISBN: 90-277~-1675-7] Proceedings of the third Pannonian Symposium on Mathemati- 
cal Statistics held at Visegrad, Hungary, September 13-18, 1982. JAS 


Statistics, P*, L. Statistical Computation. J.H. Maindonald. Wiley, 1984, xviii + 370 pp, $39.95. 
[ISBN: 0-471-86452-8] This book encourages the use of statistical software based on modern algo- 
rithms developed by numerical analysts. Many of these algorithms are discussed. About two-thirds 
of the book is devoted to multiple linear regression and closely related topics. AQ 


Statistics, $(16), P. Analyse des Données. Francis Cailliez. Pr U Montreal, 1984, 103 pp, $12 (P). 
[ISBN: 2-7606-0657-0] Author’s lecture notes from 1982 NATO seminar; in French. Interprets concepts 
of multivariate data analysis (mean, correlation, factor...) in terms of linear algebra (projection, 
angle, eigenvector...). The main novelty lies in using the inner product of linear operators in 
this context. (However, similar results appeared first in M.L. Eaton, Multivariate Statistics, 
Wiley, 1983. TR, August-September 1984,) Concise, well written. YN 


Computer Literacy, 8(13-15), L*. The Computer Comes of Age: The People, the Hardware, and the 
Software. R. Moreau. Transl: J. Howlett. Ser. in History of Comp. MIT Pr, 1984, x + 227 pp, 
$19.95. [ISBN: 0-262-13194-3] A concise history of computing used as a vehicle for explaining the 
nature of modern computing. Focuses particularly on the development of mainframe machines and 
languages, up to the late 60°s; does not get into the microcomputer age. LAS 


Computer Literacy, $(13-15), L. Are Computers Alive? Evolution and New Life Forms. Geoff Simons. 
Birkhauser Boston, 1983, xi + 212 pp, $14.95 (P). [ISBN: 0-8176-3144-5] A fascinating, persuasive, 
well-researched examination of the birth of species machina sapiens. Beginning with issues in the 
definition of life (biochemistry vs. information theory), Simons examines behavioral, anatomical, 
and psychological features of computers that, in his view, support a hypothesis that computers are 
alive. LAS 
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Computer Literacy, $(13). The 3-D Animated Apple. Phil Cohen. Prentice-Hall, 1983, viii + 198 pp, 
$18.95 (P). [ISBN: 0-13-920224-2] Text is out of step with current technology. Intended as an 

inspiration-—-but those who might be interested will not be challenged by the information presented. 
Centered around two general programs--one 2-D graphics and one 3~D graphics program; but contains 
other related examples. TR 


Computer Literacy, P*. The Art of Computer Conversation: A New Medium for Communication. Brian R. 
Gaines, Mildred L.G. Shaw. Prentice-Hall, 1984, x + 214 pp, $16.95 (P). [ISBN: 0-13-047332-4] 
Advice and examples (both good and bad) of human-machine dialogue, organized into 10 chapters and 30 


proverbs (e.g., "Be consistent"). Much of the advice is routine, albeit too often ignored. LAS 


Computer Programming, P*, L. The American Pascal Standard with Annotations. Henry lLedgard. 
Springer-Verlag, 1984, 97 pp, $16.95 (P). [ISBN: 0-387-91248-7] A two-column presentation of the 
ANSI standard for Pascal, with commentary and examples paralleling the formal statement of the stan- 
dard. Employs several type fonts and well-conceived layout "to help reduce the mental effort needed 


to understand the standard." LAS 


Computer Programming, T. Learning BASIC Programming Essentials. Carl Grame, Dan O”Donnell. SRA, 
1984, xiv + 282 pp, $16.95 (P). [ISBN: 0-574-21370-8] A self-paced text which assumes the essentials 
of elementary algebra. Designed as system independent-—-should not be used with a micro-based sys- 
tem. Has many consumer applications. Provides a list of possible errors and corrections. TR 


Computer Programming, T*(13: 1). Programming in Pascal, Second Edition. Peter Grogono. Addison- 
Wesley, 1984, xi + 420 pp, $21.95 (P). [ISBN: 0-201-12070-4] Primarily a language manual. Covers 
the entire language systematically. Numerous examples of complete programs are given. The basic 
structure and tone are the same as in the First Edition (IR, February 1979). AO 


Computer Programming, $(13-15), L. Programming the IBM Personal Computer: Assembly Language. Chao 
C. Chien. Holt, Rinehart & Winston, 1984, xiv + 299 pp, $18.45 (PB). [ISBN: 0-03-070442-1] For pick- 
ing up IBM PC assembly on one’s own. Presumes no programming experience and only rudiments of PC 
operation (e.g., copying a diskette). Use of text editor, internal computer organization, mention 
of top-down approach and structured programming. Library of useful low-level subroutines. A good 
choice for self study. RB 


Computer Programming, P. Lecture Notes in Computer Science-167: International Symposium on Program- 
ming. Ed: M. Paul, B. Robinet. Springer-Verlag, 1984, vi + 262 pp, $13.50 (P). [ISBN: 0-387- 
12925-1] Twenty-two papers in French and English, the proceedings of the sixth biennial symposium 
held in Toulouse in April, 1984. Topics include concurrency and optimization, e.g., the vectoriza-~ 
tion language Vesta for high performance scientific computation; data network protocols, e.g., 
application of temporal logic to design of protocols; miscellaneous topics. RB 


Computer Programming, T(14-15: 1), S, P, L. Using the IBM Personal Computer: Organization and Assem- 
bly Language Programming. Mark A. Franklin. Holt, Rinehart & Winston, 1984, xv + 357 pp, $20.45 
(P). [ISBN: 0-03-062862-8] Comprehensive introduction to assembly on the IBM PC: nearly 30% of text 
devoted to computer organization, architecture, etc., before assembly language instructions. 
Presumes experience with algorithm formation, coding in high level languages. Advanced topics; 
bibliography; numerous exercises. Somewhat technical for relatively unsophisticated students; good 
choice for the experienced programmer. RB 


Software Systems, $7(17). Program Transformation and Programming Environments. Ed: Peter Pepper. 
NATO ASI Ser. F, V. 8. Springer-Verlag, 1984, xiv + 378 pp, $39.50. [ISBN: 0-387-12932-4] Includes 
discussions and position papers from an Advanced Research Workshop of the NATO Science Committee. 
Deals with formal methods in software engineering and the application and acceptance of these 
methods. FA 


Software Systems, S, L. MS-DOS User’s Guide. Paul Hoffman, Tamara Nicoloff. Osborne/McGraw-Hill, 
1984, viii + 312 pp, $17.95 (P). [ISBN: 0-88134-131-2] An introduction to versions 1 and 2 of 
Microsoft’s Operating System MS-DOS, the generic version of IBM’s PC-DOS. Explains how to use each 
feature, including the UNIX-like pipe, redirect, and directory features of Version 2, but does not 
provide any details of interest to those writing programs that interact with the operating system. 


LAS 


Computer Science, T(13-14: 1). Structure and Interpretation of Computer Programs. Harold Abelson, 
Gerald Jay Sussman, Julie Sussman. Elect. Engin. & Comp. Sci. Ser. MIT Pr, 1985, xx + 542 pp, 
$30. [ISBN: 0-262-01077-1] This is an introductory computer science text which approaches its sub- 
ject matter in a very different, and quite sophisticated, manner. It treats programming and 
languages as symbol manipulation systems and concentrates almost exclusively on problemm solving and 
high level abstraction mechanisms. The language that is used to illustrate examples is LISP--a 
functional language used frequently in artificial intelligence. This is an introductory text 
appropriate to only the most capable and mathematically sophisticated student. MS 


Computer Science, P. Analysis and Simulation of Semiconductor Devices. Siegfried Selberherr. 
Springer-Verlag, 1984, xiv + 294 pp, $54. [ISBN: 0-387-81800-6] The book discusses techniques for 
constructing formal models of semiconductor devices--~i.e,, chips, circuit boards, and microproces- 
sors. Using these models the designer can then study the behavior and stability of that device. 
The models discussed are mathematical simulation models. The author shows the development of the 
model and also gives techniques for solving the resulting equations. The mathematics is quite high 
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level and requires an understanding of calculus and linear algebra. The final chapter presents some 
results of the model when used to study one class of semiconductors called MOSFETs. MS 


Computer Science, $(17-18). Lecture Notes in Computer Science-173: Semantics of Data Types. Ed: G. 
Kahn, D.B. MacQueen, G. Plotkin. Springer-Verlag, 1984, vi + 391 pp, $16.50 (P). [ISBN: 0-387- 
13346-1] Includes selected papers from the International Symposium on the Semantics of Data Types. 
Discusses a variety of data typing notations and mechanisms from the viewpoint of language designers 


as well as of pure and applied logicians. FA 


Computer Science, $(14-15). On-Line Systems Design and Implementation (Using COBOL and Command Level 
CICS). Charles J. Kacmar. Reston, 1984, x + 404 pp, $26.95. [ISBN: 0-8359-5231-2] Most new appli- 
cations of computers are on-line~-that is, the user is connected directly to the computer. Because 
of this direct interaction between the computer and a (possibly inexperienced) user, on-line systems 
must be designed in a different way. Extreme care must be taken in the design of the user inter- 
face, the screen format, on-line assistance, error handling, and the use of graphics. This text 
addresses techniques for the most effective design of interactive, on-line systems. MS 


Computer Science, $(15-16). Portability and Style in Ada. Ed: John Nissen, Peter Wallis. Cambridge 
U Pr, 1984, xii + 202 pp, $24.95. [ISBN: 0-521-26482-0] Ada is a new system implementation: language 
which is a standard for all new computer systems developed by the Department of Defense. It is a 
very powerful language incorporating many new features not found in existing languages like Pascal, 
FORTRAN, or C. This text is a guide to using Ada in a correct and elegant way. It discusses how to 
make optimal use of such new features as packages, tasks, and data abstraction. MS 


Computer Science, $(15-16), P. The 8087 Primer. John F. Palmer, Stephen P. Morse. Wiley, 1984, 
viii + 182 pp, $16.95 (P). [ISBN: 0-471-87569-4] Two parts on generalities of data representation 
and the ideas of coprocessors, and on 8087 architecture, precede a third section on programming. A 
hybrid book, part computer science and part processor manual. JAS 


Computer Science, P. Programmable Assembly. Ed: Wilfred B. Heginbotham. Intern. Trends in Manufac~ 
turing Tech. Springer-Verlag, 1984, 349 pp, $43. [ISBN: 0-387-13479-4] Discusses the use of 
robots--i.e., mobile, flexible, and programmable devices--in production and control. It is a col- 
lection of about 30 papers which address a wide range of issues in the area of robotics, including 
robot design, applications, and economic and social concerns. The reader should have a_ working 
knowledge of both computer science and mechanical engineering to appreciate the relatively high- 
level discussions. MS 


Computer Science, L*, Handbook of Algorithms and Data Structures. G.H. Gonnet. Internat. Comp. 
Sci. Ser. Addison-Wesley, 1984, xi + 286 pp, $25.95 (P). [ISBN: 0-201-14218-X] A reference work on 
algorithms and data structures for their implementation. Covers searching, sorting, selection, and 
arithmetic algorithms, Theoretical complexity results and implementations in both C and Pascal are 
given for each algorithm of practical importance. AO 


Computer Science, P. Local Area Networks: Issues, Products, and Developments. V.E. Cheong, R.A. 
Hirschheim. Ser. in Comp. Wiley, 1983, xv + 190 pp, $29.95. [ISBN: 0-471-90134-2] A concise 
introduction to management issues, engineering concepts, and hardware options for LANs--networks 
that connect computers within distances of .1 to 10 km at transmission rates of .1 to 10 Mbps (mil- 
lion bits per second). Discusses various types (e.g., Ethernet, Cambridge Ring) as well as alterna- 
tives (e.g., private automatic branch exchange--PABX). LAS 


Computer Science, T(15-16), P. Microcomputer System Design: An Advanced Course. Ed: M.J. Flynn, 
N.R. Harris, D.P. McCarthy. Springer-Verlag, 1984, vii + 397 pp, $19.95 (P). (ISBN: 0-387-13545-6] 
This text represents the course notes from an advanced course in microcomputer design held at Trin- 
ity College in July, 1981. The course addresses current technology in LSI/FLSI design, hardware 
Structures, and language design. The course is a series of lectures by 8 professionals in the field 
of computer architecture and design. MS 


Computer Science, T*(14-15: 1), L**. Algorithms. Robert Sedgewick. Addison-Wesley, 1983, viii + 
551 pp, $31.95. [ISBN: 0-201-06672-6] A survey of some of the most important algorithms in computer 
science. Major sections cover mathematics algorithms, sorting, searching, string processing, 
geometric algorithms, and advanced topics. AO 


Computer Science, P. Natural Language Communication with Pictorial Information Systems. Ed: Leonard 
Bolc. Symbolic Computation. Springer-Verlag, 1984, vii + 327 pp, $31. [ISBN: 0-387-13478-6] 
Reports of projects concerned with common representation and understanding of language and pictorial 
information. The papers discuss pictorial query languages and databases, natural language dialogue 
systems for scene analysis, interactive systems concerned with weather reporting, and interactive 
digital picture analysis. RM 


Computer Science, T(18), P. Lecture Notes in Computer Science-166: STACS 84. Ed: M. Fontet, K. 
Mehlhorn. Springer-Verlag, 1984, vi + 338 pp, $15 (P). [ISBN: 0-387-12920-0] This monograph con- 
tains a collection of 27 papers presented at the Symposium on Theoretical Computer Science held in 
Paris, April, 1984. The papers address issues of current research in such theoretical areas as 
algorithmic analysis, Petri nets, formal languages, and information theory. MS 


Computer Science, P. Lecture Notes in Computer Science-172: Automata, Languages and Programming. 
Ed: Jan Paredaens. Springer-Verlag, 1984, viii + 527 pp, $22 (P). [ISBN: 0-387-13345-3] Proceedings 
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of ICALP 84 (1lth Colloquium on Automata, Languages, and Programming) covering the theoretical 
aspects of computer science. RM 


Computer Science, S*(15-17), L**. Computing and Assessing Programming Languages: Ada, C, Pascal. 
Ed: A. Feuer, N. Gehani. Prentice-Hall, 1984, 271 pp, $24.95. [ISBN: 0-13-154857-3] A collection 
of papers discussing the programming languages Ada, C, and Pascal. The papers are grouped into sec- 
tions containing comparisons, assessments, criticisms of these languages, and a final section on 
methodology for comparison and assessment of programming languages. AO 


Computer Science, T(17-18), P. Supercomputers and Parallel Computation. Ed: D.J. Paddon. Institute 
of Math. & Its Applic. Conf. Ser., No. 1. Clarendon Pr, 1984, x + 258 pp, $39. [ISBN: 0-19-~853601- 
1] This volume is based on the proceedings of a conference on parallel computing held at the Univer- 
sity of Bristol in September, 1982. The book contains 18 research papers on the topics of VLSI 
parallel architecture, theory of parallel computing, vector and array processors, and algorithmic 
design for parallel machines. MS& 


Computer Science, P*. Information, Uncertainty, Complexity. J.F. Traub, G.W. Wasilkowski, H. 
WoZniakowski. Addison-Wesley, 1983, ix + 176 pp, $37.95, [ISBN: 0-201-07890-2] A research monograph 
which aims to create a general mathematical theory for understanding and dealing with uncertainty 
(as in approximate solutions to problems). The generality and power of the theory stems from the 
central role of information. A variety of application areas is presented to illustrate major con- 
cepts, together with an analysis of optimal or near-optimal algorithms. LCL 


Control Theory, P. Optimal Control of Variational Inequalities. V. Barbu. Pitman, 1984, 298 pp, 
$23.95 (P). [ISBN: 0~273~08629-4] Monograph on first-order necessary conditions for optimality of 
(nonlinear) control problems governed by variational inequalities and semilinear equations of ellip- 
tic and parabolic types. Gives theoretical development of several simple problems (obstacle prob- 
Jem, Dirichlet problem). RM- 


Control Theory, T(17-18), 8, P. Time Series Analysis, Identification and Adaptive Filtering. Daniel 
Graupe. Robert E Krieger, 1984, xv + 386 pp, $34.50. [ISBN: 0-88275-713-X] This book aims to pro- 
vide tools for retrieving information in a time series when no prior knowledge of mathematical 
structure is available .(i.e., “adaptive filtering of d-priori unknown data from a-priori unknown 
noisy background”). Assumes ~-a background in probability and statistics, and linear control theory. 
Thirteen representative computer programs cover all major results of the text. LCL 


Applications, $(17), P. Operational Calculus, Second Edition, V. I. Jan Mikusifiski. Pergamon Pr, 
1983, 321 pp, $27. [ISBN: 0-98-025071-8] Introduction to operational calculus and some applications, 
including: electric circuits, statics of beams, vibrations of a string, heat equation, and equations 
of telegraphy. Essentially the same as the first three parts of the first edition with an added 
chapter on applications to chromatography. BH 


Applications (Data Processing}, T(15: 1). Construction of Data Processing Software. John Elder. 
Ser. in Computer Sci. Prentice-Hall, 1984, xiv + 432 pp, $22.95 (P). [ISBN: 0-13-168675-5] Intro- 
duces commercial data processing problems. Assumes students have taken an introductory, Pascal- 
based programming course. Examines important concepts, techniques and algorithms using an extended 
Pascal in the first part of the book; the second part describes and utilizes ANSI Cobol. FA 


Applications (Economics), T(17-18: 2), S, P. Advanced Econometric Methods. Thomas B. Fomby, R. Car- 
ter Hill, Stanley R. Johnson. Springer-Verlag, 1984, xix + 624 pp, $54. [ISBN: 0-387-90908-7] A 
comprehensive graduate-level introduction to its subject; one course in econometric methods is 
prerequisite, First section studies fundamental statistics of least-squares regression. Later sec- 
tions treat violation of basic assumptions, special topics in linear regression, simultaneous equa~ 
tion models, frontiers. Each chapter ends with summary, guide to further reading, and exercises. 
PZ 


Applications (Economics), P. Capital Flows and Exchange Rate Determination. Ed: Lawrence R. Klein, 
Wilhelm E. Krelle. Zeitschrift flr National¥konomie Journal of Economics, Supp. 3. Springer- 
Verlag, 1983, ix + 220 pp, $41.10 (P). [ISBN: 0-387-81770-0] Seven papers presented at a September 
1982 LINK meeting in Frankfurt, West Germany, published as a supplement to the Journal of Economics. 
Papers are equally divided between two-country and multi-country models. LAS 


Applications (Economics), T*(18: 1), S*, P. Theory of Correspondences: Including Applications to 
Mathematical Economics. Erwin Klein, Anthony C. Thompson. Canadian Math. Soc. Ser. of Mono, & 
Adv. Texts. Wiley, 1984, xi + 256 pp, $39.95. [ISBN: 0-471-88016-7] Correspondences are graphs of 
relations; they model consumers” preferences. This monograph studies topologies on spaces of rela- 
tions and spaces of subsets. Generalizes to correspondences the classical selection (E. Michael’s) 
and fixed-point theorems (Brouwer’s, Kakutani’s). Introduces measure spaces to model economies; 
generalizes demand curves and Walras equilibria. Fun exercises in topology, measure theory, convex 
and functional analyses. Extremely well written. YN 


Applications (Engineering), P. Mathematical Models for the Analysis and Optimization of Elastoplas-~ 
tic Structures. A.A. Cyras. Transl: L.W. Longdon. Ser. in Civil Engin. Halsted Pr, 1983, 121 pp, 
$39.95. [ISBN: 0~-470-20020-0] Discrete models for predicting the behavior of a structure (deforma- 
tion, etc.) at different stages of loading. LCL 
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Mechanics. James F. Bell. Springer-Verlag, 1984, xii + 813 pp, $36 (P). [ISBN: 0-387-13160-4] This 
book originally appeared in hardcover as Volume Vlia/1l of Encyclopedia of Physics. JK 


Applications (Information Theory), P. Transactions of the Ninth Prague Conference on Information 
Theory, Statistical Decision Functions, Random Processes. D Reidel, 1983, $50 each. Volume A, 332 


pp [ISBN: 90-277-1499-1]; Volume B, 310 pp. [ISBN: 90-277-1500-9] Invited and contributed papers 
from a June 1982 conference in Prague. LAS 


Applications (Modelling), L. Models of Reality: Shaping Thought and Action. Ed: Jacques Richardson. 
Lomond Books, 1984, xiv + 328 pp, $22.95. [ISBN: 0-912338-35-0] Essays on models as tools for 
problem-solving; cross-disciplinary. Many essays first appeared in the UNESCO journal Impact of 
Science on Society, Volume 31, No. 4. JIRG 


Applications (Modelling), P. Mathematical Modelling in Science and Technology. Ed: Kavier J.R. 
Avula, etal. Pergamon Pr, 1984, xvi + 1006 pp, $150. [ISBN: 0-08--030156~-8] Typescript proceedings 
of the Fourth International Conference on Mathematical Modelling held at E.T.H. in Zurich, August 
15-17, 1983. Seven invited lectures plus 150 papers cover a very wide range of contemporary model- 
ling areas. LAS 


Applications (Physics), S(18), P,. Renormalization. Edward B. Manoukian. Pure & Appl. Math., V. 
106, Academic Pr, 1983, xi + 204 pp, $37. [ISBN: 0-12-469450-0] First, proves the absolute conver~ 
gence of Feynman integrals; uses tempered distributions and invokes Hironaka’s results on resolution 
of singularities. Second, describes and proves a graph-theoretic algorithm to renormalize Feynman 
integrands, A mathematics book for physicists, not a physics book for mathematicians: ever says 
how this mathematics is relevant to physics (see e.g., pp. 1-2 in Hwa and Toeplitz, Homology and 
Feynman Integrals, Benjamin, 1966). YN 


Applications (Physics), P. Introduction to Quantum Statistical Mechanics. N.N. Bogolubov, N.N. 
Bogolubov, Jr. Transl: V.P. Gupta. World Scientific (US Dist: Heyden & Sons), 1982, viii + 299 pp, 
$19 (P); $34, [ISBN: 9971~-950-31-6; 9971-950-04-9] In two sections: the first, described by the 
book”’s title, studies Hamiltonian systems in classical and quantum mechanics, thermodynamics. The 
second section is devoted to a new presentation of the second quantization method. Translated edi- 
tion of a special lecture series delivered at the physics department of Moscow State University. PZ 


Applications (Physics), P. Compressible Fluid Flow and Systems of Conservation Laws in Several Space 
Variables. A. Majda. Appl. Math. Sci., V. 53. Springer-Verlag, 1984, viii + 159 pp, $16.80 (P). 
[ISBN: 0-387-96037-6] From the book cover: "...systematic rigorous discussion of quasilinear hyper- 
bolic systems in several space variables...emphasis on the special structure of the hyperbolic con- 
servation laws of fluid dynamics and the interplay between theoretical results and these concrete 
equations.” From material presented in topics courses and lectures at the University of California, 
Berkeley. JK 


Applications (Physics), T*(17-18; 1, 2), P. General Relativity. Robert M. Wald. Uof Chicago Pr, 
1984, xiii + 491 pp, $30 (P); $50. [ISBN: 0-226-87033-2; 0-226-87032-4] Textbook on general rela- 
tivity for graduate students from a modern, geometrical viewpoint with some recent advances = and 
developments. Flexible. Suitable for one-term introductory course or for a full-year course. JK 
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Oklahoma~Arkansas Section 


The meeting of the Oklahoma-Arkansas Section was held on March 30-31, 1984 at Arkansas Tech 
University. The attendance was approximately 125 people. 
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Invited Addresses: 


"Another Approach to Riemann-Stieltjes Integrals," by Kenneth A. Ross, University of Oregon. 
"Transitions," by Jeanne Agnew, Oklahoma State University. 


Short Presentations: 


"Approximation of Periodic Functions by Trigonometric Sums," by Yuly Makovoz, Oklahoma State 
University. N 

"Small Factors of 2° + 1 for N > 1200," by Dale Woods, Central State University. 

"Approximation of Abstract Analytic Functions on Intervals," by H.G. Burchard, Oklahoma State 
University. 

"Quasi Circular Ground Track of Geosynchronous Satellites," by Philip C. Almes, East Central State 
University. 

"On Semi-distributive -heredity Rings," by Carlos E. Caballero, Oklahoma State University. 

"Ordered Sets Which Support Inverse Semigroups: Three Examples," by Lide Li, University of Arkan- 
sas. 

"Isomorphisms of Incidence Rings," by Joel K. Haack, Oklahoma State University. 

"Multiplicative Magic Squares," by Michelle Y. Penner, Oklahoma State University. 

"Another Approach to Riemann-Stieltjes Integrals," by Kenneth A. Ross, University of Oregon. 

"First and Second Moments of the Largest Part of a Partition," by Grey Williams, Hendrix College. 

"Actuarial Problems," by Kathy Wyers, Arkansas Tech University. 

"Tangents to the Parabola," by David Patocka, Oklahoma State University. 

"Identities in Lie Rings and Lie Pseudo~rings," by Diane W. Corckett, Hendrix College. 

“Maximizing Mathematics (Learning and Development) Through Review," by Sam Hartzler, Oklahoma City 
Public Schools. 

"Using Calculus to Solve a Familiar Discrete Problem," by Jerry Johnson, Oklahoma State Univer- 
sity. 

"Teaching Experiential Applied Mathematics--TEAM," by James Choike and John Jobe, Oklahoma State 
University. 

"Teaching Remedial College Algebra Using the Saxon Textbook," by Beverly Reed, Siloam Springs Pub- 
lic Schools. . 

"Transitions," by Jeanne Le Caine Agnew, Oklahoma State University. 

"A Characterization of T, Compact Spaces," by Dragan Jankovic, University of Arkansas. 

"Necessary and Sufficient Conditions for a Linear Function To Be Continuous," by Ray Hamlett, East 
Central University. 

"Functions with Q@-~-closed and Pre-closed Graphs," by Paul E. Long, University of Arkansas. 

"Continuum Chainability and Monotone Decompositions in Certain Classes of Unicoherent Continua," 
by W. Dwayne Collins, Hendrix College. 

"Cyclic Graphs," by Mahesh M. Hiremath, University of Arkansas. 

"Degree One Maps of 3-manifolds," by Benny Evans, Oklahoma State University. 

"Kuhn-Tucker Conditions in Nonlinear Mathematical Optimization," by Karen Anderson, Hendrix Col- 
lege. 

"Maximization of Determinants in (1,0) Matrices," by David S. Rogers, Oklahoma Christian College. 

"Commutativity and Distributivity: Different Perspectives," by Ben Marshall, Hendrix College. 


Northern California Section 


The annual meeting of the Northern California Section was held on February 25, 1984 at San 
Francisco State University in San Francisco, California. There were 168 registered participants. 


Invited Addresses: 


"Indices of Power as a Measure of Centrality in Social Networks," by Guillermo Owen, University of 
Towa. 

"Combinatorial Problems with Surprising Solutions," by David Roselle, Virginia Polytechnic Insti- 
tute. 

"How Large an Elephant Will Fit in a Cube?" by Don Chakerian, University of California, Davis. 

"The Mathematics of Perfect Shuffles," by Persi Diaconis, Stanford University. 


Luncheon Speaker: 


"Successful Education and Successful Life," by Peter Hilton, SUNY at Binghamton. 


Ohio Section 


The annual fall meeting of the Ohio Section was held on November 2-3, 1984 at Muskingum Col- 
lege, New Concord, Ohio. Approximately 125 persons participated. 


Invited Addresses: 


"Renewing Undergraduate Mathematics,” by Lynn Steen, St. Olaf College. 

“Undecidable Mathematics," by Dennis Burke, Miami University. 

"Teaching Experimental Applied Mathematics: The TEAM Project," by Phil Schmidt, University of 
Akron. 
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"Characterizing Trees by a Finite Sequence of Integers," by Lee Saunders, Miami University, Hamil- 
ton. 
"Mathematics and the Law," by Mary Gray, American University. 


Short Presentations: 
"Computing a Metric on Dendograms," by Vic Norton, Bowling Green State University. 
"Abundant Numbers on Odd Perfect Numbers," by Rich Laatsch, Miami University. 
"Optimizing Boxes and Discs," by Tom Hern, Bowling Green State University. 


"A Perspective on ICME-5," by Richard Shumway, Ohio State University. 


Panel Discussion: 


"The VisUMAP Project," by Zaven Karian (Moderator), Denison University; Lynn Steen, St. Olaf Col- 
lege; Wayne Carlson, Cranston~Csuri Productions. 


Metropolitan New York Section 


The forty-third annual meeting of the Metropolitan New York Section was held at the College of 
Mt. Saint Vincent on May 6, 1984 with approximately 150 persons in attendance. 


Invited Addresses: 


"Increased Interaction in the Mathematics Classroom," by William F. Lucas, Cornell University. 
"Sphere Packing in Higher Dimensions with Applications," by Neil J.A. Sloane, Bell Telephone 
Laboratories. 


Short Presentations: 


"Euler’s 36 Officers Problem in Three Dimensions," by Joseph Arkin, New York Academy of Sciences; 
Paul Smith and E.G, Straus, University of California, Los Angeles. 

"Teaching Basic to Liberal Arts Students on Pocket Computers Via Selected Topics in Finite 
Mathematics," by Joseph Ercolano, Baruch College, CUNY. 

"A Simple Proof of Rodrigue’s Formula Using the Weingarten Equations of Surface Theory," by Martin 
Lewinter, College at Purchase, SUNY. 

"Clarification of Conservation of Liquid Quantity and Liquid Volume," by Egon Mermelstein, Academy 
of Aeronautics. 

"Nonhomeomorphic, Equipotent Topological Spaces Possessing a Common Invariant," by Jay Schiffman, 
Kean College. 

"Continuous Time Markov Chains Without Differential Equations," by Frederick Solomon, College at 
Purchase, SUNY. 

"Covers, Piles, and Binary Search," by Aaron Todd, St. John’s University. 

"Good Numbers and the Multiplicative Property," by Clara Wajngurt, Queensborough Community Col- 
lege, CUNY. 


Panel Discussion: 


"Computers in Undergraduate Mathematics," by Raymond Greenwell, Hofstra University; Sheldon P. 
Gordon, Suffolk Community College; John Miller, City College. 


New Jersey Section 


The fall meeting of the New Jersey section met at Rutgers University, New Brunswick, New Jersey 
on November 3, 1984. Approximately 60 people attended. 


Invited Lectures: 


"Classical Surface Theory in Minkowski 3~space," by Tilla Milnor, Rutgers University, New 
Brunswick. 

"Reaching Math Anxious College Students--Some Psychological Approaches," by Irene Deitch, College 
of Staten Island, CUNY. 

"What is Discrete Algorithmic Mathematics?" by Steven Maurer, Swarthmore College. 


Special Presentation: 


"A showing of videotapes and discussion of the materials from the M.A.A.°s TEAM project was 
presented by Ashby Foote, Rutgers University, New Brunswick. 
Seaway Section 


The fall meeting of the Seaway Section was held on November 2-3, 1984 at St. Bonaventure 
University, St. Bonaventure, New York. There was a registered attendance of 91. 
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Invited Addresses: 


"Gifted Students in Mathematics," by Robert Exner, Syracuse University. 
"Computing and the Changing Nature of College Mathematics," by Donald Kreider, Dartmouth College. 


Short Presentations: 


"Upper Bound Graphs: A Polynomial--Time Algorithm," by Douglas Cashing, St. Bonaventure Univer- 
sity. 

"Determining Hidden Surfaces of Convex Polyhedra," by Paul O”’Heron, Broome Community College. 

"Teaching Experimental Applied Mathematics~~-Video Tape Project," by Edwin Hoefer, RIT and TEAM 
Leader for the Seaway Section. 

"Solving Trigonometric Inequalities," by Boris Rakover, St. John Fisher College. 

"International Congress of Mathematics-82," by Nura Turner, SUNY at Albany. 

"Writing in the Math Classroom," by Marcia Birken, RIT. 

"Appreciating Efficient Sorting," by Morton Goldberg, Broome Community College. 

"Analytical Hierarchies and Decision Making," by Ronald Brzenk, Hartwick College. 

"Polynomials, Map Coloring, and Flattening Equations," by Frank Bernhart, RIT. 


Northeastern Section 


The spring meeting of the Northeastern Section was held jointly with the A.M.S. on June 29-July 


1, 1984 at Plymouth State College in Plymouth, New Hampshire. There were approximately 110 regis- 
trants. 


Invited Addresses: 


"Teaching and Research in “A Nation at Risk*," by Richard D. Anderson, Louisiana State University, 
Baton Rouge. 

"Mathematics in Management Courses,” by Alice T. Schafer, Simmons College, Wellesley College 
(Emeritus). . 

"Accreditation and Certification of MA and CS Programs," by John Dalphin, Norwich University. 

"The Fourth Dimension," by Stephanie Troyer, University of Hartford. 


Workshops: 


"Applications of Undergraduate Mathematics," by Solomon A. Garfunkel, Executive Director, COMAP. 

"UNIX," by Roger Kleinpeter and Michael Pearson, Plymouth State College. 

"MACSYMA and SMP," by Ken Lane, Colby College. 

"Karel the Robot: A Gentle Microworld of Structured Programming," by William J. Taffe, Plymouth 
State College. 


Panel Discussion: 


"The Crisis in Mathematics Teachers Training," by Fernand Prevost (Moderator), New Hampshire State 
Department of Education; Joan Mundy, University of New Hampshire; Robert Rosenbaum, Wesleyan 
University; Martin Badoian, Canton High School, Canton, Massachusetts. 


North Central Section 


The fall meeting of the North Central Section was held at Moorhead State University, Moorhead, 


Minnesota on October 26-27, 1984. There were 103 registrants. 


Invited Addresses: 


* 


"The University of Minnesota Talented Youth Project," by Harvey Keynes, University of Minnesota. 
“Some Unexpected Results in Elementary Mathematics," by Ivan Niven, University of Oregon. 


Short Presentations: 


* 
* 


+ 


+ $ + S 


“Composition for Two Peanos," by Paul Fjelstad, St. Olaf College. 

"Time in Special Relativity," by Abraham Ungar, North Dakota State University. 

"Mathematical Art by Computer," by Douglas Dunham, University of Minnesota, Duluth. 

"Proteomorphisms--On Generalizing Duality and Similarity," by Thomas Sibley, St. John’s Univer- 
sity. 

"Continued Roots," by Walter S. Sizer, Moorhead State University. 

"An Extended Hatzenbuhler-Mattson Technique," by Marlon C. Rayburn, University of Manitoba. 

"Contraction Mappings and Approximation of Roots," by Michael Tangredi, College of St. Benedict. 

"Some Properties of Pascal Triangle Matrices," by Dan Kalman, Augustana College. 


Special Sessions: 


Special sessions were held on "Policies for Secondary Mathematics Education," and "On Problem 


Solving." 
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9. M. Lerch, Dal8&i studie v oboru Malmsténovskych tad, Rozpravy Ceske Akad. 3, no. 28, 1894, 63 pp. 

10. C. J. Moreno, The Chowla-Selberg formula, J. Number Theory, 17 (1983) 226-245. 

11. T. Shintani, On special values of zeta-functions of totally real quadratic fields, Proc. Inter. Congr. Math., 
Helsinki, 1978, vol. 2, Acad. Sci. Fennica, Helsinki, 1980, pp. 591-597. 

12. E. C. Titchmarsh, The Theory of the Riemann Zeta-Function, Clarendon Press, Oxford, 1951. 

13. A. Weil, Elliptic Functions According to Eisenstein and Kronecker, Springer-Verlag, Berlin, 1976. 

14. E. T. Whittaker and G. N. Watson, A Course of Modern Analysis, 4th ed., University Press, Cambridge, 
1962. 


138. MISCELLANEA 


Just as surely. as our understanding of Nature is really valid only to the extent that it is 
mathematical, so also our understanding of higher domains must be based on mathematical 
models. 


—R. Steiner (1861-1925), quoted in Mathematical Reviews 82k : 51001. 


Each natural science contains genuine Science only to the extent that it contains 
Mathematics... . It may be that a formal philosophy of Nature in general, that is, a philosophy 
that deals only with general concepts, is possible without Mathematics, but a formal natural 
science dealing with definite objects (whether Physics or Psychology) is possible only with the use 
of Mathematics, and since each natural science contairis only as much genuine Science as it 
contains a priori knowledge, it follows that a natural science is a genuine Science only to the 
extent that Mathematics can be applied to it. 


—Immanuel Kant, Metaphysische Anfangsgriinde der Naturwissenschaft, 1876. 


Perhaps we should sometimes listen to the other side. Goethe, for example, did not like 
Mathematics. 


Mathematicians are amazing people. In virtue of their accomplishments, they have set 
themselves up as a universal guild and will acknowledge only what suits their circle, what their 
method of organization can produce. A prominent mathematician once said, when someone 
strongly recommended a topic in Physics, “But can’t it be reduced to calculation?” 

—Goethe, Maximen und Reflexionen, no. 1277 

It surely does not follow that the hunter who kills the game must also cook it. A cook might go 
hunting and shoot well; but he would be badly mistaken if he claimed that only a cook can be a 
good shot. It seems to me that this is the situation of mathematicians who claim that nobody can 
understand or discover physical phenomena without being a mathematician, since they should be 
pleased enough if the meat is brought to their kitchen for them to lard it with formulas and dress 
it as they like. 

—Goethe, Maximen und Reflexionen, no. 1280 


ANSWER TO PHOTO ON PAGE 93 


The photo is of Louis de Branges, who proved in 1984 the conjecture Bieberbach made in 1916, 
by establishing a strong inequality proposed by the Russian function-theorist I. M. Milin. 
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ON HALLEY’S ITERATION METHOD 


WALTER GANDER 
Seminar fur angewandte Mathematik ETH, CH-8092 Zurich, Switzerland 


1. Introduction. Solving a nonlinear equation is a problem that has occupied mathematicians 
for many centuries and many numerical analysts of the present generation first learned how to do 
it from P. Henrici’s textbook [7] which has now been replaced by [8]. 

Nearly 300 years ago in 1694 Edmund Halley published a paper [6] in Latin where he presents 
a new method to compute roots of a polynomial. Halley is well known for first computing the 
orbit of the Halley comet, which he observed in 1682 and which will soon visit us again in 1986. 

Halley generalized an iteration formula due to Lagney for computing the cubic root of a 
number and obtained an iteration to compute roots of a polynomial. Halley did not use calculus 
to derive his formula. If we try to generalize and translate his derivation into modern mathemati- 
cal notation we can argue as follows: 

Let f be a C* function and x, an approximation of a zero s of f. We replace the equation 
f(x) = 0 by T(h) = 0, where T is the Taylor polynomial of degree 2 of f 


f’ Pu) ) a2 

(1) T(h) = f(x,) + f'(%,)h + 
and h = x — x,. Now if x, is a good approximation of s, then neglecting h? we get from 
(2) T(h) =0 
the Newton correction 
(3) = —f(x,)/f(%)- 
Since we neglected h’, the denominator in (3) is wrong: equation (3) should be 

fC X4) 
(4) h= —f(x,)/ f(x) + Sh 


Therefore, replacing h in the denominator of (4) by the Newton correction (3), we obtain the 
Halley correction 


(5) h= —f(x;) 
Pm) LCC) 
2f"( xx) 


For the following discussion we rearrange Halley’s iteration formula (5) to 


f(x) 1 


(6a) Xp4y = XE f’ (x,) _ 4) | 
where 
(6b) (x) LI) 


f(x) 


In his paper [6], Halley calls the iteration formula (6a) the rational formula. He also considers the 


Euler correction 
—f'(x,) + Vf’( X,) — 2f( Xn) F(X) 


”) - f’'( xx) 
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that is obtained by solving the quadratic equation (2) and which he calls the irrational formula. If 
we rearrange (7) and compute the correction closest to zero, we get Euler’s iteration 
(8) X41 7 XK ~ Le). ———— 
f(x) 1+y1- 2t(x,) 
where ¢ is defined by (6b). 


In the following we shall show how Halley’s method and also many other third order iteration 
formulas may be derived algebraically using an elementary technique. For geometric interpreta- 
tions of Halley’s method we refer to [1], [2]. 


2. Algebraic interpretation. Halley’s method (6) belongs to the class of one point iteration 
methods without memory [12] 


(9) Xpa1 = F(x;,)- 
We consider the special case where the iteration function F has the form 
(10) F(x) = x - $x), 
f(x) 
For the following discussion we assume that s is a simple zero of f and that f and G have a 


sufficient number of continuous derivatives in a neighborhood of s. It is well known [11] that the 
iteration (9) is of second order if 


(11) F'(s) = 0, F”(s) #0. 


Let u(x) = f(x)/f’(x). Then we have u’(x) = 1 — t(x) where ¢ is defined by (6b). It follows 
that 


(12) u(s) =Oand u’(s) = 
Differentiating (10) we get 
(13) F’(x) =1— u(x) G(x) — u(x) G(x). 


And using (12) we get F’(s) = 1 — G(s). 
LEMMA 1. The iteration (9) with F defined by (10) is of second order if and only if G(s) = 1. 


We note that for the special case G(x) = 1 we have Newton’s method for the equation 
f(x) = 0. 
The iteration (9) is of third order if 


(14) F’(s) = F’(s) =0,F’’(s) #0. 
Differentiating (13) we get 
F’(x) = —u’’(x)G(x) — 2u’(x) G(x) — u(x) G”(x). 
Since 
ep) = xy = LD 4 Gren ed 
= =H) = FS + pn EET | 


it follows that 
(15) u"(s) = ze = -1’(s) 
and 


ny) £743) “ 
F(s) = 775 G(s) = 26/3) 


1985] NOTES 133 


LEMMA 2. The iteration (9) with F defined by (10) is of third order if and only if G(s) = 1 and 
G’(s) = (1/2) f(s) /f"(s). 
The assumptions of Lemma 2 are not helpful for choosing a function G since they need the 
knowledge of the zero s. However, if we choose 


(16) G(x) = H(t(x)) 
with some function H, then we have 
G(s) = H(0) 
and 
G’(s) = H’(0)t’(s). 
Therefore the assumptions of Lemma 2 are simply H(0)=1 and H’(0)=1/2. We get the 
following theorem. 


THEOREM. Let s be a simple zero of f and H any function with H(O)=1, H’(0)=1/2 and 
| ’’(0)| < co. The iteration x,,,, = F(x,,) with 


yee LH) 
F(x) (x F(x) CEO x)) 
where 
) FOL) 
Ze 


is of third order. 
Many well-known third order iterative methods are special cases of this theorem: 


(1) Halley’s method 
H(t) =(1—-4t) eltsrt ae foe 
(2) Euler’s formula 
H(t) =2(1 +y¥1—2t)” =] +or+Sr+ _ 
(3) Hansen-Patrick family [5] 


H(t) =(a+1D(atyjl =(a+1)t)~ =1 +5145 p qos. 


(4) Ostrowski’s square root iteration [10] 


H(t) =(1-1) S=1+srtorte 
(5) Quadratic inverse interpolation 
H(t) =1+ st 


Not all the third order iteration methods are special cases of this theorem. It is possible to 
describe all methods that do not explicitly depend on s by using the Schréder iteration functions 
[9, p. 531]. One can show [4] that all third order methods are given by the iteration function (10) 
with 


G(x) = H(t(x)) + f(x)*b(x), 


where 5 is an arbitrary function which is bounded for x > s. 
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A CONVERSE FOR THE CAYLEY-HAMILTON THEOREM 


CARMEN CHICONE, N.J. KALTON*, AND IRA J. PAPICK 
Department of Mathematics, University of Missouri-Columbia, Columbia, MO 65211 


The Cayley-Hamilton theorem asserts that every n X n matrix A satisfies its characteristic 
polynomial, det(A J — A). This note deals with the problem of characterizing those polynomials 
for which the Cayley-Hamilton theorem holds. Informally stated our result is that the polynomials 
which a square matrix satisfies are precisely the multiples (in a ring of polynomials) of the 
characteristic polynomial. 

Some notation is necessary to make a precise statement of our theorem. If X denotes the n X n 
matrix of indeterminants (x;;), then it is apparent that det(X) is a polynomial in n’-variables. 
Moreover, if F(x;,;) is any polynomial in n’-variables with coefficients in a commutative ring R 
with identity, then by F(X) we shall mean F(x;;). We now restate our problem: Characterize 
those polynomials F(X) having the property that every n X n matrix A with entries in R satisfies 
the polynomial F(AI — A). We shall call such polynomials F(X) Cayley-Hamilton polynomials, 
and we may now state a precise converse of the Cayley-Hamilton theorem. 


THEOREM 1. Let R be an infinite (commutative) integral domain, and let F(X) be a polynomial 
in n*-variables with coefficients in R. Then F(X) is a Cayley-Hamilton polynomial if and only if 
F(X) = det(X)G(X), where G(X) is a polynomial in n*-variables with coefficients in R. 

Before proceeding we make two observations. The “if’’ direction of the theorem is clear. For if 
A is an n Xn matrix with entries in R, and if F(X) = det(X)G(X), then F(AI — A) = 
det(AI — A)G(AI — A). Hence, by the Cayley-Hamilton theorem, A satisfies the polynomial 
F(AI — A). Secondly, the theorem is false without some assumptions on R. For example if 
R = Z,, the field with two elements, define F(X) = (x). + X5)X 1X2). Then given 


Qi, 42 
A= 
GQ, 422 


with a;, © Z,, we see that 
F(AI _ A) = (a1) + An, (A _ ay, )(A _ Any). 


Thus F(AI — A) = 0 unless A is either upper or lower triangular and not diagonal, and in this 
case : 


F(AI — A) = (A — a,,)(A — ay) = det(AT — A). 


*Supported in part by NSF grant MCS-8001852. 
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follows from the equation 
F(X, X) = O(X, X) D(X, X) + R(X, X), 
since by the Cayley-Hamilton theorem D(X, X) = 0 and, as shown above, F(X, X) = 0. Thus if 
R(X,A) =u,(X)N + +++ +ug(X), 


we have u,(X)X" + +++ +u9(X)I = 0. 

We now claim that the characteristic polynomial, D( X,A) € S[A] of X has n distinct roots 
(in some splitting field of the field of fractions of S). Once this is shown, the proof will be 
complete, for the minimal polynomial of the matrix X will be degree n and hence we can 
conclude that u,(X) = -:- =u)(X)=0, ie, RCX,A) = 0. 

To establish our claim we note that the discriminant A(X) of D(X,A) is a homogeneous 
polynomial in the coefficients of D(X,A), and A(X) = 0 if and only if D(X,A) has a repeated 
root [3, p. 288]. If A € M,(R), then A(A) is the discriminant of D(A, A); hence by selecting A to 
be diagonal with n distinct eigenvalues (R is infinite!), we see that A(X) cannot vanish 
identically. This completes the proof. 


Now to derive the theorem from Lemma 2. Let G(X) be a Cayley-Hamilton polynomial, and 
define F(X,A) = G(AI — X). Then F(A, A) = 0 for each A € M,(R), and thus it follows from 
Lemma 2 that D(X,A) is a factor of F(X,A), ie, F(X,A) = D(X,A)QCX, A). Hence, putting. 
\ = 0 and replacing X by —X we have 


G(X) = det( X)Q(X,0), 
where . 
Q( X,0) € R[ X]. 
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AN ELEMENTARY CHARACTERIZATION OF WEAK CONVERGENCE OF MEASURES 


JERRY JOHNSON 
Department of Mathematics, Oklahoma State University, Stillwater, OK 74078 


In 1977, Hégnas [2] solved a problem of P. Lévy which, roughly put, asks for conditions on a 
family { K,} of functions so that for each continuous function f on [0,1] we have continuity of 
s > {jfdK,. The solution follows immediately from his main theorem, whose proof is fairly long. 
This note presents a short, elementary proof of his theorem, using only a few basic results from a 
standard real variables text such as Royden [3]. 

Our notation is as follows: C denotes the continuous functions on [0,1] with supremum norm 
ll - Ilo: If f and K are functions on [0,1], {fdK will denote the Riemann-Stieltjes integral when it 
exists (see [1] for the definition and basic facts). A will denote Lebesgue measure on [0, 1]. 

The following five items used in the proof are stated in the Jeast possible generality, reflecting 
the assumptions in the theorem. 


(1) The Arzela-Ascoli Theorem: An equicontinuous bounded subset of C is relatively compact. 

(2) The Stone-Weierstrass Theorem: The polynomials are dense in C. 

(3) If w is a signed measure on [0,1], fe C and K(x) = p[0, x] for 0 < x <1, then K is of 
bounded variation and {fd = K(0)f(0) + {fdkK. 

(4) Integration by parts: If either {fdK or {Kdf exists, so does the other and [fdK + {[Kdf = 
Kf |p (see [1, p. 195]). 
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(5) If f is absolutely continuous and K is of bounded variation, then {Kdf = {Kf’d2. 
(Suppose ¢ is bounded and measurable. Let f(x) = {dX and replace f’ by . This is 
the context in which (5) is used and it is an exercise to prove this special case.) 


We are now ready to present the main theorem in [2]. 


THEOREM [2, p. 182]. Let {u,} be a bounded net (or sequence) of signed Borel measures on 
[0,1]; i.e., there is a number M > 0 such that |ffdu,|< M\|f\|.,. for fe C. Define K,(x) = 
bh [0, x] for 0 < x <1. Then the following are equivalent: 


(A) lim, ffdu, = 0 for each f € C (i.e., {u,} converges weak* to zero). 
(B) lim, ({|KqldA + |K,(1)) = 0. 


Proof: Assume (B). Let f be continuously differentiable on [0,1]. Then 


| [fuel =| [faKa + (0) K4(0) 


=|{K| — [Ku af + f(0) K,(0)| 


< 0) Ke()+| [Kea 


di 


< [f(1) Ka(1)|+ [| Kal’ 


<|f(1) Ka(1)| + Uf leo f Kal aa 


which converges to zero. Hence lim, /fdu, = 0 for f in a dense subset of C. Since {p,} is 
bounded, it is an easy exercise to show that lim, {fdu, = 0 for all fe C. 

Next assume (A). First K,(1) = {1dp,, which converges to zero. Now, for each a, choose a 
function ¢, so that , is measurable, |¢,(¢)| = 1 for all ¢ and $,K, =|K,|. Define f,(x) = 
foo, dA for 0 < x < 1. It is easy to check that { f,} is an equicontinuous bounded set in C and 
hence is relatively compact in C. Now, it is another easy exercise to show that since {, } is 
bounded and lim, {fdp, = 0 for f € C, then lim, {fdu, = 0 uniformly for f in a compact set. 
From this, it follows that lim, {f, du, = 0. Finally, 


[fadta = [fed + fa(0) K,(0) 
= faKalo — [Ka du + fa(0) Ka(0) 
= K,(1) fa(1) — [Kafe ar 
= K,(1) fa(1) — [Kaba dd 


= K,(1)f(1) ~ [| Kal aa. 


Therefore, {|K,|d\ = K,(1)f,() — ff, du, which converges to zero since { f,(1)} is bounded. 
This completes the proof. 
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INTERLACED SECOND CATEGORY SETS 


HARVEY DIAMOND AND GREGORY GELLES 
Department of Mathematics, West Virginia University, Morgantown, WV 26506 


In this note we construct a pair of disjoint subsets of the real line, each of which intersects 
every interval in a set of second category. Our intuitive motivation is the creation of disjoint 
subsets each having a “large” representation in every interval of the real line, where “large” is 
interpreted in a topological way as “second category.” In a measure-theoretic sense “large” may 
be interpreted as nonnull (i.e., positive outer measure) or as positive measure; in either case, 
well-known examples exist of disjoint sets which are measure-theoretically “large” on every 
interval. A nonmeasurable example may be found in [3, p. 70], and a recent measurable example is 
given in [9]. Our intention here is a simple construction of analogous examples for second category 
largeness. We will provide both measurable and nonmeasurable examples. 

The existence of analogies between measure and category have long been the subject of 
investigation. For example, a formal link between second category and positive outer measure can 
be found in the Sierpinski-Erdés Duality Theorem [6]: 


Let P be any proposition involving solely the notions of measure zero, first category, and 
notions of pure set theory. Let P* be the proposition obtained from P by interchanging the 
terms “nullset”’ and “set of first category”? wherever they appear. Then each of the propositions 
P and P* implies the other, assuming the continuum hypothesis. 


A further connection that our second category sets have.with an established area of mathematics 
concerns the Property of Baire. Briefly, the collection of sets having the Property of Baire is the 
smallest o-algebra containing the open sets and the sets of first category. In [4, p. 88] it is proved 
that a set of real numbers does not have the Property of Baire if and only if there exists an interval 
such that the set and its complement both have second category intersection with every 
subinterval of that interval. Thus, the disjoint sets that concern us may be equivalently char- 
acterized as sets whose intersection with any interval does not have the Property of Baire. A fuller 
discussion of the Property of Baire may be found in [4] and [6]. 

We begin our construction by noting that there already exist examples of disjoint nonmeasura- 
ble sets which are second category on every interval. (This property does not appear to have been 
previously observed.) For instance, the classic Vitali construction of a nonmeasurable set as in [8, 
p. 52] provides such an example. This construction produces a set E such that {E+ r:r © Q} is 
a countable partition of R, where Q is the set of rational numbers. Since R is of second category, 
E ++ r must be of second category for all r. Now suppose that J is an interval of length 2~*, 
k © N, where N is the set of positive integers. Then one of the sets (E + n2~*) OI, where n 
ranges over the set of integers Z, must be of second category. Consequently, the set A = U{E + 
n2-*:n © Z,k © N} meets every interval in a set of second category, as does the (disjoint) 
translate A + 1/3. 

The construction of two disjoint measurable sets which meet every interval in a set of second 
category can now be effected by intersecting the disjoint sets A and A + 1/3 above with any 
measure zero set having a first category complement. (Such measure zero sets are well known. An 
example is constructed in [2, p. 99].) Indeed, if V has first category complement, then A M V 
differs from A by a set of first category and thus retains the desired property. 

Finally we present a different, more explicit construction of a measurable example, which is 
interesting in that it reveals some of the unusual properties of the set of nonnormal numbers and 
parallels the Vitali construction of a nonmeasurable set. (Recall that a number is normal to base r 
(r an integer larger than 1) if in its base r expansion all blocks of digits with the same length 
occur with equal frequency. Thus for instance, each digit occurs with frequence 1/r. A more 
rigorous definition may be found in [5, p. 95]. A number that is not normal to base r is said to be 
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nonnormal to base r.) The construction proceeds by obtaining a countable partition by transla- 
tions of a second category set of measure zero. In contrast to the example in [2, p. 99], our 
purposes require a measure zero, second category set whose members can be characterized 
specifically, for instance, by their base two expansions. The set of numbers nonnormal to base two 
is such a set. 

A proof that the set of nonnormal numbers to any base r has measure zero can be found in [5, 
p. 103]. A different proof using probabilistic arguments is outlined in [1, pp. 105-107]. The proof 
uses the law of large numbers and the fact that Lebesgue measure can be obtained as the 
probability distribution of real numbers generated at random by choosing each digit indepen- 
dently with each of the r possibilities equally likely. 

The fact that the numbers nonnormal to base r form a set of second category (in fact have first 
category complement) does not seem to be widely known. Proofs can be found in [6] and [7], but 
we will furnish a short argument for the case of base two. 

In the base two expansion of a number x, let L(k, x) denote the number of zeroes that appear 
in the first k places after the decimal point. Now define for each m € N the union of intervals 


E, = {x:x €(n27-",n2-" + 27°") ne Z}. 


Note that if x © E,,, then the base two expansion of x has zeroes in places m + 1 through 3m so 
that L(3m,x) > 2m. This means in particular that the set imE,, = {x:x © E,, for infinitely 
many m} consists entirely of numbers nonnormal to base two. Set theoretically, 


(oe) (oe) 
limE,= f) U E,. 
M=1 m=M 

The set U%_,,E,, is open and contains the dense set of points {n2"" + 2>°""'},n € Z,m>M 
so its complement is nowhere dense. Therefore the complement of lim E,, is the countable union 
of nowhere dense sets and thus of first category. Therefore the set of numbers nonnormal to base 
two has a first category complement and so is a set of second category. 

The next step is the partition of the set of numbers nonnormal to base two, which we denote by 
B. Let D denote the set of dyadic rationals, where each element of D has a base two expansion 
which terminates in an infinite string of zeroes. We observe that if x € B and dé D, then 
x + d © B. Divide B into equivalence classes according to x ~ y if and only if x, y © B and 
x —y © D. Using the Axion of Choice, form a set F consisting of one element from each 
equivalence class. Clearly, B= U,ep(F + @). Since B 1s of second category, F + d must be of 
second category for all d € D. Let D, denote the set of dyadics having an even number of ones to 
the right of the decimal point in its base two expansion. Define D, similarly for odd numbers of 
ones (D) = D\ D,). Note that D, and Dy are both dense in R. Define the two sets 

B,= U (F+4,) and B= U (F+4)). 
d,€D, dye Do 

We show that B, meets every interval in a set of second category. Indeed, if J is any interval, then 
the density of D, implies that 


F= U {((F+4,) 91] -4,}. 


Since F is of second category, one of the sets (F + d,) OJ must be of second category and thus 
B, OI is of second category. A similar argument applies to B,. 
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LION-HUNTING WITH LOGIC* 


HOuSTON EULER 
c/o Leon Harkleroad, 4417 Woodgate Lane, Louisville, KY 40220 


Over the years there has developed a body of literature on the use of mathematical techniques 
to catch lions ({1]-[6]). In this literature there has been a comparative shortage of proofs based on 
mathematical logic. If those of us who commit logic believe in the vitality of our field, we cannot 
afford to allow such a shortage to continue. The following proofs, then, are offered as a first step 
towards rectifying the situation. 


1. Nonstandard Analysis. In a nonstandard universe (namely, the land of Oz [7]), lions are 
cowardly and may be caught easily. By the transfer principle, this likewise holds in our (standard) 
universe. 


2. Set Theory. If the set of lions is bounded, you can simply build a cage around the boundary. 
So assume that the set of lions is unbounded. It will then have an element in common with a 
stationary set. But a stationary lion is trivial to capture. 


3. Set Theory. Assume V = L. Since the lion is in the universe, it is constructible. So just carry 
out its construction within a cage in the first place. 


4. Set Theory. Assume AC. Perform a Tarski-Banach decomposition on the lion to halve its 
size. Repeat until the lion is small enough to be captured easily. 


5. Recursion Theory. Assume you can capture a lion. Having done so, you can easily bring it to 
a standstill, and you would thus have a solution to the halting problem. Since the halting problem 
is unsolvable, you cannot capture a lion after all. 


In conjunction with the previous results, we have 
COROLLARY. Mathematics is inconsistent. 


This corollary, besides being of intrinsic interest, also provides solutions to the Riemann 
Hypothesis, Fermat’s Last Theorem, and other questions (besides giving a proof of the Four-Color 
Theorem that does not require a computer). 
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AN INGRAINED ERROR CONCERNING RESULTANTS 


ERNST SNAPPER 
Department of Mathematics, Dartmouth College, Hanover, NH 03755 


All books I know in modern algebra which discuss the resultant of two polynomials, including 
the famous books by Jacobson [1], Lang [2] and van der Waerden [3], state the following wrong 
theorem. 


Let F be a field and f a polynomial of positive degree n with coefficients in F and leading 
coefficient a. Let f’ be the derivative of f, D the discriminant of f and R(/f, f’) the resultant of f 
and f’. The theorem in question states that 


R(f,f’) = aD. 


This theorem is already false for quadratic, monic polynomials over fields of characteristic 0. 
For example, if f is x* + x + 1, the quadratic formula gives that D = —3, but R(/, f’) = 3. 

The error in the proof of the theorem comes about as follows. Let the factorization of 
f(= f(x)) over the algebraic closure of F be 


f(x) =a(x—n)---(x- 4). 


Then one proves (the details appear in [3]), correctly, that 

ROA, f) =a" "TT, - 4). 
i+ k 

The error is caused by claiming that 


IG, —1%)= LH —_ r) 


l#k 


while the correct claim is of course that 


Il» —1)= (-yP"" TT (7, - r%) 


i<k 
Indeed, the correct theorem is 


‘THEOREM. 


R(F, fF) = (-1) "aD. 
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Experience has shown that the power of mathematical models in the study of the real world has 
increased along with the complexity of the models and the level of abstraction of the mathematics. 


—N. Teodorescu (1968), quoted by K. Manteuffel in Wiss. Z. Tech. 
Hochsch. Magdeburg 23 (1979), 523-525. 
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TEACHING PROOFS: A LESSON FROM SOFTWARE ENGINEERING 


MEL FRISKE 
Department of Mathematics and Computer Science, Wisconsin Lutheran College, Milwaukee, WI 53226 


Introduction. How can we provide a positive early experience with proofs for the average 
mathematical sciences student? The companion problem in computer science is how can the 
average computer science student design and write a computer program that works? 

During the infancy of computing programmer productivity was unacceptably low, and the cost 
of modifying and maintaining poorly written software soared. In response the computing 
community developed software methodology [3], [5], [6] and brought it into the curriculum [1], [2], 
[14] with courses that emphasized technique as well as content (e.g., data structures, systems 
analysis and design, and software engineering). The results have been quite satisfactory, for 
students have responded well to the methodology. 

There is a close analogy between the thought processes used in computer programming and 
those required for writing proofs. The methodology and principles which have been so effective in 
teaching programming can also be applied to teaching proof skills. Teaching proofs is one activity 
where mathematicians have had an unacceptable rate of success. 

Traditionally a student’s first intensive experience with proofs has been relegated to upper level 
courses in analysis and algebra. In such courses the primary focus is on the mathematical content. 
Consequently, proof related skills are often left for the student to absorb through some undefined 
process. The problem a mathematics student faces in proving a theorem is not much different 
from that which a computing student faces with a complex programming task. However, this 
intimidating barrier can be overcome by a systematic approach to problem solving. In computing 
there are three basic problem solving steps which we now consider. 


1. Problem Specification. The first step is the hardest. How often have we not heard a 
mathematics student complain: “I just don’t know where to start’? Computing students are 
taught that the first step in problem solving is problem specification: “Just what is the real 
problem?” One device used here is a data flow diagram [12] which diagrammatically represents the 
sources, flows, and destinations of all required data for the programming task. In mathematics, 
students usually learn the counterpart, namely, drawing a picture of the problem. Topologists are 
particularly adept at this. 

Another computing aid is the IPO chart [5] which the problem solver uses to identify the 
desired output (O), the available (I), and the processing (P) steps required to transform the input 
into the output. Here there is a strong and valuable analogy to proof writing. To prove a theorem 
a mathematician first asks, “What do I have to show?” Then he asks, ““What hypotheses and facts 
do I have to work with?” And finally, “How do I get from the given assumptions to the desired 
conclusion?” Notice the output, input, processing sequence once again. In teaching proof skills we 
need to identify and emphasize this three-step thought process. 

For students the first of these questions is often difficult. How does one decide what has to be 
shown? In [11] this question is broken into three parts: the abstraction question, answering the 
abstraction question, and applying the answer to the specific problem. For example, suppose we 
are trying to prove that if X is a probability space and E is an event, then P(E) < P(X). The 
abstraction question would be, “How can we show that the probability of an event is less than or 
equal to the probability of another event?” An answer to the abstraction question is, “Show that 
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the first event is a subset of the second event.” Applying this to the problem at hand, we need to 
show that E is a subset of X. 


2. Logical Design. After problem specification a computing student knows that the next step is 
designing the program logic. This is normally done in abbreviated form without regard to details 
of language syntax. The result of this analysis is called pseudocode: a logically correct solution of 
the problem which, however, is in a form unacceptable for computer consumption. This logical 
design corresponds to a mathematician’s initial sketch of a proof. The sketch is also abbreviated, 
contains the main steps, probably lacks some fine details, and requires conversion into acceptable 
English. We all know that this design step is the big challenge, this is where the blood is shed. 
Even here computing provides some methodological principles to ease the task. 

The structured design principle is one developed by two mathematicians. In the early days of 
high level programming languages program style varied widely. Clever programmers would devise 
cute tricks, abbreviate freely, and construct complicated flows of control just to shave a second off 
of processing time [13]. The result was software which few people besides the author would ever 
want to read, software which often contained hard to find logical errors. What was needed was a 
small set of simple logical building blocks (control structures) from which any program could be 
built. In a remarkable theorem proved in 1964, Béhm and Jacopini [4] provided such a set. It 
contained only three structures: a sequence, a decision, and a loop. Although theoretically any 
computer program can be constructed from this small set, most programmers extend the set for 
practical purposes to include approximately seven control structures. A program designed 
exclusively from this set is then properly called a structured program. 

Even freshmen can readily grasp this structured approach to the design of program logic. The 
same can be done with mathematical proofs. Certain basic logical techniques are used over and 
over in constructing proofs. In his book [11, pp. 97-98] Solow has identified what might be called 
a set of control structures for mathematical proofs. They include, for example, the “forward-back- 
ward” method, proving the contrapositive, proof by contradiction, construction proofs, induction, 
etc. Students can master the first few of these as sophomores, and as they mature, master others. It 
is not necessary to have a name for every technique, but it is important that students recognize a 
technique when they read it and be able to employ it themselves. 

Two other design principles closely related to structured design are the top-down and modular 
approaches [5]. “Top-down” and “modular” refer to the manner in which a complex problem is 
broken into more manageable subproblems (modules). “Structured” refers to the set of logical 
building blocks from which the problem solution is constructed. 

For example suppose that in graph theory we are asked to prove that graph isomorphism is an 
equivalence relation. After analyzing the problem our problem specification reveals that we need 
to show three things, namely, that graph isomorphism is reflexive, symmetric, and transitive. In a 
very natural way the big problem has been decomposed into three smaller subproblems (modules). 
For more examples and detail the reader should see the article by Leron [8]. 


3. Writing the Program. Upon completing the logical design a computing student converts it 
module by module into an appropriate programming language [7], paying special attention to the 
reserved words and syntax of the language. Converting a proof sketch into a paragraph proof is a 
similar task in which care must be taken with language and style. 

If we truly wish to be more effective in teaching proof skills, we need to do a better job of 
teaching. There is no excuse: Polya’s classic work [9] has been with us for years, and his “How to 
Solve it” list beautifully elaborates the process of problem specification and the design principles 
mentioned earlier. To teach these ideas in computing we find it helpful to prepare a detailed 
syllabus indicating not only the material content but also a careful ordering of the methods, 
techniques, and principles to be taught and the types of programs the student will write. In 
mathematics we need to do the same. This includes specifying in advance the types of proofs 
students will be required to write and providing consistent, constructive criticism when grading 
them. 
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We also need new introductory texts providing systematic coverage of proof skills (for example 
[10]). Ideally this material can be interwoven with the mathematical content. Such texts might 
contain exercises requiring problem specification, a logical design, or analysis of a proof by 
breaking it into modules. Debugging exercises (What’s wrong with this?) are also very effective. 


SUMMARY. Software engineering offers a methodology and some principles for proof writing, 
namely, 


A. Problem specification, 
B. Logical design (structured, top-down, modular), 
C. Writing the proof (attention to organization and special language). 


A first significant exposure to these techniques should be provided in a lower level course which 
has some flexibility in mathematical content, so that sufficient time can be allocated to technique 
and not just content. 
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USING THE MULTIVARIABLE CHAIN RULE 


FRED HALPERN 
10602 Stone Canyon Rd., #260, Dallas, TX 75230 


Finding the derivative of an expression with many occurrences of x is a major application of 
the multivariable chain rule. The usual computation, using extraneous variables, introduces 
various side calculations and bookkeeping chores. This paper restates the multivariable chain rule 
to provide a simple straightforward method of computation. Informally, the computation states 
that 


The derivative of an expression with many occurrences of x is the sum of the derivatives of the 
expression with respect to each separate occurrence of x. 


Thomas and Finney [1] (pp. 600-605) contains a standard statement of the multivariable chain 
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rule and its application to computing derivatives. 


We consider functions f(x,,...,x,) and are interested in Df(x,...,x) where D denotes 
differentiation with respect to x. Since dx,/dx = 1(i=1,...,n), 
_y Ff 
Df(x,...,x) = Ix. 


When computing each df/dx, the other variables are kept constant. This is signified by replacing 
the other variables by c, x being resubstituted for c after the differentiation. 

With this agreement about the relation between x and c, we obtain the following application 
of the multivariable chain rule to functions of the form f(x,..., x). 


THEOREM. If each of 
Df(x,c,...,¢), Df(c,x,...,¢),.--, Df(c,c,...,x) 
exists and is continuous in an interval containing c, then 


Df(x,x,...,x) = Df(x,c,...,c) + Df(c,x,...,c) + +++ + Df(c,c,..., x). 


The usual product and quotient rules for derivatives easily follow from the Theorem. In 
addition, an “exponential rule” for Df(x)&° can be obtained. Here are some applications of the 
Theorem. 


EXAMPLES: 
(1) Dx e*/sinx = Dxe‘/sinc + Dce*/sinc + Dce‘/sin x 
= e°/sinc + ce*/since + ce‘( —1/sin’ x)cos x 
= e* /sinx + xe*/sin x — xe* cos x/sin’ x. 
(2) Dx** = Dx® + De® + De“ 
= ¢x(—) 4+ (Inc) c* cx*~! +(Inc) c* (Incc*) 
= x*xCOD + xXx In x + x*x* In? x. 
(3) Df g(x. t) dt = Df gc. t) dt + Df g(x, t) dt 
a a a 


= Be f(x) F(x) + [OSE (x0) dt 


= 8x f(x)) f(x) +f SE (a0) at 


x 


(4) D [f(4,0) du dv = Df av f'f(u,») du + Df dv f f(u,0) du 


0 


= [ f(0) dv + [ fnx) du 
= [/f(») dv + [fax du. 


The Theorem also applies to expressions involving determinants, inner products, and cross 
products. 
Reference 


1. G. B. Thomas and R. L. Finney, Calculus and Analytic Geometry, 5th ed., Addison-Wesley, Reading, MA, 
1980. 
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ON REGULAR MARKOV CHAINS 


NICHOLAS J. ROSE 
Department of Mathematics, North Carolina State University, Raleigh, NC 27650-8205 


The following basic theorem concerning regular Markov Chains is often presented in courses in 
finite mathematics: 


THEOREM. Let P be the transition matrix for a regular Markov chain, then (i) P" > W as 
n — 00, (ii) each row of W is the same probability vector w which has positive components, (iii) if p 
is a probability vector, p* "> w, (iv) w = wP. 

Since, generally speaking, students in such courses know little mathematics beyond high school 
algebra, the proof of the theorem is usually omitted. In this note we present an elementary proof 
in the case when P is a2 X 2 matrix. 

Recall that a probability vector is a row vector with nonnegative components that sum to 1, a 
transition matrix is a square matrix whose rows are probability vectors, and a transition matrix for 
a regular Markov chain has the property that some power of the matrix has all positive entries. 

A regular 2 X 2 transition matrix can be written in the form 


_|l-a a 
p=| b ant 


where 0<a<1,0<b<1,a+b< 2. Before computing powers of P, we write P=I+ S, 


where 
—a a 
s=| b ne 


Letting r = —(a + b), we find S? = rS, and S" = r""'S for n > 2. Computing the powers of P 
we find 


P?=(14+S) =14+28+ SV =14+(14+(14+7))S, 
P= 7+((1+(1+r)+(14+7))S, 


_ 1-(1+,r)" 
Pe =T+(14+(14+r)t+---+(1 +r)" ')S=I+4 
(1+(1 +7) (i+r)"") 1—-(1+7r) 
Since —1 < 1+ 7 < 1, we find 
b a 
h _ 1 _| a+b at+b 
pe? Watt 7s = b 2 


a+b a+b 


This proves parts (i) an (ii) of the theorem; the other parts follow easily. 
Finally we note that another way of computing P” is to use the binomial theorem: 


P"=(I+5S)"= y (7) si =7+ y (i )rots 


i=0 


¥(7)r}s 


i=] 


=J+— 


=1+=((1 +r)"-1)S. 
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Send all proposed problems, typed and in duplicate if possible, to Professor D. H. Mugler, Department of 
Mathematics, University of Santa Clara, Santa Clara, CA 95053. Please include solutions, relevant references, 
ete. 

An asterisk (*) indicates that neither the proposer nor the editors supplied a solution. 

Solutions should be sent to the address given at the head of each problem set. 

A publishable solution must, above all, be correct. Given correctness, elegance and conciseness are preferred. 
The answer to the problem should appear right at the beginning. If your method yields a more general result, so 
much the better. If you discover that a MONTHLY problem has already been solved in the literature, you should of 
course tell the editors; include a copy of the solution if you can. 


ELEMENTARY PROBLEMS 


Solutions of these Elementary Problems should be mailed in duplicate to Professor D. H. Mugler, Department 
of Mathematics, University of Santa Clara, Santa Clara, CA 95053, by June 30, 1985. Please place the solver’s 
name and mailing address on each (double-spaced) sheet. Include a self-addressed card or label (for acknowledge- 
ment). 


E 3073. Proposed by Calin P. Popescu, Bucharest, Rumania. 
In the hexagon A,A,A,A,A;A, the triangles A, A,A, and A,A,A, are equilateral. Is it true 


that A,A,,3 (k = 1,2,3) are congruent if their sum equals the perimeter of the hexagon? Also, 
consider the converse. 


E 3074. Proposed by M. J. Pelling, London, England. 
Prove the inequality 


Y(t py+ ey (p-r)(q-r) = 210+ pe?) (yp - 49), pr eR, 


where the sums and products are over cyclic permutations of p,g,r, and determine all the cases 
where equality holds. 


E 3075. Proposed by Duane M. Broline, University of Evansville. 


Let B, be the kth Bell number, the number of partitions of k objects. For k < n, show that 
y ee . 
a1, az,...,a,20 1% (a,!)2 (ay!) a n°(a,,!) 


a, +2a,+ +--+ +na,=n 
k!\ 


wee B 
hobae,30 (11) "*(B,!)(21) 2B!) «(KN *(B, 8) 


E 3076. Proposed by F. S. Cater, Portland State University. 


Let g be a continuous function from R to R, let f be a positive-valued continuously 
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differentiable function from R? to R, and let F(x) = {& f(x, y) dy. We know that if ¢g is 
constant, we can differentiate under the integral sign to obtain the derivative of F: 


, d a(x) g(x) ,, 
(*) F(x) == [PP (x, ») ay = fDi f(x,y) dy (all. x © R)). 
0 0 
Is the converse true in general? That is, if equation (*) holds for all x € R, must g be constant? 


E 3077. Proposed by Nicholas Passell, University of Wisconsin— Eau Claire. 


Let N be the natural numbers with the topology T consisting of @, N and the complements of 
finite sets. Characterize the continuous functions from R, the reals with the usual topology, to NV 
with the topology T. 


ADVANCED PROBLEMS 
Solutions of these Advanced Problems should be mailed in duplicate to Professor D, H. Mugler, Department of 


Mathematics, University of Santa Clara, Santa Clara, CA 95053, by June 30, 1985. The solver’s full post-office 
address should be on each sheet. 


6488. Proposed by Robert M. Young, Oberlin College. 


Show that if A,,A,,A3,... are the positive solutions of the equation tan x = x, then 
[o.@) 
1 
2_ 4+ 
Ye == 


6489. Proposed by Eliot Jacobson, Ohio University. 
Suppose that E and F are fields such that the additive groups (FE, +),(F, +) are isomorphic, 
and the multiplicative groups (£*, -),(F*, -) are isomorphic. Are EF and F isomorphic as fields? 
6490. Proposed by J. Sutherland Frame, Michigan State University. 
Show that 
y (ke +k) > =10- 27, 
k=1 
and more generally that 
00 [n/2] ; 
("Peta "=" 1) 4 y (*" 1 2] 
k=1 n—1 —1 


j=l 


| Br,(—4n?)//(2))! 
where B,, B,, B,... are the Bernoulli numbers 1/6, —1/30, 1/42, —1/30,5/66,.... 


SOLUTIONS OF ADVANCED PROBLEMS 
An Extension of Hadamard’s Inequality 


6430 [1983, 402]. Proposed by J. Borwein, Dalhousie University, Halifax, Canada. 


Recall that the p-trace average of a positive definite symmetric n X n matrix C (we write 
C > 0) is given by 
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definition by continuity for p = 0, oo, and — oo. Thus 
. l/fn, . 
fn(C): = (det), jug(C): = Any and jg (C): = Ay. 
Let A(C) be the diagonal matrix whose diagonal coincides with the diagonal of C. Show that 


(i) j,(C) < j,(A(C)) for —o0 <p <l, 
and 
(ii) j(C)>7(A(C))  forl<p<o. 


Note that for p = 0 this is the well-known Hadamard inequality. 


Solution by the proposer. Let { D,|k = 1,2,...,2”} be the set of diagonal matrices with entries 
of +1. Then 
gn 
A(C) =2°" »© D,CD,. 
k=1 


(See A. W. Marshall and I. Olkin, A convexity proof of Hadamard’s inequality, this MONTHLY, 89 
(1982) 687-688.) Moreover, C and D,CD,, share the same spectrum. Since j, is concave (convex) 
for p < 1 (p > 1) we have 


if M(C)) > (<)2" Y jp(DiCD,) 


= 2"¥ jC) =H(C), 


and this establishes (i) and (ii). 


Also solved by Michael Golomb; Olaf Krafft (West Germany); Marvin Marcus & John Bruno; Pei Yuan Wu 
(Republic of China); Richard P. Savage, Jr.; Alain Tissier (France); and Henry Wolkowicz (Canada). 


Irreducibility of the Poisson Distribution 


6431 [1983, 402]. Proposed by Gabor J. Szekely and Andras Zempleni, Budapest, Hungary. 


Show that if X and Y are independent nonnegative integer-valued random variables and XY 
has a Poisson distribution, then either X or Y takes at most two values, zero and one, with 
probability one. (In this sense the Poisson distribution is irreducible.) 


Solution by L. E. Mattics, University of South Alabama, Mobile. Let m and n be positive 
integers with n > 1. To be more general we replace the hypothesis that XY has a Poisson 
distribution by the hypotheses 

P(XY = m)> 0 for all m, 
and 
lim P(XY = nm)/P(XY = m) = 0. 


Since P(XY = 1) = P(X = 1) P(Y = 1), both P(X = 1) and P(Y = 1) are nonzero. Let g be 
a prime, then 
P( XY = q) = P(X =1)P(Y=q) + P(Y = 1) P(X = @). 
Hence we can assume that (say) 
P(X = q) > P(XY = q)/2P(Y =1) 


for infinitely many primes g. Consequently, 
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P( XY = ng) > P(Y =n) P(X =@q) 
> P(Y =n) P( XY = q)/2P(Y =1) 
for infinitely many primes qg, and under the (new) hypotheses this implies that P(Y = n) = 0. 
Also solved by Lajos Takacs and the proposers. 


A C® Function Supported on a Given Open Set 


6438 (1983, 486]. Proposed by J. L. Brenner, Palo Alto, California. 


Construct a differentiable real function that is positive at every rational point, but zero at an 
uncountable number of points, or prove that no such function exists. 


Solution by Ivan Netuka and Ji¥i Vesely, Prague, Czechoslovakia. The following construction 
due to L. E. May (Canad. Math. Bull., 12 (1969) 25-30) shows more generally that given any 
open set G, there is a C” function that is positive on G and zero elsewhere. The set G is the 
union of disjoint intervals (a,,5,), n = 1,2,.... Define 


f,(x) = exp(—(x — a,) °(b, — x)~’) 
for x € (a,,b,), and f,(x) = 0 elsewhere. Let 
M,, = sup{ |f{(x)|:x ER,0<k <n}. 
Then f = £°_,(f,/n’M,,) is positive on G and is zero elsewhere, and it is easily seen that f € C®. 


Also solved by 28 others. 


REVIEWS 


EDITED BY ALLAN L. EDMONDS AND JOHN H. EWING 
COLLABORATING EDITOR FOR FILMS: SEYMOUR SCHUSTER 


Books IV to VII of Diophantus’ Arithmetica: In the Arabic Translation Attributed to Qusta Ibn 
Liqa. By Jacques Sesiano. Sources in the History of Mathematics and Physical Sciences 
Number 3. Springer-Verlag, New York, 1982. xii + 502 pp. 


WILBUR KNORR 
Department of Classics / Philosophy Program in the History of Science, Stanford University, Stanford, 
CA 94305 


If Diophantus had not existed, no historian of ancient Greek mathematics would have invented 
him. Unlike the formal geometric treatises of Euclid, Archimedes and Apollonius, taken as 
representative of the characteristically Greek style of mathematics, Diophantus’ Arithmetica falls 
within the category of arithmetic theory. Yet as antiquity’s foremost crypto-algebraist, Diophantus 
is disappointing for his failure to treat this subject as one would later come to do. His aim is to 
present the solutions to arithmetic problems, via methods of varying degrees of generality, not the 
systematic exposition of a subject matter. Although he progresses from simpler to more complex 
problems, he seems not to attempt to define any clear lines of deductive structure in his work. One 
might wish to describe it as investigating the solutions to determinate and indeterminate systems 
of equations of first, second and third orders, but that conforms only tenuously with Diophantus’ 
selection and ordering of problems (cf. the survey list compiled by T. L. Heath [ Diophantus, 1910, 
260-266]). Moreover, in contrast with the modern field of “Diophantine”’ analysis, Diophantus 
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allows rational solutions, not just integral solutions, thus drastically reducing the difficulty, and 
hence the interest, of many of his problems. He is thus never called on to develop those methods 
which grace the finest of the Indian efforts in indeterminate analysis; indeed, the indeterminate 
first-order problem in two unknowns is so trivialized as not to merit inclusion at all. 

The fact that only six of the original thirteen books of the Arithmetica survive in Greek 
manuscripts had led scholars to speculate that the lost portions contained the more interesting and 
advanced materials. With the appearance of Jacques Sesiano’s edition of the Arabic version of 
Diophantus’ Arithmetica, we should have expected enlightenment on this issue. Instead, we 
encounter new puzzles, these primarily of the textual sort. First, the Arabic version presents itself 
through its own titles as being Books IV through VII of the Arithmetica; but comparison with 
Books IV through VI in the Greek reveals at once that no Book in the one sequence duplicates 
any in the other. (In view of this, I shall refer to the Arabic Books as IVA, VA, etc., and the Greek 
Books as IVG, VG, VIG.) Second, there are important stylistic differences, the Arabic version 
tending toward prolixity quite out of keeping with the austerely terse wording in the Greek. 

Either of these considerations could raise doubts of the authenticity of the Arabic Books. 
Sesiano establishes convincingly that the Arabic does indeed transmit a portion of Diophantus’ 
work, specifically, the original Books IV through VII [Sesiano, pp. 4—8]. For instance, Book IVA 
bears a preface stating that its project is to extend to the solid numbers the methods that the 
previous Books had applied to linear and planar numbers; thus, rather than introduce new 
methods, the following Books propose to inculcate skill in the manipulation of techniques already 
presented. As Sesiano notes, this is a fair characterization of the Arabic Books, for they adhere to 
the technical base established in Books I-III. By contrast, Book IVG begins at once with cubic 
problems handled via new methods. Clearly, Books: IVA-VIIA would be poorly suited for 
placement after IVG. In this way, speculations among older scholars about the scope of the 
missing portions of Diophantus’ work are partly settled [Sesiano, sect. 13; cf. Heath, pp. 6-12]: 
Tannery’s hope to discover interesting advanced materials in the lost Books of the Arithmetica are 
effectively dashed, as we realize that the four recovered Books represent no increment in technical 
level. On the other hand, the conjecture of Nesselmann and others, that the lost Books fit 
somehow within the sequence of those extant in Greek, has in part been sustained; but the gap 
turns out to be between Books III and IVG, rather than between Books II and III as they had 
surmised. 

On a second question, relating to the stylistic differences between the Arabic and Greek Books, 
Sesiano’s proposed explanation is, in my view, not altogether persuasive. Exploiting the ancient 
testimony that the mathematician Hypatia (d. 415 A.D.) had composed a commentary on 
Diophantus, Sesiano suggests that the Arabic Books were translated from a Greek text which had 
incorporated substantial materials drawn from some such “major commentary,” whereas, by 
contrast, the extant Greek recension represents a text much closer to the one produced by 
Diophantus himself [Sesiano, sect. 9, 10, and 12.2]. This view is intended to account for systematic 
differences of style which distinguish the Arabic version from analogous parts of the Greek. Even 
if the Arabic translators introduced certain changes of their own, as Sesiano allows, he maintains 
that the more important differences must be referred to their Greek source. 

Among the minor sort of discrepancies due to the translator is the manner of rendering 
numerical terms; the Greek employs a notational system comparable to our own [1], whereas the 
Arabic writes out in full the number-words; thus, the Arabic of VIA, 4 expresses the number 
2563001 as “two thousand thousands and five hundred and three and sixty thousand and one,” 
where the Greek must have held a numeral like “256 My 3001” (avs M’ya, where “My” stands 
for myriades, or “ten-thousands”). So too, the translator must be responsible for replacing 
Diophantus’ “syncopated” mode with the full “rhetorical” rendition of algebraic expressions. 
Thus, in VA,7 we read, “The sum of these two cubes is two thousand units and sixty powers [sc. 
2000 + 60 x7],... equal to two thousand and two hundred and forty units;” a comparable 
passage in IVG,1 reads, “Then the cubes will be Dy 30 Mo 250 [sc. 30 x* + 250]; these are equal 
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to Mo 370” [2]. In the standard view of the progression from “rhetorical” through “syncopated” 
to “symbolic” expressions in algebra, this might seem to be a backward step [3]. But in this the 
translator has correctly intuited that the lines in the Greek were intended to be read out in full, 
even if the text is highly abbreviated. His practice might thus be viewed as an aid to the student. 

Sesiano admits that these differences in style are due to the translator; but he wishes to view 
other aspects of style as borrowings from the “major commentator” built into the text used by the 
translator. Among these are the belabored algebraic manipulations in the Arabic, where compara- 
ble passages in the Greek omit intermediate steps; the insertion of full statements of algebraic 
identities in the Arabic, where they would be tacitly assumed in the Greek; and the full working 
out of the “proof” in the Arabic, where the Greek merely states the solutions. Sesiano argues that 
since amplifications of these sorts are encountered in other Arabic renderings of Diophantus, in 
particular, those of al-Karaji, apparently independent of the present translation by Qusta, these 
must have been held in the Greek prototype. 

To support his view, Sesiano calls attention to several particular instances where comparable 
features in the Arabic indicate a basis in the Greek [Sesiano, sect. 10]. For instance, in VIA,4 one 
of the numbers of the solution is stated as 2563001 (that is, written out in full, as noted above); 
but the correct answer is in fact the product of 251 times 63001. The slip seems possible only 
through the mediation of a numeral notation like that employed in the Greek of Diophantus. 
Thus, an expression like “251 times 6 My 3001” might be wrongly construed as “256 My 3001” 
via a scribal deletion, either by a Greek scribe or the Arabic translator. What seems clear, 
however, is that the Greek text must at one point have included the expression “251 times 63001,” 
and this is precisely the kind of additional detail in the working out of the solution which 
distinguishes the Arabic from parallels in the Greek. 

A first reservation one may have about this use of the hypothesis of the “major commentary,” 
however, is that the sorts of changes at issue here are of such a low level as not to require any real 
mathematical insight. Retrieving the commentator via stylistic comparisons of the Arabic and 
Greek texts would thus isolate an essentially trivial mind. This is in direct conflict with ancient 
testimonies of Hypatia’s high caliber as a philosopher and mathematician. Given that we possess 
not a single directly attested word from her works, I think that historical fairness, if nothing else, 
should lead us to resist this project of reconstruction. 

But perhaps this is mere sentimentality. A consideration of certain textual issues, however, 
occasions deeper misgivings. Sesiano would maintain that the manuscripts available to the 
9th-century Arabic translators held a revised edition of Diophantus, influenced by the commen- 
tary of Hypatia, whereas the manuscripts available to the 13th-century Byzantine scholars, 
principally Maximus Planudes, on which the extant Greek tradition is based, represented the 
older, pre-Hypatian form of the text [4]. But, as a rule, new recensions of technical works largely 
displaced the older versions. The textual history of works of Euclid and Apollonius, for instance, 
followed this basic pattern [5]. Sesiano’s position thus conflicts with our evidence relating to the 
transmission of ancient technical works. 

I cannot hope to develop here an alternative view of the relation of the Greek and Arabic 
versions of Diophantus, but perhaps I can indicate the direction one might pursue. The Planudean 
manuscripts present not only a text of Diophantus, but also extensive scholia, attributed to 
Planudes himself, amounting to a running commentary on Books I and II. These scholia appear 
sometimes at the ends of the propositions they relate to, sometimes in the margins, and attempt to 
amplify the text, for instance, by filling in the intermediate steps of algebraic resolutions, inserting 
the statements of identities assumed in the text, working out in full the proofs of the solutions, 
and so on. These are comparable to the kinds of amplification found in the Arabic version of 
Diophantus. Although elaborations like these are obvious enough, we are no more compelled to 
suppose that Planudes originated them all in his commentary, as to suppose that Qusta originated 
the same in his translation. We can as readily conceive that the manuscripts at Planudes’ disposal 
already contained interpolated comments of this sort, and that he gleaned from these the materials 
of his commentary, occasionally adding remarks of nis own. We would suppose, further, that the 
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earlier manuscripts were in comparable condition, so that the 9th-century Arabic translators had 
before them both the text and its accompanying marginal commentary. They would of course 
recognize the pedagogical value of this marginal material, and so, quite naturally, could attempt to 
incorporate it into their translation. In this way, the Arabic and Greek recensions would owe their 
stylistic differences not to essential differences in the source texts utilized, but rather to differences 
in editorial technique. 

Although I have thus expressed doubts about one of Sesiano’s central interpretive positions, his 
views over all appear to be quite sound. More important for the practical utility of his edition is 
his care to separate his interpretations from the body of the text and translation. In this he parts 
company with the major 19th-century editors, whose penchant for bracketing purported interpola- 
tions, for instance, now enormously complicates the use of their editions. The reader can thus 
consult Sesiano’s text without concern over prejudgment, yet if he choose, can readily locate the 
places in the introduction and commentary where Sesiano presents his own views. The translation 
is conveniently cross-referenced to the text via skillful use of ine numbers. The introduction 
presents, among other things, a nice résumé of the textual history of arithmetic theory in Greek 
and Arabic. Useful study aids include a list in modern notation of problems from the Greek and 
Arabic Books of Diophantus and an Arabic lexicon, with frequent noting of Greek equivalents. 
Doubtless, many readers would have preferred a more extensive noting of English equivalents 
than appears in this lexicon, and this is, regrettably, but one of several ways in which Sesiano fails 
to make accommodations for the general reader. His edition will thus not readily serve as an 
introduction for the nonspecialist. Nevertheless, used in conjunction with the Greek edition of 
Tannery and the standard translation-commentaries (e.g., those by Heath, Ver Eecke and 
Czwalina), Sesiano’s work will provide a valuable instrument for renewed Diophantus studies. 

With reference to Sesiano’s ample bibliography, two salient omissions should be noted. First, 
F. Sezgin’s Geschichte des arabischen Schrifttums, V: Mathematik (Leiden, 1974) has become an 
indispensable reference resource for the study of Islamic mathematics, and provides capsule 
accounts and bibliographical guidance to manuscripts, editions and secondary literature for most 
of the mathematicians mentioned by Sesiano. Second, there are no citations of contributions by 
Roshdi Rashed, save for his edition of the algebra of Samaw’al (co-edited with S. Ahmad, 1972). 
Among the pertinent items would be Rashed’s article on al-Karaji (Dictionary of Scientific 
Biography, 7, pp. 240-246); his survey of the Arabic recension of Diophantus (Revue d’ histoire 
des sciences, 1974, 27:97—122 and 1975, 28:3--30); and his Arabic edition with Arabic commentary 
of the same manuscript now edited by Sesiano (Cairo, 1977). It is clear from Sesiano’s review of 
Rashed’s edition (Isis, 1977, 68:627—630) that he finds much therein to criticize. But Sesiano’s 
present silence on these prior efforts is surely a poorer solution to a delicate critical problem than 
the one he adopted in his earlier essay on Diophantus (DSB, 15, pp. 118-122), and I wish he had 
reconsidered this extraordinarily ungracious gesture. 


Notes 


1. The Greek numeral system employs the consecutive letters of the alphabet to stand for the digits from 1 to 9, 
the tens from 10 to 90, and the hundreds from 100 to 900; a numeral will thus consist of the figures from each order 
which sum to the desired value. Auxiliary strokes and signs permit the expression of thousands and ten-thousands. 
Although lacking the place notion, it is comparable to our own system for the purposes of rational operations. For 
an account, see T. L. Heath, A History of Greek Mathematics, Oxford, 1921, I, pp. 31-40. 


2. The abbreviations parallel those adapted in the Greek: Mo = forms of monas (“unit”); Dy = dynamis 
(“ power,” that is “second-power,” or “square”); the Greek usually expresses addition via simple juxtaposition. 


3. This view of the evolution of algebra is notable in G. H. F. Nesselmann’s Die Algebra der Griechen, Berlin, 
1842 and is elaborated by J. Klein in his Greek Mathematical Thought and the Origin of Algebra, Cambridge, Mass., 
1968. 


4. Sesiano [pp. 16-20] and Heath, Diophantus, pp. 14-19. 


5. Euclid’s Elements, Apollonius’ Conics and Ptolemy’s Syntaxis, for instance, led to the eventual extinction of 
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their precursors. The editions of Euclid’s Elements and Optics by Theon of Alexandria (4th century A.D.; father of 
Hypatia) overwhelmed the tradition of these works, without entirely supplanting the older recensions. The 
commentaries by Eutocius of Askalon (early 6th century) affected the tradition of Archimedes’ and Apollonius’ 
works. This phenomenon of displacement is neither surprising nor deplorable. Most of these writers were engaged in 
the preparation of textbooks serving their students’ and colleagues’ needs, not of philologically pure texts. By 
implanting these revised versions into their own tradition of teaching, they merely secured their survival at the 
expense of the earlier versions. 


Selecta: Expository Writing. By P. R. Halmos. Springer-Verlag, New York, 1983. xix + 304 pp. 


R. P. Boas 
Department of Mathematics, Northwestern University, Evanston, IL 60201 


There are often said to be analogies between mathematics and music, but it is less often 
realized that there are analogies between departments of mathematics and of music. Consider the 
complaint of the music critic Andrew Porter (The New Yorker, August 16, 1982) about a 
contemporary composer whose work appeals to the “middle-brows,” people who “know what 
they like and like what they know.” His worry is that if these Philistines get what they want, 
standards will fall and progress in music will eventually cease. The parallel with mathematics is 
rather clear, if we identify the Philistines with the users of mathematics, whether they are the 
students who do not intend to become creative mathematicians, or the people who want to use 
mathematics in solving their own practical problems. 

As a musical Philistine, I perceive weaknesses in Porter’s position. Most obviously, musicolo- 
gists are paid to devote time to studying avant-garde music and learning to appreciate it. 
Unfortunately for the rest of us, they do not spend much time explaining to us what it is all about 
(they are not paid to do that). We are not helped by just being asked to listen to the stuff. 

The mathematical Philistines may reasonably make the same complaint about the avant-garde 
mathematicians who are (quite understandably) more interested in advancing mathematics than in 
explaining it. Even those who understand it and could explain it are usually inhibited because the 
folks who give out money are interested only in published research, not in exposition. Halmos (in 
one of the essays in this book) makes the point that doing research is important for teachers—but 
he doesn’t say (or even imply) that it should be publishable research. The president of my 
university, on the other hand, is on the record as saying that “publish or perish” means that if you 
don’t publish research, you will automatically perish as a teacher. (This is clearly an over-simplifi- 
cation, if only because most professors at most universities have little chance to teach anything 
except a pretty standard curriculum.) It is no wonder that modern mathematics (like modern 
music) has the reputation of being abstruse, incomprehensible, and remote from reality. There are 
other barriers against the good exposition of mathematics, but readers of this journal can think of 
them for themselves. 

It is at least plausible that students of mathematics, and especially the users of mathematics 
who apply new mathematics—even without having read the proofs—to solving practical prob- 
lems, would benefit from more good exposition, but few creative mathematicians spend much 
time explaining what they and others have done. There are some honorable exceptions, of whom 
Halmos is an outstanding example. Not only is he unafraid of having people look down on him 
for explaining advanced mathematics in a comprehensible way, he evidently enjoys explaining it, 
and he does it superbly. In his book you will find not only samples of his expository writing at 
various levels, but instructions on how to do it, as well as inspirational sermons and stimulating (if 
possibly irritating) comments on the mathematical scene. I regret that he did not choose to include 
his spirited defense of complex analysis, written in response to allegations that the subject is 
outmoded (Notices Amer. Math. Soc., vol. 18 (1971) 69). If more mathematicians would follow 
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Halmos’s example and attend to his precepts, there might be less mathematical writing, but it 
would be more comprehensible. There is—most of us know it but are seldom willing to admit 
it—such a thing as good mathematical style, and Halmos is one of the small number of its 
exemplars. Whether you are a serious mathematician who is inclined to look down on “middle- 
brows,” or you are looking for information about some mathematics beyond what you were 
taught or what you specialize in, or you are willing to listen to a master stylist explain how to 
write, or you just want to be entertained by the conversation of an interesting personality—read 
this book! 


Group Analysis of Differential Equations. By L. V. Ovsiannikov. Academic Press, Inc., New York, 
1982. xvi + 416 pp. $54.00. 


WILLARD MILLER, JR. 
School of Mathematics, University of Minnesota, Minneapolis, MN 55455 


Group analysis of differential equations was inaugurated and developed by the Norwegian 
mathematician Sophus Lie in the closing decades of the nineteenth century. It is based on one of 
the most useful concepts in mathematics, that of a group G acting on a manifold M: a 
transformation group. (For p€ M, g€G we denote the group action by p —> pg € M.) 
Associated with this concept are such intuitive notion’ as the orbit of a point p € M (the set of 
all points pg in M as g runs over all group elements) and an invariant point p (pg =p, 
Vg © G). Furthermore, a function f on M is invariant under G if f( pg) = f(p) for all p € M, 
g € G and the equation f( p) = 0 is invariant under G if f( pg) = 0 whenever f(p) = 0 for all 
g © G,1.e., pg is a solution whenever p is a solution. 

In order to apply these notions to differential equations, the basic concepts are modified and 
specialized in three ways. First it is assumed that G is a real analytic group (a Lie group) and that 
the coordinate transformations p — p’ = pg are C®™ functions. This permits the application of 
ideas from calculus and leads to the construction of Lie algebras of Lie derivatives (infinitesimal 
group transformations). Second, increased generality is obtained by merely assuming that G is a 
neighborhood of the identity element in an analytic group and that M is a local coordinate patch 
with real coordinates z = (Z,,...,Z,,) for p. These two assumptions amount to the statement that 
G is a local Lie transformation group on M. Finally, it is assumed that M = X xX Y where the 
local coordinates for X are the independent variables and the local coordinates for Y are the 
dependent variables. For example, if dim X = n, dim Y = 1 coordinates of point p would be 
(X1,--+,X,>¥) = (x, y). Now suppose y = u(x) for some C™ function u and consider the group 
transformation (x’, y’) = (x, y)g. It is a straightforward consequence of the chain rule that the 
action of G as a transformation group can be extended to the manifold M, = X X Y,, where the 
dependent coordinates in Y, can be taken as (y, Vis Vj joes ee Vip veg de hess the partial derivatives of 
y up to order k. This extension of the group action to Ath order derivatives of y is known as the 
k th order prolongation of G. 

A kth order differential equation in y can be considered as an equation f( p,) = 0, p, © M,. 
We say that G is a symmetry group for this equation provided the equation is invariant, i.e., 
provided p,g is a solution for each g € G whenever p, is a solution. Knowledge of a symmetry 
group G for a differential equation can prove extremely useful. Given any solution of the equation 
one can generate an orbit of additional solutions by application of the symmetry transformations. 
Special solutions can be obtained through the requirement that they are invariant or partially 
invariant under the action of G. (Many of the special functions of mathematical physics arise 
exactly in this manner and a knowledge of G is crucial in understanding the properties of such 
functions.) Through the classical theorem of Emmy Noether (1918) and its modern generaliza- 
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tions, one can use certain symmetries of Euler-Lagrange equations to obtain conservation laws for 
these equations. (In particular this method can be used to generate an infinite number of 
conservation laws for the Korteweg-de Vries equation.) The technique of separation of variables 
for second order linear partial differential equations can be completely understood in terms of 
symmetries, as can the technique of dimensional analysis for the equations of mechanics. In short, 
Lie’s symmetry approach to differential equations has turned out to be enormously useful. Yet the 
full ramifications of this approach are far from exhausted and new work appears apace. 

Among the contributors to this field in the past 25 years no one has been more influential than 
Ovsiannikov. In order to apply group theoretical methods to obtain information about the 
solutions of a given differential equation, one needs to know the “maximal” symmetry group of 
the equation. Lie solved this problem through the use of infinitesimal generators for the group 
action and reduced the computation of the symmetries of a fixed (nonlinear) differential equation 
to the solution of a system of linear differential equations. Ovsiannikov’s work has centered on the 
refinement of Lie’s methods in order to determine the detailed structure of the maximal symmetry 
groups for partial differential equations of physical interest, and especially to classify the invariant 
and partially invariant solutions of these equations. 

This book (a translation of the 1978 Russian edition) contains a complete exposition of the 
basic Lie theory of local transformation groups and the determination of the groups which can be 
the symmetry groups of differential equations. A novelty of the treatment is that Lie’s three 
fundamental theorems on transformation groups are proved in the more general setting of Banach 
(generally infinite dimensional) Lie groups and algebras. The survey of applications is limited 
primarily to the construction of partially invariant solutions, classification of second-order linear 
equations, conservation laws and other areas where the author has made personal contributions. 

This present book is a considerable extension and updating of the author’s influential 1962 
monograph “Group Properties of Differential Equations” (in Russian). A generation of English 
speaking applied mathematicians studied this monograph through the help of a 1967 translation 
by G. W. Blumen which was circulated in mimeographed form. The greater degree of generality 
and abstraction (particularly the use of Banach Lie groups) employed by the author make the 
updated work less accessible to beginners than the 1962 original. Nevertheless, the general 
availability of Ovsiannikov’s work in English translation is a very welcome development for wide 
dissemination of this beautiful and useful theory. 


The Lady or the Tiger? and Other Logic Puzzles. By Raymond Smullyan. Knopf, New York, 1982. 
ix + 226 pp. 


STEWART SHAPIRO 
Department of Philosophy, Ohio State University, Newark, OH 43055 


As indicated by the title, this is a book of logical and mathematical puzzles. Its author, an 
eminent logician, is developing a much deserved reputation among the general public as a 
puzzle-master (thanks, in part, to Johnny Carson). I confess a previous addiction to logic puzzles 
and hereby admit that I thoroughly enjoyed the book. 

In this review, I would like to address a rather strong assertion made in the Preface (p. vii): 


So many people I have met claim to hate math, and yet are enormously intrigued by any logic or math 
problem I give them, provided I present it in the form of a puzzle. I would not be at all surprised if good 
puzzle books prove to be one of the best cures for so-called “math anxiety.” Moreover, any math treatise 
could be written in the format of a puzzle book! 


I would tend to agree in part, and disagree in part, with this pedagogical statement. First, the 
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disagreement: I doubt that puzzles will help those with severe cases of math anxiety. My 
experience is apparently different from that of Professor Smullyan. Among people I know, 
virtually all who enjoy logic puzzles also enjoy mathematics (and are good at both). Those who 
hate math also hate puzzles. In fact, an attempt to introduce a logic puzzle to a friend who suffers 
from math anxiety typically produces confusion, frustration, and unsolicited statements like “You 
know I hate math.” Often, he or she will even have trouble following the given solution. The 
reason for this, of course, lies in the close analogy between puzzles and mathematics and may shed 
some light on the nature and causes of math anxiety. In principle, both mathematical discourse 
and logic puzzles involve long, directed, and focused stretches of careful reasoning from clearly 
stated premises. In this way, both differ from ordinary nonmathematical thinking. Moreover, 
some logic puzzles (especially those of the present book) invelve concepts nearly as abstract as 
those of mathematics. 

The extent of my agreement with the above quoted passage is also due to the close relationship 
between mathematics and logic puzzles. Professor Smullyan’s books themselves demonstrate that 
certain mathematical ideas can be presented (to those receptive) through puzzles in an entertain- 
ing and engaging manner. One of the purposes of the book presently under review is to illustrate 
Gédel’s incompleteness theorem and Tarski’s theorem on the formal undefinability of truth. I 
found this a fascinating, thought-provoking, and insightful account and admit that I learned from 
it, but it is hard to judge the extent to which an average lay reader can grasp the mathematical and 
philosophical conclusions of this aspect of the book. It is clear, however, that certain more basic 
mathematical concepts and techniques are given lucid exhibitions in this puzzle book. In several 
places, for example, many instances of a certain type of problem are presented and the solutions 
are carefully shown to be similar. Then a “meta-problem,” which generalizes the previous work is 
formulated and solved. The reader thus encounters the mathematical technique of generalization, 
the notion of “theorem,” and, through the solutions, the notion of rigorous deductive proof. I 
suggest, then, that puzzle books such as this can go a long way toward illuminating at least some 
of the essential elements of mathematical activity. 

It might be added that such books can also have philosophical value. In particular, many of the 
sections of this book help shed light on the common notions of truth, knowledge, and belief by 
drawing out interesting and counterintuitive consequences of several prima facie plausible doxical 
and epistemic doctrines. 


LETTERS TO THE EDITOR 


Material for this department should be prepared exactly the same way as submitted manuscripts (see the inside 
front cover) and sent to Professor P. R. Halmos, Department of Mathematics, University of Santa Clara, Santa 
Clara, CA 95053. 


Editor: 


In their paper, ““Newton’s method, circle maps, and chaotic motion”, published in the January 
issue of this MONTHLY, Donald Saari and John Urenko study the convergence properties of 
Newton’s method applied to Real Polynomials with Only Real Roots (RPORR) and to poly- 
nomial-like functions. Their results are generalizations of Barna’s theorem: If fis a RPORR, then 
the set of nonconverging points S, is a Cantor set with Lebesgue measure Zero. 


In [1], we stated a theorem which gives more information on the dynamics of the Newton 
transform N on S,. The proof is based on a symbolic representation closely related to the 
representation introduced in Saari’s and Urenko’s paper. 
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Let A be an alphabet having n — 1 symbols and let S be the set of sequences formed using 
elements in A such that two consecutive elements are different. We show that the restriction of N 
to S$, is topologically equivalent to the shift acting on S. This can be extended to Saari’s and 
Urenko’s results. 


Two important facts can be deduced from all these results: 


1. The Newton transform of two nth degree RPORR are topologically equivalent. 
2. Newton’s method applied to RPORR converges almost everywhere. 


These two remarks lead to very efficient implementations of an algebraic equations solver 
based on Newton’s method. 

It is commonly thought in the numerical analysis of algebraic equations that the higher the 
order of convergence of a method, the narrower is its domain of convergence. In case of Newton’s 
method applied to RPORR, the order is two and the domain is almost all of R. Contrary to the 
common perception, however, there exists a globally convergent third order method-Laguerre’s 
method [2]: 


nP(X,) 
P(X) +LH(X) | 

where © = sign (P’(X)) and H(X) = (n— 1)?P”(X) — n(n — 1)P(X)P”(X). This method 
applied to RPORR converges for every starting point in R. We do not know if there exists a 
fourth order method globally convergent for such polynomial equations. 


X41 = L(X;) = X, = 
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CLEAR THEORY 


JEREMY GRAY 
The Open University, Milton Keynes MK7 6AA, England, United Kingdom 


Explanations (lectures, articles, or books) in mathematics come in a variety of forms. As a 
discrete attempt to impose a valuation upon them, one is naturally led to make the following 
distinctions. 

An explanation is globally clear if the overall structure of the argument is apparent; it is /ocally 
clear if it can be followed line by line. 

An argument that is locally clear except for a small (finite?) set of deductions may be said to be 
generically clear. An argument may be said to be asymptotically clear if it becomes clearer upon 
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repetition. Rather worse are the arguments that are stochastically clear; these are arguments that 
are only occasionally locally clear and then only clear apparently at random. 

An argument is uniformly clear if it is clear for everyone. A stronger condition is that an 
argument be stably clear; this is the case when its clarity survives repetition by an original 
member of the audience. The clarity of an argument that does not survive indefinite repetition will 
be said to be unstable; if, however, something always survives, the clarity may be said to be 
semi-stable. 

It is often the case that an argument only becomes clear on localizing at an example; one may 
then say the argument contains the germ of a true idea. Or, by severely abusing notation, one may 
say the argument has a /ocal ring to it. 
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LISP: THEY MAY ALSO SERVE, WHO DECLINE TO ENLIST... 


ROBERT M. BAER 
379 Countyview Drive, Mill Valley, CA 94941 


Last year we were told that “Lisp is crisp” [1], and this year we are told that LISP “is the only 
language in which civilized people write computer programs” [2]. (There is a counter-view to the 
effect that civilized people view with distaste any language in which assignment and equality are 
represented by the same symbol.) LISP has recursion, but its overhead is notorious. Its primary 
(perhaps only) virtue is that list-processing capability (but, then, there are all those parentheses... ). 
In any case, we put the question to our desktop verse-processor, and got the following output: 


Adrift on seas of parentheses 


LISP isn’t crisp but essentially soggy; 
Parentheses piled up make you feel groggy. 


Deciphering LISP would apparently be 
More work than Minoan Linear B. 


As for cond (aka if) and cdr and car, 
Like heavy seas they’re best viewed from afar. 


And vertigo thrives in anticipation 
Of waves upon waves of Polish notation. 


LISP for list-processing: cause to abort. 
(Is it listing to starboard or listing to port?) 


Yes, recursion is nice 
To put puzzles on ice. 


(But they dole out machine time, ration by ration, 
So Pll skip the recursion and use iteration.) 
References 


1. D. R. Hofstadter, Metamagical Themas, Scientific American, 248 (1983), #3, 22-30; #4, 14-28. 
2. M. Wand, What is LISP?, this MONTHLY, 91 (1984) 32-42. 


CALCULUS 


DENNIS BERKEY, Boston University 


SAUNDERS COLLEGE PUBLISHING... 


1984 * 1200 pages * hardcover * ISBN 0-03-059522-3 
Instructor's Solutions Manual * Student Solutions Manual * Computer Test 


Bank (with computer graphics) 


«x also available: T.A’.S HANDBOOK, by William Perry and Douglas Jewett, 


both of Texas A&M University 


HERE’S WHAT YOUR COLLEAGUES ARE SAYING ABOUT THIS 


OUTSTANDING NEW TEXTBOOK... 


“The major strengths of Berkey Calculus are its clearness, good examples, 
readability for students, problems, and illustrations.” 
——Paul W. Haggard, East Carolina University 


“The text was very readable and the illustrations excellent; the text was written 
at a level appropriate to a beginning calculus class.” 


— Ellen Ferrell, Pima Community College 


“The number and variety of exercises are outstanding.” 
—Louie Huffman, Midwestern State University 


SUPPLEMENTS... 


MICRO-CALCULUS: A GRAPHING AND TUTORIAL PROGRAM 
By TECHNICAL EDUCATIONAL CONSULTANTS 


This two-diskette package for the Apple 
Il+ and /le features graphic routines that 
are selected by the user. It contains illus- 
trative problems and examples drawn 


from differential and integral calculus that 
can be used in a classroom or individually 
as a tutorial. 


1984 ¢ ISBN 0-03-062821-0 


INTERFACE: CALCULUS AND THE COMPUTER, Second Edition 


By DAVID A. SMITH, Duke University 


Designed for use in computer labs, this 
computer enrichment text for beginning 
calculus introduces students to the appli- 
cations of calculus to numerical compu- 
tations. Most of the text is language inde- 
pendent; some elements of BASIC pro- 


gramming are included. An instructor's 
disk, which contains the computer pro- 
grams, is available for the Apple Il, //e, the 
IBM PC, or the TRS-80. 

1984 « 250 pages ¢ paper 

ISBN 0-03-070663-7 ¢ Instructor's Disk 


* SAUNDERS COLLEGE PUBLISHING 


A COMMITMENT TO EXCELLENCE 


PRESENTING SAUNDERS 1985 MATHEMATICS LIST... 


NEW! AN INTRODUCTION 

TO LINEAR ALGEBRA WITH 
APPLICATIONS 

STEVE ROMAN, California State Uni- 
versity at Fullerton 

Emphasizes motivation and contains 
abundant examples. | 
1985 ¢ 528 pages * Hardcover 

e ISBN 0-03-064017-2 

¢ Student Solutions Manual 


NEW! STATISTICS AND 
PROBABILITY AND THEIR 
APPLICATIONS 

PATRICK BROCKETT, University of 
Texas at Austin and ARNOLD LEVINE, 
Tulane University 

Covers sampling theory and surveys. 
1985 © 624 pages « Hardcover 

e ISBN 0-03-053406-2 

e Instructor's Manual ¢ Study Guide « Test 
Bank « Computerized Test Bank 


NEW! BASIC MATHEMATICS, 

A REVIEW Second Edition 

JACK BARKER, JAMES ROGERS, and 
JAMES VAN DYKE, al! of Portland Com- 
munity College 

Designed for students in need of review, 
this worktext covers arithmetic and ele- 
mentary algebra in one course. 

1985 ¢ 656 pages « Paper 

e ISBN 0-03-071588-1 

e Instructor's Manual ¢ Test Bank 


NEW! PRECALCULUS 
MATHEMATICS Second Edition 
THOMAS. W. HUNGERFORD, Cleveland 
State University and RICHARD 
MERCER, Wright State University 

1985 © 624 pages * Hardcover 

e ISBN 0-03-00843-3 

¢« Computerized Test Bank « Test Bank e 
Instructor's Manual 


A division of CBS Educational and Professional Publishing 
383 MADISON AVENUE e NEW YORK, NY 10017 


NEW! MATHEMATICS WITH 
APPLICATIONS TO BUSINESS, 
ECONOMICS, AND SOCIAL 
SCIENCES 


RICHARD BOULDIN, University of Georgia 
1985 © 784 pages * Hardcover 

e ISBN 0-03-062164-X 

e Instructor's Manual « Test Bank e Com- 
puterized Test Bank 


NEW! CALCULUS WITH 
APPLICATIONS TO BUSINESS, 
ECONOMICS, AND SOCIAL 
SCIENCES 


RICHARD BOULDIN, University of Georgia 
1985 ¢ 544 pages * Hardcover 

e ISBN 0-03-069764-6 

e Instructor's Manual « Test Bank « Com- 
puterized Test Bank 


AND DON'T FORGET THESE 
OUTSTANDING TEXTS... 


ELEMENTARY LINEAR ALGEBRA 


J.A. THORPE and PG. KUMPEL, both of 
SUNY, Stony Brook 


1983 © 512 pages ¢ Hardcover 
e ISBN 0-03-061249-7 

e Instructor's Manual 

¢ Student Solutions Manual 


DIFFERENTIAL EQUATIONS: 
A FIRST COURSE 


MARTIN GUTERMAN and ZBIGNIEW 
NITECKI, both of Tufts University 


1983 © 650 pages e Hardcover ¢ ISBN 
0-03-062502-5 « Student Solutions Manual 


HOW TO ORDER: 

For examination copies, please contact your 
local Saunders College Publishing sales repre- 
sentative or write on your college letterhead to: 
Karen S. Misler, S14, Saunders College Pub- 
lishing, PO. Box 36, Lavallette,| NJ, 08735. 
Include your course title, enrollment, and text 
currently in use. To expedite shipping, please 
include the ISBN (International Standard Book 
Number) for each item requested. 


S14-2AMM-5 


DEVELOPMENTAL STUDIES 


Now Available! 


* BASIC ALGEBRA FOR COLLEGE 


STUDENTS 

Second Edition 

Lawrence G. Gilligan, University of 

Cincinnati 

Robert B. Nenno, Monroe Community College 
Anthony S. Pruchnicki, Jr., Mattatuck 
Community College 

1985 Paper 398 pages 

Instructor's Guide with Tests 


INTERMEDIATE ALGEBRA FOR 


COLLEGE STUDENTS 

Second Edition 

Gilligan/Nenno 

1984 Paper 484 pages 

Text with Apple® Disk/Instructor’s Guide 


Now Available! 


* INTERMEDIATE ALGEBRA 


Marcus M. McWaters and You-Feng Lin, both of 
the University of South Florida 

1985 Cloth 450 pages est. 

Instructor's Guide with Tests/ 

Student Study Guide/Student Solutions Guide/ 
Solutions to Even-Numbered Exercises 


ARITHMETIC 

David Novak, Simmons College 

1983 Paper 514 pages 

Instructor's Guide with Tests/Student Guide to 
Margin Exercises/Answer Key/Computerized 
Testing Package/Video Tapes 


INTRODUCTORY ALGEBRA 

Novak 

1983 Paper 586 pages 

Instructor's Guide with Tests/Student Guide to 
Margin Exercises/Answer Key/Computerized 
Testing Package/Video Tapes 


INTERMEDIATE ALGEBRA 

Novak 

1983 Paper 668 pages 

Instructor’s Guide with Tests/Student Guide to 
Margin Exercises/Answer Key/Computerized 
Testing Package/Video Tapes 


ARITHMETIC FOR COLLEGE 


STUDENTS 

Fourth Edition 

Wright 

1983 Cloth 386 pages 
Instructor’s Guide/Student Workbook 


Top-notch texts 


APPLIED MATHEMATICS 


Now Available! 


* MATHEMATICAL APPLICATIONS 


For Management, Life, and Social 
Sciences 

Second Edition 

Ronald J. Harshbarger and James J. Reynolds, 
both of The Pennsylvania State University, 
Beaver Campus 

1985 Cloth 752 pages est. 

Solutions Guide/Answer Key 


APPLICATIONS OF COLLEGE 
MATHEMATICS 

Management, Life, and Social 
Sciences 

Stancl/Stancl 

1983 Cloth 720 pages 


Instructor’s Guide/Computer Problem-Solving 
Guide 


UNDERSTANDABLE STATISTICS 
Concepts and Methods 

Second Edition 

Brase/Brase 

1983 Cloth 489 pages 


BRIEF CALCULUS WITH 
APPLICATIONS 


Larson/Hostetler 
1983 Cloth 621 pages 
Answer Key/Student Solutions Manual 


BRIEF CALCULUS WITH 


APPLICATIONS 

Alternate Edition 

Larson/Hostetler 

1983 Cloth 433 pages 

Answer Key/Student Solutions Manual 


For details or sample copies, 
call us toll free: 800-225-1388. 
In Massachusetts, call collect: 617-863-1948. 


in Mathematics, 


CALCULUS 


CALCULUS AND ANALYTIC CALCULUS 

GEOMETRY Larson/Hostetler 

Second Edition 1982 Cloth 949 pages 

Philip Gillett, University of Wisconsin Answers to Even Numbered Exercises/Study 
1984 Cloth 915 pages and Solutions Guide/Three Volume, Complete 
Three Volume, Complete Solutions/Selected Solutions 


Solutions/Introduction to Linear Algebra 


PRECALCULUS 
* ALGEBRA AND TRIGONOMETRY = * PRECALCULUS 


Roland E. Larson and Robert P. Hostetler, both Larson/Hostetler 
of The Pennsylvania State University, 1985 Cloth 638 pages est. 
The Behrend College Instructor’s Guide 


1985 Cloth 638 pages est. ’ 
Study and Solutions Guide/Instructor’s Guide * TRIGONOMETRY 
Larson/Hostetler 


* COLLEGE ALGEBRA 1985 Cloth 332 pages est. 


Larson/Hostetler Instructor’s Guide 
1983 Cloth 438 pages est. 
Instructor's Guide/Study and Solutions Guide 


“The main strength is that extra work is devoted to certain topics used in calculus. 
Some of these topics include: (1) graphing rational functions, (2) partial fraction 
decomposition, (3) trigonometric identities and formulas. ’’ 

Professor William B. Jones — University of Colorado, Boulder 


and Computer Science... 


Now Available! 

* PASCAL PLUS DATA STRUCTURES, »* STRUCTURED COBOL 
ALGORITHMS, AND Gary Haggard, University of Maine at Orono 
ADVANCED PROGRAMMING Wendell Jones, United States Military Academy 
Nol Dal, Universi of Texas at Austin wruchecues 
1985 Paper 400 pages est. 

Instructor’s Guide Shaping the future 
epomenoegpe  e e 

Nell Dale and David Orshalick, 6 D. C. HEATH AND COMPANY 
both of the University of Texas at Austin College Division 

1983 Paper 470 pages 125 Spring Street 


Instructor's Guide Lexington, MA 02173 


Publications by the 


Mathematical 
Association 
of America 

1529 Eighteenth St., NW 
Washington, DC 20036 
202 387-5200 


New from the Carus Mathematical 
Monograph Series... 


From Error Correcting Codes through Sphere Packings 
to Simple Groups, 

by Thomas M. Thompson 

224 pp. Hardbound 

List: $21.00 MAA Member: $16.00 


Two of the most fascinating problems to challenge mathema- 
ticians in recent years concern the construction of data trans- 
mission codes that can correct errors introduced by static and 
the search for efficient ways to pack ping-pong balls into a 
box. Can one design the best error-correcting codes? Can one 
find the most efficient sphere packing? 

Therein lies a fascinating story which is told with great 
skill and clarity in this important addition to the Carus 
Mathematical Monograph series. The author has packed (sic) 
into 175 pages all of the basic mathematical ideas of this saga 
woven into a gripping historical account of the journey from 
error-correcting codes to sphere packings to simple groups. 


Table of Contents 


Chapter 1. The Origin of Error-Correcting Codes 

Chapter 2. From Coding to Sphere Packing 

Chapter 3. From Sphere Packing to New Simple Groups 

Appendix 1. Densest Known Sphere Packing 

Appendix 2. Further Properties of the (12,24) Golay Code and 
the Related Steiner System S (5,8,24) 

Appendix 3. A Calculation of the Number of Spheres with 
Centers in A, adjacent to one, two three, and 
four adjacent spheres with centers in A). 

Appendix 4. The Mathieu Group M,, and the order of M,, 

Appendix 5. The Proof of Lemma 3.3 

Appendix 6. The Sporadic Simple Groups 


SOME INFORMATIVE AND USEFUL 
BOOKS [IFIROM Uinlls IWIAVN 2 oo 


Browse through this list and see if your library is missing some of these important 


books published by the Association 


APPLICATIONS OF UNDERGRADUATE 
MATHEMATICS IN ENGINEERING—written and 
edited by Ben Noble. A collection of articles 
prepared by engineers for the Committee on the 
Undergraduate Program in Mathematics. 364 
pp. Hardbound. 

List: $19.00. MAA Member: $13.00. 


ANNOTATED BIBLIOGRAPHY OF EXPOSITORY 
WRITING IN THE MATHEMATICAL SCIENCES, 
prepared by M. P. Gaffney and L. A. Steen. An 
invaluable reference source of expository 
articles in mathematics. 282 pp. Paperbound. 
List: $12.00. MAA Member: $8.00. 


AN ANNOTATED BIBLIOGRAPHY OF FILMS 
AND VIDEOTAPES FOR COLLEGE 
MATHEMATICS, by David |. Schneider. An up- 
to-date listing of films and videotapes available 
for classroom use. 107 pp. Paperbround. 

List: $9.00. MAA Members: $6.00. 


A BASIC LIBRARY LIST FOR TWO-YEAR 
COLLEGES, prepared by the Committee on 
Basic Library Lists. A recommended library 
nucleus for two-year colleges. 66 pp. 
Paperbound. 

List: $8.00. MAA Member: $6.00. 


A BASIC LIBRARY LIST FOR FOUR-YEAR 
COLLEGES, prepared by CUPM. Presents 
listings of books and journals that should be in 
every college library. 106 pp. Paperbound. 
List: $9.00. MAA Member: $6.50. 


THE CHAUVENET PAPERS. A Collection of Prize 
Winning Expository Papers in Mathematics, 
edited by James C. Abbott. Two-volumes of the 
collected prize winning Chauvenet Papers. Vol. 
1—312 pp. Hardbound. Vol. 2—282 pp. 
Hardbound. 
List: $21.00 each. MAA Member $16.00 each. 
Two volume sets 
List: $36.00. MAA Member: $27.00. 


CRITICAL VARIABLES IN MATHEMATICS 
EDUCATION: Findings from a Survey of the 
Empirical Literature, by E. G. Begle. A joint 
publication of the MAA and the National Council 
of Teachers of Mathematics. List: $8.00. MAA 
Member: $6.40. 


A COMPENDIUM OF CUPM 
RECOMMENDATIONS. Volumes | and Il. A 
collection of the major recommendations of the 
Committee on the Undergraduate Program in 
Mathematics. Two volumes. 756 pp. 
Hardbound. 

List: $16.50. MAA Member: $12.00. 


THE WILLIAM LOWELL, PUTNAM 
MATHEMATICAL COMPETITION: PROBLEMS 
AND SOLUTIONS—1938-1964. Compiled by 
R. E. Greenwood, A. M. Gleason, and L. M. 
Kelly. Contains problems and solutions to the 
first 25 Putnam Exams. 652 pp. Hardbound. 
List: $35.00. MAA Member: $26.00. 


THE MATHEMATICAL ASSOCIATION OF 
AMERICA: Its First Fifty Years. An historical 
perspective of the Association. 170 pp. 
Hardbound. 

List: $10.00. MAA Member: $5.00. 


FIFTY YEAR INDEX OF THE MATHEMATICS 
MAGAZINE, edited by Lynn A. Steen, and J. 
Arthur Seebach. Cumulative index of volumes 
1-50. 163 pp. Paperbound. 

List: $10.00. MAA Member: $6.50. 


INDEX OF THE AMERICAN MATHEMATICAL 
MONTHLY. Contains the tables of contents for 
each issue of volumes 1-80 as well as Subject 
and author indices. 269 pp. Hardbound. 
List:$19.00. MAA Member: $13.00. 


PROFESSIONAL OPPORTUNITIES IN THE 
MATHEMATICAL SCIENCES, Tenth Edition. 
1978. Designed for the student interested in a 
career in mathematics. 35 pp. Paperbound. 
$1.50 each. 95¢ for orders of five or more. 


RECOMMENDATIONS ON A GENERAL 
MATHEMATICAL SCIENCES PROGRAM. 
Prepared by the Committee on the 
Undergraduate Program in Mathematics. 
(CUPM) 102 pp. Paperbound. $3.50 each. 


MATHEMATICAL TIME EXPOSURES, by Issaac 
J. Schoenberg. Brings together topics from 
geometry, number theory, algebra and analysis. 
288 pp. Paperbound. 

List: $18.00. MAA Member: $13.50. 

288 pp. Hardbound 

List: $30.00. MAA Member: $22.50. 


Order From: 


THE MATHEMATICAL ASSOCIATION OF 
AMERICA 

1529 Eighteenth Street, N.W. 
Washington, D.C. 20036 


DEVELOPMENTAL TEXTS 


BEGINNING ALGEBRA by 

R. David Gustafson and Peter D. 
Frisk, Rock Valley College. Offers a 
broad range of topics at a depth 
appropriate for students with no prior 
experience with algebra. 


INTERMEDIATE ALGEBRA 
FOR COLLEGE STUDENTS, 
3RD EDITION by Karl J. Smith 
and Patrick Boyle, Santa Rosa 
Junior College. Motivates new 
techniques and concepts within the 
setting of concrete examples, often 
from everyday experience. 


INTERMEDIATE ALGEBRA 
by Gene Sellers, Sacramento City 
College. Presents new concepts and 
techniques in an informal, non- 
threatening way. 


ALGEBRA FOR COLLEGE 
STUDENTS by R. David 
Gustafson and Peter D. Frisk, Rock 
Valley College. Designed for courses 
that combine topics from intermediate 
algebra and college algebra and for 
those who want to use one book for 
both courses. 


New in 1985 


PRECALCULUS 


PLANE TRIGONOMETRY, 
Z2ND EDITION by R. David 
Gustafson and Peter D. Frisk, Rock 
Valley College. Highly applied and 
concretely motivated. 


COLLEGE ALGEBRA, 3RD 
EDITION by Karl J. Smith and 
Patrick J. Boyle, Santa Rosa Junior 
College. New and ideas unfamiliar to 
the student are motivated by familiar 
images. 


ALGEBRA AND TRIGO- 
NOMETRY, 3RD EDITION 

by Bernard Rice and Jerry Strange, 
University of Dayton. Emphasizes 
mastery of algebraic and computa- 
tional skills with its traditional approach 
to trigonometry. 


COLLEGE ALGEBRA AND 
TRIGONOMETRY by Robert 

P. Webber, Longwood College. 
Focuses on doing problem solving to 
understanding mathematical relation- 
ships. 


COMPUTER MATHEMATICS 


ELEMENTS OF COMPUTER 
MATHEMATICS by Sandra 
Talbot, Mattatuck Community 
College; Harold Baker, Waterbury 
State Technical College; and 
Lawrence Gilligan, University of 
Cincinnati. A practical approach em- 
phasizes basic concepts and skills. 


from Brooks/Cole 


FINITE MATHEMATICS 


FINITE MATHEMATICS by 
Karl J. Smith, Santa Rosa Junior 
College. Provides the noncalculus 
mathematics background necessary 
for students in business ana the social 
or biological sciences. 


CALCULUS 


APPLIED CALCULUS by 
Claudia Taylor, Brevard Community 
College, and Lawrence Gilligan, 
University of Cincinnati. For non- 
science majors who require a short 
course for business, economics, 
social,or life sciences. 


STATISTICS 


THE ANALYSIS OF LINEAR 
MODELS by Ronald Hocking, 
Texas A & M University. Provides a 
new approach that provides insight 
into the model formulation as well as 
the analysis. 


ADVANCED MATHEMATICS 
WITH APPLICATIONS 


A FIRST COURSE IN 
MATHEMATICAL MODELING 
by Frank Giordano, U.S. Military 
Academy, West Point; and Maurice 
Weir, Naval Postgraduate School, 
Monterey. Bridges the gap between 
calculus and real-world applications of 
mathematics. 


NUMERICAL MATHEMATICS 
AND COMPUTING, 2ND 
EDITION by E. Ward Cheney and 
David Kincaid, University of Texas, 
Austin. introduces students to the use 
of computers in applying numerical 
analysis for solving problems in 
science, engineering, and technology. 


F OUNDATIONS OF 
DISCRETE MATHEMATICS 
by Albert Polimeni and Joseph 
Straight, State University of New 
York, Fredonia. Combines a students 
first encounter with abstract mathe- 
matics with an introduction to the 
mathematics basic to the study of 
computer sciences. 


Craig Barth, Editor 
Brooks/Cole Publishing 


Company 
555 Abrego Street 
Monterey, CA 93940 


Scientific Pascal 


Harley Flanders, Florida Atlantic University 


This text emphasizes the scientific applications of 
Pascal. The programs involve approximation, 
zeroes of functions, optimization, differential 
equations, matrices, and other numerical analysis. 
The text includes a clear treatment of files, and is 
well indexed for use as a reference. There are 
hundreds of worked examples that teach practical 
scientific programming rapidly and efficiently. 
The examples and exercises also show how pro- 
—— grams should be written and formatted. Numerous 
a BS ad flowcharts illustrate algorithms and the logic 
en eM involved in writing good scientific programs. Pro- 
i wal grams available on disk from publisher. 
Leong cloth: R5451-1 $21.95 624 pp. 


Table of Contents 
1 Introduction 4 Structured Types 
2 Types and Procedures 5 Applications Numerical 
3 Program Flow 6 Applications—Algebraic 


Appendices: Reserved Words, Symbols, Standard Iden- 
tifiers. Railroad Diagrams. UCSD Pascal. Apple Pascal 
Dialogue. ASCII Characters. Answers to Exercises. 


Reston Publishing Company, Inc. A Prentice-Hall Company 11480 Sunset Hills Road Reston, VA 22090 


The Mathematical Association of America 
CORPORATE MEMBERS 


The Mathematical Association of America gratefully acknowledges the 
interest and support of the following: 


CORPORATE MEMBERS 


Academic Press, Inc. The Mitre Corporation 
Addison-Wesley Publishing Company, Inc. Pergamon Press, Inc. 
American Mathematical Society, Inc. Prindle Weber and Schmidt, Inc. 
Birkhauser Boston, Inc. Saunders College Publishing, Inc.-~ 
International Business Machines Corporation Scott Foresman & Company, Inc. 
Macmillan Publishing Company Springer-Verlag New York, Inc. 
Marcel Dekker, Inc. John Wiley & Sons, Inc. 


CONTRIBUTING CORPORATE MEMBER 
Bell Laboratories, Inc. 


Scott, Foresman Math Texts — 
Working for You! 


Three new editions for your 
math for management courses 
with complete computer 
application program: 


Margaret L. Lial 
Charles D. Miller 


Mathematics and 
Calculus with 
Applications 


Second Edition 
Available Now 


«© 1985, 864 pages, hardbound 


Finite 
Mathematics 


Third Edition 
Available Now 


1985, 518 pages, hardbound 


Calculus with 
Applications 
Third Edition 
Available Now 
| © 1985, 640 pages, hardbound 


All three texts above with Instructor’s 
Guide and Student Solutions Manual. 


For further information write 
Meredith Hellestrae, 
Department SA—-AMM 


s 61900 East Lake Avenue 


Glenview, Illinois 60025 


Daniel Saltz 
A Short Calculus 


An Applied Approach 


Fourth Edition 
Available Now 


© 1985, 624 pages, hardbound, 
Instructor’s Guide 
Daniel Gallin 
Finite 


Mathematics 


© 1984, 525 pages, hardbound, 
Instructor’s Guide 


And continuing a 
a tradition of 
innovation... 


Al Shenk 
Calculus and 
Analytic 
Geometry 

Third Edition 

© 1984, 1151 pages, hardbound, 


Instructor’s Guide, Student Handbook 
Transparency Masters 


Sr 


Scott, Foresman 
and Company 


Calculus Flanders HARLEY FLANDERS 
Florida Atlantic University 
December 1984 

984 text pages 

ISBN 0-7167-1643-7 


CALCULUS 


Harley Flanders believes that the main objective of a calculus text is to 
help students learn to use calculus—how to set up and solve problems. 
And he never loses sight of that goal. 

Efficient, economic organization 

Flanders organizes each major section by developing a small amount of 
introductory material on theory (leaving detailed proofs and unusual topics for 


optional sections), working out examples to illustrate techniques and applications, 


and then providing numerous and varied exercises. 

Careful, informal explanation and motivation of topics 
Major topics are introduced with informal explanations, examples, and many 
graphics before the mathematical details of the theory or technique are 
presented. 

Attention to problem solving 

Keeping the student in mind at all times, Flanders fills the text with hints and 
suggestions for setting up and solving problems. He even provides a unique 
section on problem solving to help students develop problem-solving skills early 
in the course 

Avoidance of unnecessary mathematical detail 

By focusing on what is important and not being distracted by lengthy, time- 
consuming details, Flanders is able to develop the transcendental functions prior 
to integration, so that students can see some applications of integration during 
the first semester. 

An assurance of accuracy 

Instructors and students expect their calculus text to be accurate and error-free. 
The publisher has had each calculation and answer checked independently at 
several stages of production. 


We invite you to examine Flanders’ new Calculus for yourself. To obtain your 
copy, simply address your request on school letterhead to: 


as W. H. Freeman and Company Be sure to include your course name, 
M@ 41 Madison Avenue current text, and anticipated enrollment. 
New York, N.Y. 10010 


THE MATHEMATICAL ASSOCIATION OF AMERICA 


14990 Fishteenth Street N W 


THE AMERICAN 
MATHEMATICAL MONTHLY & 


AS 
bs § 
= 
w 
% 


Volume 92, Number 3 March 1985 


Contents (ISSN 0002-9890) 


ARTICLES 

Award for Distinguished Service to 

Professor Everett Pitcher ............ 2.2... 0.0.0.0 0 cee eae DAVID ROSELLE 169 

Algorithmic Thinking and Mathematical Thinking .......... DONALD E. KNUTH 170 

The Anatomy of Low Dimensional Stable Singularities ....... JONATHAN BRITT 183 

Characterizing Maximally Looped Closed 

CurveS ... 0.0.0... eee eee F. J. CRAVEIRO DE CARVALHO 202 
PHOTO... ee eee eens 182 
CENTER SECTION (Telegraphic Reviews, Official Reports)................ C25-C36 
NOTES 

On Gelbaum’s Algorithm for Computing the Minimal 

Polynomial of a Matrix... .....00.... 0.0.0... 00 eee eee RONALD J. LEACH 208 

Continuous Functions with a Dense Set of Proper 

Local Maxima.................. VLADIMIR DROBOT ano MICHAL MORAYNE 209 

The Diameter of a Graph and Its 

Complement.................. FRANK HARARY ann ROBERT W. ROBINSON 211 
THE TEACHING OF MATHEMATICS 

Simplification of Some Contour Integrations ..... MARY L. BOAS ano R.P. BOAS 212 

Pi. ee ee eee E.F. ASSMUS, Jr. 213 
MISCELLANEA ....... 2.020.000. cc ee eee een eee 214, 228 
PROBLEMS AND SOLUTIONS 

Elementary Problems and Solutions ............... 0.0.0.0. cece eee eee 215 

Advanced Problems and Solutions .............. 0.000000 eee ee ee eee 217 
REVIEWS 

Neyman: From Life. By Constance Reid .................... SAMUEL KOTZ 219 

Differential Equations: Classical to Controlled. 

By Dahlard L. Lukes............ 0.0.0.0... 0000 cece eee ee EMILIO ROXIN 223 

From the Calculus to Set Theory, 1630-1910: 

An Introductory History. Edited by |. Grattan-Guinness ........ ROBINE. RIDER 225 


LETTERS TO THE EDITOR ............ 0.0... 0 ee eee 


NOTICE TO AUTHORS 


See statement of editorial policy (volume 89, p. 3). Follow the format in current issues. Put your full mailing 
address in a line at the head of the paper, following the title and the author’s name. Send three copies of the 
manuscript, legibly typewritten on only one side of the paper, double-spaced with wide margins, and keep one as 
protection against loss. Prepare illustrations carefully, on separate sheets of paper in black ink, the original without 
lettering and two copies with lettering added. Rough copies of the illustrations should be included in the manuscript 
at the appropriate places. Supply the full five-symbol Mathematics Subject Classification number, as described in 
Mathematical Reviews, 1980 and later. (This is necessary for indexing purposes.) Send all main articles, letters to the 
editor, and editorial correspondence to the editor. Contributions to the other four special departments (Notes, 
Teaching of Mathematics, Unsolved Problems, and Problems and Solutions) should be sent to the appropriate one 
of the associate editors, whose addresses are given below. 


EDITOR 


P. R. HALMOS, Department of Mathematics, University of Santa Clara, Santa Clara, CA 95053. 


ASSOCIATE EDITORS 
G.L. ALEXANDERSON DAVID BORWEIN J. H. EWING LYNN A. STEEN 
THOMAS F. BANCHOFF W. WISTAR COMFORT RAOUL HAILPERN MARY R. WARDROP 
R. P. BOAS H. M. W. EDGAR RICHARD S. MILLMAN HERBERT S. WILF 


ALLAN L. EDMONDS SEYMOUR SCHUSTER 


Notes. Combinatorics and probability: SABRA S. ANDERSON, Department of Mathematics, University of 
Minnesota, Duluth, MN 55812. Analysis and linear algebra: SHELDON AXLER, Department of Mathematics, 
University of California, Berkeley, CA 94720. Topology, geometry, and abstract algebra: J. ARTHUR SEEBACH, 
Jr., Department of Mathematics, St. Olaf College, Northfield, MN 55057. Authors of fields not covered by these 
rubrics may send their notes to any one of the three editors. 

Teaching of Mathematics: ROBERT F. WARDROP, Department of Mathematics, Central Michigan University, 
Mt. Pleasant, MI 48859. 

Unsolved Problems: RICHARD GUY, Department of Mathematics and Statistics, The University of Calgary, 
Calgary, Alberta, Canada T2N 1N4. 

Proposed problems and solutions, both Elementary and Advanced: D. H. MUGLER, Department of Mathe- 
matics, University of Santa Clara, Santa Clara, CA 95053. 

Reprint Permission: A. B. WILLCOX, Executive Director, Mathematical Association of America, 1529 Eighteenth 
Street, N.W., Washington, DC 20036. 

Advertising Correspondence: Ms. ELAINE PEDREIRA, Advertising Manager, Mathematical Association of America, 
1529 Eighteenth Street, N.W., Washington, DC 20036. 

Subscription correspondence, changes of address, and inquiries about nondelivered or defective issues: Member- 
ship /Subscriptions Department, Mathematical Association of America, 1529 Eighteenth Street, N.W., Washington, 
DC 20036. 

Back Issues: P. and H. BLiss Co., Middletown, CT 06457. 

Microfilm Editions: University Microfilms International, Serials Bid Coordinator, 300 North Zeeb Road, Ann 
Arbor, MI 48106. < 


Editorial Assistants: Fanny Hailpern, Mary P. Jackson. 


The AMERICAN MATHEMATICAL MONTHLY (ISSN 0002-9890) is published by the Mathematical Association of America 


at 1529 Eighteenth Street, N.W., Washington, DC 20036 and Montpelier, VT, ten times a year: January, February, March, 


April, May, June-July, August-September, October, November and December. 


The annual subscription price for the AMERICAN MATHEMATICAL MONTHLY to an individual member of the Association 
is $22 included as part of the annual dues. (Annual dues for regular members, exclusive of annual subscription prices for 
MAA journals, are $22. Student, unemployed and emeritus members receive a 50% discount; new members receive a 30% 


dues discount for the first two years of membership.) The nonmember /library subscription price is $55 per year. 


Copyright © by the Mathematical Association of America (Incorporated), 1985, including rights to this journal issue as a 
whole and, except where otherwise noted, rights to each individual contribution. General permission is granted to 
Institutional Members of the MAA for noncommercial reproduction in limited quantities of individual articles (in whole or 


in part) provided a complete reference is made to the source. 
Second class postage paid at Washington, DC, and additional mailing offices. 


Postmaster: Send address changes to Membership/Subscriptions Department, Mathematical Association of America, 


1529 Eighteenth Street, N.W., Washington, DC 20036. 
PRINTED IN THE UNITED STATES OF AMERICA 


AWARD FOR DISTINGUISHED SERVICE TO 
PROFESSOR EVERETT PITCHER 


DAVID ROSELLE 
Virginia Polytechnic Institute and State University, Blacksburg, VA 24061 


This year’s recipient of the Award for Distinguished Service to Mathematics is a man who has 
given long, devoted and outstanding service to the American Mathematical Society and to the 
larger mathematics community. 

Everett Pitcher was born on July 18, 1912, in Hanover, New Hampshire. His father, Arthur 
Dunn Pitcher, had taken a Ph.D. in general analysis under E. H. Moore at the University of 
Chicago in 1910 and was a faculty member at Dartmouth at the time of Everett Pitcher’s birth. In 
1915, the Pitcher family moved to Cleveland where Professor Pitcher served as Head of the 
Mathematics Department until his premature death in 1923. It is of interest that Arthur Dunn 
Pitcher served as a member of the Council of the AMS. 

Everett Pitcher progressed through the public schools of Cleveland where his mother Wilimina 
Everett Pitcher also served as a teacher. He then attended Adelbert College of Western Reserve 
University and received the A.B. in 1932. From there, Everett proceeded to Harvard University 
where he received the degrees of M.A. (1933) and Ph.D. (1935). 

At Harvard, Pitcher especially remembers courses in Complex Variables from William Fogg 
Osgood, Real Variables from Joseph L. Walsh, and Integral Equations from Marston Morse. The 
contact with Morse was quite enough to cause young Pitcher to ask Morse to take him on as a 
student. Morse was then in the midst of writing up his Colloquium Lectures and thus Pitcher was 
immediately given four chapters to read while still in draft. The next academic year, Pitcher and 
Morse worked on a problem on distribution of conjugate points and this problem became the 
body of Pitcher’s dissertation. 

When Professor Morse left Harvard University in 1935 to take up an appointment at the 
Institute for Advanced Study, he took the new Dr. Pitcher with him as his first assistant. Everett 
remembers Professor Morse as being conscious of status and thinks that he may thus have 
received a salary of $1800 when other assistants were paid only $1500. 

Pitcher returned to Harvard in 1936 and served two years as a Benjamin Peirce Instructor. He 
was looking for a position of more permanence during 1937-38 and learned of just one opening, 
namely an instructorship at Lehigh, which he accepted. 

Everett Pitcher was on active duty in the U.S. Army Ordnance Department from May 1942 
until the end of 1945. His work dealt with the applications of mathematics and most of his duty 
was at the Ballistics Research Laboratory (BRL) working with a distinguished group of mathema- 
ticians brought together by Oswald Veblen. Colleagues included E. J. McShane, J. L. Kelley, 
Gerhard Hochschild, Herbert Federer, Leonard Tornheim, J. J. Levin, A. A. Bennett, Theodore 
Hailperin, Fritz John, J. W. Green, Frank Grubbs, Herman Goldstine, Irving Segal, I. J. 
Schoenberg, Hans Lewy, and Leo Zippin. The problems in which Pitcher was involved while at 
BRL were the accuracy of artillery firing, the terminal effects of artillery fire, and scientific 
intelligence; the last while on assignment in Paris and traveling in Austria during 1945. 

Subsequent to his WW II assignment, Pitcher returned to the Institute for Advanced Study for 
the year 1945-46 where he and J. L. Kelley co-authored “Exact Sequences in Homology Theory,” 
Ann. of Math. (2) 48, 682—709 (1947). Everett recalls this as one of his better papers and the 
subject matter as being as far from what he and Kelley had been doing at BRL as one could get. 

Pitcher returned to Lehigh University in the fall of 1946 and was promoted twice in two years, 
the second time to the rank of professor. In addition to his faculty assignments, he has served 
Lehigh University as Department Chairman (1960-78) and as Consultant to the President 
(1978—present). 

Everett has been enormously active in the professional activities of our community. He was 
Associate Secretary of the AMS (1959-66), a founder of SIAM and a member of its Board of 
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Trustees (1961-63). He has served as Secretary of the American Mathematical Society since 1967. 
He was elected to Phi Beta Kappa (1930), Sigma XI (1939) and is a member of the Pi Kappa 
Alpha fraternity. 

Everett was married to Sarah Mathiott Hindman in 1936 who died in 1972. Sarah and Everett 
have two children, Joan P. Morrison and Susan Pitcher-Cooper. In 1973 Everett married Theresa 
P. Sell, known affectionately to many of us as Terry. 

Everett Pitcher has served the mathematics community exceptionally well. His service to the 
Society for Industrial and Applied Mathematics, the Mathematical Association of America, and to 
Lehigh University have been noteworthy. I feel profoundly grateful to Everett Pitcher for his 
exceptional service to the American Mathematical Society. I know that you share my sense of 
gratitude and agree with the Association’s recognition of Everett Pitcher as the 1985 recipient of 
the Award for Distinguished Service to Mathematics. 


ALGORITHMIC THINKING AND MATHEMATICAL THINKING 


DONALD E. KNUTH 
Computer Science Department, Stanford University, Stanford, CA 94305 


My purpose in this paper is to stimulate discussion about a philosophical question that has 
been on my mind for a long time: What is the actual rdle of the notion of an algorithm in 
mathematical sciences? 

For many years I have been convinced that computer science is primarily the study of 
algorithms. My colleagues don’t all agree with me, but it turns out that the source of our 
disagreement is simply that my definition of algorithms is much broader than theirs: I tend to 
think of algorithms as encompassing the whole range of concepts dealing with well-defined 
processes, including the structure of data that is being acted upon as well as the structure of the 
sequence of operations being performed; some other people think of algorithms merely as 
miscellaneous methods for the solution of particular problems, analogous to individual theorems 
in mathematics. 

In the U.S.A., the sorts of things my colleagues and I do is called Computer Science, 
emphasizing the fact that algorithms are performed by machines. But if I lived in Germany or 
France, the field I work in would be called Informatik or Informatique, emphasizing the stuff that 
algorithms work on more than the processes themselves. In the Soviet Union, the same field is 
now known as either Kibernetika (Cybernetics), emphasizing the control of a process, or 
Prikladnaia Matematika (Applied Mathematics), emphasizing the utility of the subject and its ties 
to mathematics in general. I suppose the name of our discipline isn’t of vital importance, since we 
will go on doing what we are doing no matter what it is called; after all, other disciplines like 
Mathematics and Chemistry are no longer related very strongly to the etymology of their names. 
However, if I had a chance to vote for the name of my own discipline, I would choose to call it 


Donald E. Knuth, the Fletcher Jones Professor of Computer Science at Stanford University, is best known as the 
author of The Art of Computer Programming, a series of reference books that he plans to complete during the next 
two decades. His mathematical novelette, Surreal Numbers, has been used to help teach research methodology to 
undergraduates. During the past several years he has also developed new methods of mathematical typography that 
are now coming into wide use. 


This paper has been excerpted from the text of a talk published in [8]. The preparation of the original version 
was supported in part by National Science Foundation grant MCS72-03752 A03 and in part by Office of Naval 
Research contract N0014-76-C-0330. This revision has been supported in part by National Science Foundation grant 
MCS-83-00984. Many people too numerous to name have contributed their insights during informal discussions 
about the topics discussed here. 
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Algorithmics, a word coined about 20 years ago by J. F. Traub [12, p. 1]. 

The word algorithm stems from the name of al-Khwarizmi, the great ninth-century scientist 
whose name means “from Khwarizm.” When I learned the source of this word, I decided that I 
ought to make a pilgrimage to the Khwarizm region, since algorithms are so central to my life’s 
work. In 1979 I had the wonderful opportunity to fulfill this dream, because a unique symposium 
was organized in that year by the Academy of Sciences of the Uzbek S.S.R., with the support of 
the Soviet Academy of Sciences. This symposium, entitled “Algorithms in Modern Mathematics 
and Computer Science,” took place in Urgench, about 150 miles south of the Aral Sea—which 
was once known as Lake Khwarizm. 

The chance to participate in such a symposium gave a special focus to the questions that had 
been buzzing around in my head. It was a time for me to think about long-range issues, things that 
I rarely have a chance to contemplate because of the hectic pace of activities at home. The site of 
the symposium, with its rich history and the grand scale of its scenery, encouraged me to look 
backward to the roots of my activities, and to look ahead as well—wondering what my work was 
all about. I asked myself several questions with renewed intensity: What is the relation of 
algorithms to modern mathematics? Is there an essential difference between an algorithmic 
viewpoint and the traditional mathematical world-view? Do most mathematicians have an essen- 
tially different thinking process from that of most computer scientists? Among members of university 
mathematics departments, why do the logicians (and to a lesser extent the combinatorial 
mathematicians) tend to be much more interested in computer science than their colleagues are? 

In a sense, I had been thinking about such questions ever since my undergraduate days. I began 
to study higher mathematics in 1957, the same year that I began to work with digital computers; 
but I never mixed my mathematical thinking with my computer-science thinking in nontrivial 
ways until 1961. In one building I was a mathematician, in another I was a computer programmer, 
and it was as if I had a split personality. During 1961 I was excited by the idea that mathematics 
and computer science might have some common ground, because BNF notation looked mathe- 
matical; so I bought a copy of Chomsky’s Syntactic Structures and set out to find an algorithm to 
decide the ambiguity problem of context-free grammars (not knowing that this had been proved 
impossible by Bar-Hillel, Perles, and Shamir in 1960). Needless to say, I failed to solve that 
problem, although I found some useful necessary and sufficient conditions for ambiguity, and I 
also derived a few other results like the fact that context-free languages on one letter are regular. 
Here, I thought, was a nice mathematical theory that I was able to develop with my computer- 
science intuition; how curious! During the summer of 1962, I spent a day or two analyzing the 
performance of hashing with linear probing, but this did not really seem like a marriage between 
my computer science personality and my mathematical personality since it was merely an 
application of combinatorial mathematics to a problem that has relevance to programming. 

I think it is generally agreed that mathematicians have somewhat different thought processes 
from physicists, who have somewhat different thought processes from chemists, who have 
somewhat different thought processes from biologists. Similarly, the respective “mentalities” of 
lawyers, poets, playwrights, historians, linguists, farmers, and so on, seem to be unique. Each of 
these groups can probably recognize that other types of people have a different approach to 
knowledge; and it seems likely that a person gravitates to a particular kind of occupation 
according to the mode of thought that he or she grew up with, whenever a choice is possible. C. P. 
Snow wrote a famous book about “two cultures,” scientific vs. humanistic, but in fact there seem 
to be many more than two. 

Educators of computer science have repeatedly observed that only about 2 out of every 100 
students enrolling in introductory programming courses really “resonate” with the subject and 
seem to be natural-born computer scientists. (For example, see Gruenberger [3].) Just before 
traveling to the Urgench symposium I had some independent confirmation of this, when I learned 
that 220 out of 11000 graduate students at the University of Illinois were majoring in Computer 
Science. (The relative numbers are different now, but they are probably biased by economic 
influences more than by natural aptitudes.) Since I believe that Computer Science is the study of 
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algorithms, I conclude that roughly 2% of all people “think algorithmically,” in the sense that they 
can reason rapidly about algorithmic processes. 

During my visit to the University of Illinois in 1979, I met Gerrit DeYoung, a psychologist- 
interested-in-computing who had recently made an interesting experiment on two groups of 
undergraduate students taking introductory courses in computer science. Group I consisted of 135 
students intending to major in computer science, while Group II consisted of 35 social science 
majors. Both courses emphasized non-numeric programming and various data and control 
structures, although numerical problems were treated too. DeYoung handed out a questionnaire 
that tested each student’s so-called quantitative aptitude, a standard test that seems to correlate 
with mathematical ability, and he also asked them to estimate their own performance in class. 
Afterwards he learned the grades that the students actually did receive, so he had three pieces of 
data on each student: 


A = quantitative aptitude; 
B = student’s own perception of programming ability; 
C = teacher’s perception of programming ability. 


In both cases B correlated well with C (the coefficient was about .6), so we can conclude that the 
teachers’ grading wasn’t random and that there is some validity in these scores. The interesting 
thing was that there was no correlation between A and B or between A and C among the 
computer science majors (Group I), while there was a pronounced correlation of about .4 between 
the corresponding numbers for the students of Group II. It isn’t clear how to interpret this data, 
since many different hypotheses could account for'such results; perhaps psychologists know only 
how to measure the quantitative ability of people who think like psychologists do! At any rate the 
lack of correlation between quantitative ability and programming performance in the first group 
reminds me strongly of the feelings I often have about differences between mathematical thinking 
and computer-science thinking, so further study is indicated. 

I believe that the real reason underlying the fact that Computer Science has become a thriving 
discipline at essentially all of the world’s universities, although it was totally unknown twenty 
years ago, is not that computers exist in quantity; the real reason is that the algorithmic thinkers 
among the scientists of the world never before had a home. We are brought together in Computer 
Science departments because we find people who think like we do. At least, that seems a viable 
hypothesis, which hasn’t been contradicted by my observations during the last half dozen or so 
years since the possibility occurred to me. 

My goal, therefore, has been to get a deeper understanding of these phenomena; the “different 
modes of thought” hypothesis merely scratches the surface and gives little insight. Can we come 
up with a fairly clear idea of just what algorithmic thinking is, and contrast it with classical 
mathematical thinking? 

At times when I try to come to grips with this question, I find myself almost convinced that 
algorithmic thinking is really like mathematical thinking, only it concentrates on more “difficult” 
things. But at other times I have just the opposite impression, that somehow algorithms hit only 
the “simpler” kinds of mathematics...,. Clearly such an approach leads only to confusion and 
gets me nowhere. 

I looked at the collection of expository works called Mathematics: Its Content, Methods, and 
Meaning [1] and reread what A. D. Aleksandrov had to say in his excellent introductory essay. 
(Interestingly enough, I found that he made prominent mention of al-Khwarizmi.) Aleksandrov 
listed the following characteristic features of mathematics: 


e Abstractness, with many levels of abstraction. 
e Precision and logical rigor. 
e@ Quantitative relations. 


e Broad range of applications. 
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Unfortunately, however, all four of these features seem to be characteristic also of computer 
science. Is there really no difference between computer science and mathematics? 


A Plan. I decided that I could make no further progress unless I took a stab at analyzing the 
question “‘What is mathematics?”—analyzing it in some depth. Many other people have also 
explored this same question, and the answer is obviously that “Mathematics is what mathema- 
ticians do.” More precisely, the appropriate question should probably be, “What is good 
mathematics?” and the answer is that “Good mathematics is what good mathematicians do.” 

Therefore I took nine books off of my shelf, mostly books that I had used as texts during my 
student days but also a few more for variety’s sake. I decided to take a careful look at page 100 
(i.e., a ““random’’ page) in each book and to study the first result on that page. This way I could 
get a sample of what good mathematicians do, and I could attempt to understand the types of 
thinking that seem to be involved. 

From the standpoint of computer science, the notion of “types of thinking” is not so vague as 
it once was, since we can now imagine trying to make a computer program discover the 
mathematics. What sorts of capabilities would we have to put into such an artificially intelligent 
program, if it were to be able to come up with the results on page 100 of the books I selected? 

In order to make this experiment fair, I was careful to abide by the following ground rules: 
(1) The books were all to be chosen first, before I studied any particular one of them. (2) Page 100 
was to be the page examined in each case, since I had no a priori knowledge of what was on that 
page in any book. If somehow page 100 turned out to be a bad choice, I wouldn’t try anything 
sneaky like searching for another page number that would give results more in accord with my 
prejudices. (3) I would not suppress any of the data; every book I had chosen would appear in the 
final sample, so that I wouldn’t introduce any bias by selecting a subset. 

The results of this experiment opened up my eyes somewhat, so I would like to share them with 
you. Here is a book-by-book summary of what I found. 


Book 1: Thomas’s Calculus. I looked first at the book that had introduced me to higher 
mathematics, the calculus text by George B. Thomas [11] that I had used as a college freshman. 
On page 100 he treats the following problem: What value of x minimizes the travel time from (0, a) 
to (x,0) to (d, —b), if you must go at speed s, from (0,a) to (x,0) and at some other speed s, 
from (x,0) to (d, —b)? 


(0, a) 


a 


(d, —b) 


Fic. 1 


In other words, we want to minimize the function 


f(x) =ya*+x* fs, + yb +(d- x)” /sp. 


The solution is to differentiate f(x), obtaining 
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As x runs from 0 to d, the value of (sin@,)/s, starts at zero and increases, while the value of 
(sin 8, )/s, decreases to zero. Therefore the derivative starts negative and ends positive; there must 
be a point where it is zero, i.e., (sin#,)/s, = (sin8,)/s,, and that’s where the minimum occurs. 
Thomas remarks that this is “Snell’s Law” in optics; somehow light rays know how to minimize 
their travel time. 

The mathematics involved here seems to be mostly a systematic procedure for minimization, 
based on formula manipulation and the correspondence between formulas and geometric figures, 
together with some reasoning about changes in function values. Let us keep this in mind as we 
look at the other examples, to see how much the examples have in common. 


Book 2: A Survey of Mathematics. Returning to the survey volumes edited by Aleksandrov 
et al. [1], we find that page 100 is the chapter on Analysis by Lavrent’ev and Nikol’skii. It shows 
how to deduce the derivative of the function log, x in a clever way: 

log,(x +h) -—log,x 1 xth 1 ( a 


— “a = =] hd 
i j Ba x x 98 1+ 


The logarithm function is continuous, so we have 


x/h x/h 
lim 108, (1 + =| = biog, lim (1 + =} = Liog.e, 
h->0 x xX X << h->0 x x 


since it has already been proved that the quantity (1 + 1/n)” approaches a constant called e, 
when n approaches infinity through integer or noninteger values. Here the reasoning involves 
formula manipulation and an understanding of limiting processes. 


Book 3: Kelley’s General Topology. The third book I chose was a standard topology text [4], 
where page 100 contains the following exercise: “Problem A. The image under a continuous map of 
a connected space is connected.” No solution is given, but I imagine something like the following 
was intended: First we recall the relevant definitions, that a function f from topological space X 
to topological space Y is continuous when the inverse image f '(V) is open in_X, for all open sets 
V in Y; a topological space X is connected when it cannot be written as a union of two nonempty 
open sets. Thus, let us try to prove that Y is connected, under the assumption that f is continuous 
and X is connected, where f(X) = Y. If Y= V, U V,, where V, and V, are disjoint and open, 
then X= f-1(V,) Uf-'(V,), where f-'(V,) and f-1(V,) are disjoint and open. It follows that 
either f-'(V,) or f-1(V,) is empty, say f ‘(V,) is empty. Finally, therefore, V, is empty, since 
Vv,  f(f1(M)). QED. 


(Note that no properties of “open sets” were needed in this proof.) 


The mathematical thinking involved here is somewhat different from what we have seen before; 
it consists primarily of constructing chains of implications from the hypotheses to the desired 
conclusions, using a repertoire of fact§ like “f-'(A MN B) = f-'(A) Nf '(B)”. This is analogous 
to constructing chains of computer instructions that transform some input into some desired 
output, using a repertoire of subroutines, although the topological facts have a more abstract 
character. 

Another type of mathematical thinking is involved here, too, and we should be careful not to 
forget it: Somebody had to define the concepts of continuity and connectedness in some way that 
would lead to a rich theory having lots of applications, thereby generalizing many special cases 
that had been proved before the abstract pattern was perceived. 


Book 4: From the 18th Century. Another book on my list was Struik’s Source Book in 
Mathematics, which quotes authors of famous papers written during the period 1200-1800 A.D. 
Page 100 is concerned with Euler’s attempt to prove the fundamental theorem of algebra, in the 
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course of which he derived the following auxiliary result: “Theorem 4. Every quartic polynomial 
x* + Ax? + Bx* + Cx + D with real coefficients can be factored into two quadratics.” 

Here’s how he did it. First he reduced the problem to the case A = 0 by setting x = y — 4A. 
Then he was left with the problem of solving (x* + ux + a)(x* — ux + B)= x4 + Bx? + Cx + 
D for u, a, and B, so he wanted to solve the equations B=a+ B-u’?, C=(B-a)u, 
D = aB. These equations lead to the relations 2B = B + u? + C/u, 2a = B+ u? — C/u, and 
(B+ u*)? — C*/u? = 4D. But the cubic polynomial (u*)? + 2B(u?)? + (B? — 4D)u? — C? 
goes from — C? to +00 as u” runs from 0 to 0, so it has a positive root, and the factorization is 
complete. 


(Euler went on to generalize, arguing that every polynomial of degree 2” can be factored into 
2 


n 


two of degree 2”, via a polynomial of odd degree 3 ; ",) in u? having a negative constant term. 


But this part of his derivation was not rigorous; Lagrange and Gauss later pointed out a serious 
flaw.) 

When I first looked at this example, it seemed to be distinctly more “algorithmic” than the 
preceding ones—probably because Euler was essentially explaining how to take a quartic 
polynomial as input and to produce two quadratic polynomials as output. Input/output char- 
acteristics are significant aspects of algorithms, although Euler’s actual construction is compara- 
tively simple and direct so it doesn’t exhibit the complex control structure that algorithms usually 
have. The types of thinking involved here seem to be (a) to reduce a general problem to a simpler 
special case (by showing that A can be assumed zero, and by realizing that the resulting 
sixth-degree equation in u is really a third-degree equation in u*); (b) formula manipulation to 
solve simultaneous equations for a, 8, and u; (c) generalization by recognizing a pattern for the 
case of 4th degree equations that apparently would extend to degrees 8, 16, etc. 


Book 5: Abstract Algebra. My next choice was another standard textbook, Commutative 
Algebra by Zariski and Samuel [14]. Their page 100 is concerned with the general structure of 
arbitrary fields. Suppose k and K are fields with k C K; the transcendence degree of K over k is 
defined to be the cardinal number of any “transcendence basis” L of K over k, namely a set L 
such that all of its finite subsets are algebraically independent over k and such that all elements of 
K are algebraic over k(L); 1.e., they are roots of polynomial equations whose coefficients are in 
the smallest field containing k U L. The exposition in the book has just found that this cardinal 
number is a well-defined invariant of k and K, 1.e., that all transcendence bases of K over k have 
the same cardinality. 

Now comes Theorem 26: [fk C K C &, the transcendence degree of X over k is the sum of the 
transcendence degrees of K over k and of X over K. To prove the theorem, Zariski and Samuel let 
L be a transcendence basis of K over k and # a transcendence basis of ¥ over K; the idea is to 
prove that L U # is a transcendence basis of # over k, and the result follows since L and ¥ are 
disjoint. 

The required proof is not difficult and it is worth studying in detail. Let {x,,...,x,,, 
X1,---,Xy } be a finite subset of L U&, where the x’s are in L and the X’s in #, and assume 
that they satisfy some polynomial equation over k, namely 


E Ey — 
( +) > A(C15-.-5 Cy, Ey,. +5 Eq) Xft + xem pict: XyM = 0, 

ej yeees Cm 20 

E,,...,Ey20 
where all the a(e,,...,¢,,,/,,---, Ey) are in k and only finitely many a’s are nonzero. This 
equation can be rewritten as 

E E 
(* «) > y (e155 ms Eiseess By) Xf + xg) XBL XY = 0, 
E, yeeey Ey20 CLreees Cm 20 


a polynomial in the X’s with coefficients in K, hence all of these coefficients are zero by the 
algebraic independence over # over K. These coefficients in turn are polynomials in the x’s with 
coefficients in k, so all the a’s must be zero. In other words, any finite subset of L UY is 
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algebraically independent. 

Finally, all elements of K are algebraic over k(L) and all elements of “% are algebraic over 
K(£). It follows from the previously developed theory of algebraic extensions that all elements of 
H are algebraic over k(L)(£), the smallest field containing k UL UY#. Hence L U£& satisfies 
all the criteria of a transcendence basis. 

Note that the proof involves somewhat sophisticated “data structures,” i.e., representations of 
complex objects, in this case polynomials in many variables. The key idea is a pun, the 
equivalence between the polynomial over k in (*) and the polynomial over k(L) in (* *). In fact, 
the structure theory of fields being developed in this part of Zariski and Samuel’s book is 
essentially a theory about data structures by which all elements of the field can be represented and 
manipulated. Theorem 26 is not as important as the construction of transcendence bases that 
appears in its proof. 

Another noteworthy aspect of this example is the way infinite sets are treated. Finite concepts 
have been generalized to infinite ones by saying that all finite subsets must have the property; this 
allows algorithmic constructions to be applied to the subsets. 


Book 6: Metamathematics. I chose Kleene’s Introduction to Metamathematics |5| as a repre- 
sentative book on logic. Page 100 talks about “disjunction elimination”: Suppose we are given 
(1) KH A V B and (2) At C and (3) BE C. Then by a rule that has just been proved, (2) and (3) 
yield 
(4) AV BEC. 


From (1) and (4) we may now conclude “(5) - C”’. Kleene points out that this is the familiar 
idea of reasoning by cases. If either A or B is true, we can consider case 1 that A is true (then C 
holds); or case 2 that B is true (and again C holds). It follows that statement C holds in any case. 

The reasoning in this example is simple formula manipulation, together with an understanding 
that familiar thought patterns are being generalized and made formal. 

I was hoping to hit a more inherently metamathematical argument here, something like 
“anything that can be proved in system X can also be proved in system Y,” since such arguments 
are often essentially algorithms that convert arbitrary X-proofs into Y-proofs. But page 100 was 
more elementary, this being an introductory book. 


Book 7: Knuth. Is my own work [6] algorithmic? Well, page 100 isn’t especially so, since it is 
part of the introduction to mathematical techniques that appear before I get into the real 
computer science content. The problem discussed on that page is to get the mean and standard 
deviation of the number of “heads” in n coin flips, when each independent flip comes up “heads” 
with probability p and “tails” with probability q = 1 — p. I introduce the notation p,, for the 
probability that k heads occur, and observe that 

Pak — P "Pn-1,k-1 + q "Pn—-1,k , 
To solve this recurrence, I introduce the generating function 
G,,(z) = » PragZ* 
< k>0 
and obtain G,,(z) = (q + pz)G,_,(z), G,(z) = q + pz. Hence G,(z) = (q + pz)”, and 
mean(G,,) = nmean(G,) = pn; __ var(G,,) = nvar(G,) = pgn. 

Thus, the recurrence relation is set up by reasoning about probabilities; it is solved by formula 

manipulation according to patterns that are discussed earlier in the book. 


Book 8: Polya and Szeg6. The good old days of mathematics are represented by Polya and 
Szegdo’s famous Aufgaben und Lehrsatze, recently available in an English translation with many 
new Aufgaben [9]. Page 100 contains a real challenge: 


7 192ncos8 
217. = lim I - —__ d@0 = 27. 
n>oo—@ (2ne — 1) ---(2ne'® — n)| 
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Fortunately the answer pages provide enough of a clue to reveal the proof that the authors had in 
mind. We have 
ne? — kl? = 4n? + k? — Ankcos@ = (2n — k)’-+ 4nk(1 — cos@) 
= (2n —k)” + 8nk sin? 0/2. 
Replacing 6 by x/ Vn allows us to rewrite the integral as 


nt 2?" 00 
(anc ae fo fu) ae, 


where f,(x) = 0 for |x| > avn, and otherwise 


8nk x we 
f(x) J2n(cos x/fn —1) I] [+ sin? | 


l<k<n (2n — ky’ 2¥n 


cp (2102) oo — i - yb sin + kt) 
l<k<n n — 


4 _ 2 4 
= exp -7in2+ f=) + 5 » ark = + 0(5] 
n i<k<n\(2n-—k)y ” n 


exp{ - x*In2 —(1 — 1n2) x? +o(-*)), 


Thus, f,(x) converges uniformly to e-* in any bounded interval. Furthermore we have 
Lf, (x)| < J2n (cos x/yn — 1) and 


x x x 1 7 
cos ES oy Ft 5472 < -(5- a \= for |x| < <a/yn, 


since the cosine function is “enveloped” by its Maclaurin series; therefore | f,(x)| is less than the 
integrable function e~‘* for all n, where c=1-— 77/12. From this uniformly bounded 
convergence we are justified in taking limits past the integral sign, 


lim f f,(x) dx = f e* dx =yr. 


Finally, the coefficient in front of {%,, f,(x)dx is 2?"*!n!?/Vn(2n)!, which is equal to 
2V7 (1 + O(1/n)) by Stirling’s approximation, and the result follows. 

This derivation gives some idea of how far mathematics had developed between the time of 
al-Khwarizmi and 1920. It involves formula manipulation and an understanding of the asymptotic 
limiting behavior of functions, together with the idea of inventing a suitable function f, that will 
rigorously permit us to make the interchange lim, 00 feof, (x) dx = f%., (lim, ... f,(x)) dx. 
The definition of f,(x) requires a clear understanding of how functions like exp x and cos x 
behave. 


Book 9: Bishop’s Constructive Mathematics. The last book I chose to sample turned out to be 
the most interesting of all from the standpoint of my quest; it was Errett Bishop’s Foundations of 
Constructive Mathematics [2], a book that I had heard about but never before read. The interesting 
thing about this book is that it reads essentially like ordinary mathematics, yet it is entirely 
algorithmic in nature if you look between the lines. 

Page 100 of Bishop’s book contains Corollary 3 to the Stone-Weierstrass theorem developed on 
the preceding pages: Every uniformly continuous function on a compact set X C R can be arbitrarily 
closely approximated on X by polynomial functions over R. And here is his proof: “By Lemma 5, 
the function x > |x — x,| can be arbitrarily closely approximated on X by polynomials. The 
theorem then follows from Corollary 2.” 
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We might call this a compact proof! Before unwrapping it to explain what Lemma 5 and 
Corollary 2 are, I want to stress that the proof is essentially an algorithm; the algorithm takes any 
constructively given compact set X and continuous function f and tolerance e as input, and it 
outputs a polynomial that approximates f to within e on all points of X. Furthermore the 
algorithm operates on algorithms, since f is given by an algorithm of a certain type, and since real 
numbers are essentially algorithms themselves. 

I will try to put Bishop’s implicit algorithms into an explicit PASCAL-like form, even though 
the capabilities of today’s programming languages have to be stretched considerably to reflect his 
constructions. First let’s consider Lemma 5, which states that for each e > O there exists a 
polynomial p: R — R such that p(0) = 0 and ||x| — p(x)| < e for all |x| < 1. Bishop’s proof, 
which makes the lemma an algorithm, is essentially the following. 


function Lemma 5 (e: real): R-polynomial, 
var N: integer; g, p: R-polynomial; 


begin N = suitable function of ¢; 
1 2 non 
g(t) = 1 -Zicnen / \(- t"; p(t) = g(-t*) — g(1); 
Lemma 5 = p; 
end. 


Here N is to be computed large enough that |g(t) — (1 — t)'”7| < de forO <t <1. 

The other missing component of the proof on page 100 is Corollary 2, which states that if X is 
any compact metric space and if G is the set of all functions x > p(x, x,), where x, € X and 
where p(x, y) denotes the metric distance from x to y, then “.(G) is dense in C(X).” That is, 
all uniformly continuous real-valued functions on X can be approximated to arbitrarily high 
accuracy by functions obtained from the functions G by a finite number of operations of 
addition, multiplication, and multiplication by real numbers. Bishop’s Corollary 2 turns out to be 
false in the case that X contains only one point, since G and ./(G) then consist only of the zero 
function. (I noticed this oversight while trying to formulate his proof in an explicitly algorithmic 
way.) But the defect is easily remedied. 

For our purposes it is best to reformulate Bishop’s Corollary 2 as follows: “Let X be a compact 
metric space containing at least two points, and let G be the set of all functions of the form 
x > cp(X,X_), where c > 0 and x, € X. Then G is a separating family over X.” Pll repeat Bishop’s 
definition of separating family in a minute; first I want to mention his Theorem 7, the 
Stone-Weierstrass theorem whose proof I shall not discuss in detail, namely the fact that 2/(G) is 
dense in C(X) whenever G is a separating family of uniformly continuous functions over a 
compact metric space X. In view of this theorem, my reformulation of Corollary 2 leads to the 
corollary as he stated it. 

A separating family is a collection of real-valued functions G over X, together with a function 
5 from the positive reals R* into R*, and also together with two selection algorithms o and +. 
Algorithm o takes elements x, y of X and a positive real number e as input, where p(x, y) > e, 
and selects an element g of G such that for all z in X we have 


< 
p(x,z) < 8(e) implies |g(z)|<e, p(y,z) < 8(e) implies |g(z) — I) < «. 
Algorithm 7 takes an element y of X and a positive real number e as input, and selects an 
element g of G such that the second of the implications above holds, for all z in X. 
Thus the reformulated Corollary 2 is an algorithm that takes a nontrivial compact metric space 
X as input and yields a separating family (6,0,7), where o and 7 select functions of the form 
p(X, X,). Here is the construction: 


function Corollary 2(X: compact metric space; 
Yos ¥y: X-element): X-separating family; 
{ y) and y, are distinct elements of X } 
var 6: function R* > R*; 
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d: function X x X - R’; 
o: function X xX X xX R*™> C(X); 
7: function X X R* > C(X); 
begin d(x, y) = X.p(x, y); {this is the distance function in X} 
S(e) = min(e*, ed( yo, 1); 
o(x, y,€) = (function g(z: X-element): R; 
g = d(x,z)/d(x, y)); 
T(y,&) = (function g(z: X-element): R; 
g = (if d(y, 1) < 7d(. 1) 
then d(y,z)/d(y, yw) 
else d(y,z)/d(y, vi))) 
Corollary 2 = (6,0,7); 
end. 


My notation for the complicated types involved in these algorithms is not the best possible, but 
I hope it is reasonably comprehensible without further explanation. The selection rule o de- 
termined by this algorithm has the desired property since, for example, p(x, y) > e and p(y, z) 
< 5(e) < e* implies that |g(z) — 1| = |p(x,z) — p(x, y)|/o(%, y) < p(y, 2)/p(x, y) < €. 

Bishop’s proof of Corollary 3 can now be displayed more explicitly as an algorithm in the 
following way. If X is a compact subset of R, under Bishop’s definition, we can compute 
M = bound (X) such that X is contained in the closed interval [— M, M]. Let us assume that his 
Theorem 7 is a procedure whose input parameters consist of a compact metric space X, a 
separating family (6,0,7) over X that selects functions from some set GC C(X), and a 
uniformly continuous function f: X — R, and a positive real number e. The output of this 
procedure is an element A of W(G), namely a finite sum of terms of the form Cg,(x)... g,,(x) 
where m > 1 and each g, € G; this output satisfies |A(x) — f(x)| < e for all x in X. 

Here is the fleshed-out form of Corollary 3: 


function Corollary 3 (X: compact real set; 
f: X-continuous function; 
€: positive): R-polynomial, 
var p,g,r: R-polynomial,; M, B: real; yo, y,: X-element; 
A: &(G)-element, where G is the set of functions x > c|x — Xo]; 
begin M := bound (X); 
Yo t= element (X); 
if trivial (X) then r(t) = f(y») 
else begin y, = element (X \ { yo });3 
A := Theorem 7(X, Corollary 2 (X, yo, y1)) f538)3 
B := suitable function of A, see below; 
p(t) = Lemma 5 («/B); 
q(t) = 2Mp(t/2M); 
{||x — Xol — g(x — Xo)| < &/B for all x } 
r(x) ‘= substitute cq(x — x,) for each factor g(x) = c|x — Xo 
of each term of A; 
{B was chosen so that |q(x — x9) — |x — Xo|| < &/B 
implies that |r(x) — A(x)| < ye} 


end; 
Corollary 3 = r; 
end. 
Clearly it would be an extremely interesting project from the standpoint of high-level program- 
ming language design to find an elegant notation in which Bishop’s constructions are both 
readable and explicit. 
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Tentative Conclusions. What insights do we get from these nine randomly selected examples of 
mathematics? In the first place, they point out something that should have been obvious to me 
from the start, that there is no such thing as “mathematical thinking” as a single isolated concept; 
mathematicians use a variety of modes of thought, not just one. My question about computer-sci- 
ence thinking as distinct from math thinking therefore needs to be reformulated. Indeed, when I 
reflect further about my student days, I realize that I would not only wear my CS hat when 
programming computers and my math hat when taking courses, I also had other hats representing 
various modes of thought that I used when I was editing a student magazine or when I was acting 
as officer of a fraternity, etc. 

Thus, it seems better to think of a model in which people have a certain number of different 
modes of thought, something like genes in DNA. It is probable that computer scientists and 
mathematicians overlap in the sense that they share several modes of thought, yet there are other 
modes peculiar to one or the other. Under this model, different areas of science can be 
characterized by different “personality profiles.” 

After trying to distill out different kinds of reasoning in the nine examples, I came up with nine 
categories that I tentatively would diagram as follows. (Two x’s means a strong use of some 
reasoning mode, while one x indicates a mild connection.) 
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These nine categories aren’t precisely defined, and they may represent combinations of more 
fundamental things; for example, both formula manipulation and generalization involve the 
general idea of pattern recognition (spotting certain kinds of order). Another fundamental 
distinction might be in the type of “visualization” needed, whether it be geometric or abstract or 
recursive, etc. Thus, I am not at all certain of the categories; they are simply put forward as a 
basis for discussion. 

I have added a tenth row to the table labeled “algorithmic thinking,” trying to make it 
represent my perception of the most typical thought processes used by a computer scientist. Since 
computer science is such a young discipline, I don’t know what books would be appropriate 
candidates from which to examine page 100. It seems to me that most of the modes of thought 
listed in the table are common in computer science as well as in mathematics, with the notable 
exception of “dealing with infinity.” Infinite-dimensional spaces seem to be of little relevance for 
computer scientists, although most other branches of mathematics have been extensively applied 
in many different ways. 
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Computer scientists will notice, I think, that two types of thinking are absent from the 
examples we have studied, so this may be what separates mathematicians from computer 
scientists. In the first place, there is almost no notion of “complexity” or economy of operation in 
what we have discussed. Bishop’s mathematics is constructive, but it does not have all the 
ingredients of an algorithm because it ignores the “cost’’ of the constructions. If we carry out the 
details of his Stone-Weierstrass theorem with respect to simple functions, we are likely to wind up 
with a polynomial approximation of degree 10°, say, although a suitable polynomial of degree 6 
could have been found by a more efficient scheme. 

The other missing concept is related to the “assignment operation” = , which changes values 
of quantities. More precisely, I would say that the missing concept is the dynamic notion of the 
state of a process: “How did I get here? What is true now? What should happen next if I’m going 
to get to the end?” Changing states of affairs, or snapshots of a computation, seem to be 
intimately related to algorithms and algorithmic thinking. Many of the concepts of data struc- 
tures, which are so fundamental in computer science, depend very heavily on an ability to reason 
about the notion of process states, and we rely on this notion also when studying the interaction 
of processes that are acting simultaneously. 

Our nine examples don’t have anything resembling “n = n + 1”, except for Euler’s discussion 
where he essentially begins by setting x = x — 4A. The assignment operations in Bishop’s 
constructions aren’t really assignments, they are simply definitions of quantities, and those 
definitions won’t be changed. This discrepancy between classical mathematics and computer 
science is well illustrated by the fact that Burks, Goldstine, and von Neumann did not actually 
have the notion of assignment in their early notes on computer programming; they used a curious 
in-between concept instead (see [7]). 

A computer scientist also seems to be much more willing to deal with a multitude of quite 
different cases than does a traditional mathematician. Data structures in computer science needn’t 
be homogeneous, and algorithms can involve many different kinds of steps. Sometimes that is a 
weakness of computer scientists, because we don’t try as hard as we should to find uniformity; but 
sometimes it is a strength because we can deal fluently with concepts that are inherently 
non-uniform. 

At the Urgench symposium I had the pleasure of hearing a stimulating lecture by G. S. Tseytin 
about these issues; I heartily recommend his paper [13]. 
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tedious drill and leave the teacher free to discuss the important conceptual aspects of integration. I 
take the consensus to be that change of variable, integration by parts, and the use of partial 
fractions must be taught: change of variable because of its theoretical importance and because it 
gives the student a chance to see how a hard problem becomes easy when looked at in the right 
way; integration by parts because of its constant use in both pure and applied mathematics as a 
theoretical tool; and partial fractions because one can show how only two transcendental 
functions (the logarithm and the arctangent) are necessary for integrating all rational functions. 

Having agreed to this, what problems do we discuss to make our points? It seems to me they 
should be conceptual problems that the student can appreciate. I offer one example that I have 
found useful. 

The students know how to compute the area, 7r”, and the perimeter, 277, of a circle of radius 
r. The good students may even have wondered why this same mysterious 7 appears in both 
formulae. For those who have not, this question gets their attention. 

Assuming the students know how to use the definite integral to find area and arclength, one 


has 
_ r 2_ 7) _ r r 
A=4] Vr x- dx, P ‘—S dx. 


Now, the change of variable, x = ru, gives immediately that 
= (4f°vi — x? dx} 
0 


and 


1 1 
p= 2{2/ Tas ax} 


So, it is a question of relating 


[vi-x dx to [) Ta oe 


But the identity 1 = (1 — x*) + x” and integration by parts yield 


[ Faas dx = 2f'Vi x4 dx. 


We’ve actually proved the existence of a number, 7 say, with A = wr and P = 27r; we can 
calculate its approximate value by numerical integration; and we’ve done so using change of 
variable and integration by parts. Who could ask for anything more? 


2 


142. MISCELLANEA 


Beauty is perfect, and perfection (such is human nature) holds our attention but for a little 
while. The mathematician who after seeing Phédre asked: “Qu’est-ce que ca prouve?’ was not 
such a fool as he has been generally made out. 


—W. Somerset Maugham, Cakes and Ale. 


ANSWER TO PHOTOS ON PAGE 182 


They are the brothers Alfred and Richard Brauer. 


THE ANATOMY OF LOW DIMENSIONAL STABLE SINGULARITIES 


JONATHAN BRITT 
Department of Mathematics and Computing, Southampton College of Higher Education, Southampton 
S09 4WW, England 


Preface. In this article we examine the geometry of certain sets, known as singularity sets, 
associated with a particular mapping called the Whitney cusp mapping. We will also be looking at 
the geometrical structure of the inverse image of those sets under that mapping. 

Initially we will consider the mapping from R? to itself and then from R? to R*. We will then 
examine the most general case of the Whitney cusp mapping that is from R” to R*. Finally we 
will explore, in the same way, the structure of the swallowtail mapping from R? to itself. 


Fic. 1 


1. Introduction. An easy way to visualise some types of differentiable mappings from a plane 
to a plane is to imagine the first plane being shaped into a smooth curved surface which is then 
projected downwards onto the other plane as in Fig. 1. As the mapping f is differentiable, at each 
point x it has a derivative Df(x) which will be the 2 x 2 matrix of partial derivatives, providing a 
“linear approximation” to the mapping at each point. This approximation is seen by looking at 
what the derivative does to the tangent plane at the point p on the surface. If we imagine the 
tangent plane at p to be made of some opaque material and that a light is shining down from 
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Southampton College of Higher Education. I was awarded my B.A. and M.Sc. at Watwick University in 1969 and 
1971, respectively, and my Ph.D. at Southampton University in 1982. My research, which was carried on part time, 
was into the application of differential topology, in particular singularity theory, to mechanical systems with 
symmetry, such as the motion of a spinning top. I am still actively working in this field. 

About a third of my teaching is Data Processing and Computing and I am particularly interested in the teaching 
of programming. 

In my spare time I am involved both with Hospital Radio and an Amateur Dramatic group in the New Forest 
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My favourite way of relaxing is either to go for a ride in the New Forest (on a horse) or to go for a lesson to learn 
how to ride correctly! I have three children and an Afghan hound which I can occasionally be seen chasing, in a vain 
attempt to get him to come when I call! 
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above, then the image of a rectangular piece of the tangent plane can be visualised as the area in 
shadow on the plane below. 

However, as in the example shown in Fig. 1, there may be points at which the tangent plane is 
vertical, and so the images of those tangent planes will be lines, not planes. (Analytically, these 
points are where the derivative has rank less than 2.) Such points are called singular points while 
points such as p, where the derivative maps tangent planes to planes, are called regular points. 
These terms will be defined precisely a little later. The mapping behaves very differently at 
singular points from the way it does at regular ones, where locally it behaves as if it were a 
mapping onto its codomain. Thus, in order to visualise fully the behaviour of a mapping, we need 
to understand what is happening at its singular points. This information provides a kind of 
skeleton on which to hang the clothing of regular points! 

Sometimes when a mapping is perturbed slightly its singular points disappear entirely or are 
changed to another form, and in such cases the mapping is called unstable. If, however, the 
singular points cannot be removed or altered by perturbations of the mapping or, more precisely, 
if they are merely replaced by equivalent ones nearby, then the mapping is stable. 

The natural question arises, can any unstable mapping be perturbed into a stable one? For 
mappings of the plane to itself the answer is yes, and in fact H. Whitney proved in his paper [10] 
that for differentiable mappings from R” to R”, provided n and p are less than or equal to 5, the 
answer is yes as well. 

When nv and/or p are greater than 5 the position is somewhat more complicated and the 
detailed results can be found in J. Callahan’s paper [2] which is a very easy to read and intuitive 
introduction to the study of singularities. For a more formal and detailed look at the theory see 
either the book by C. Gibson [4] or that by M. Golubitsky and V. Guillemin [5]. 

When studying a mapping, useful information can be gained by examining not only the set of 
singular points (the singularity set) itself but also the image of that set under the mapping. 

For example, consider the mapping from the plane to itself given by: 


f(x,y) = (x xy — y*). 


A way of seeing this map is given in Fig. 2. The (x, y)-plane is first pleated at the origin and 
then projected onto the (u, v)-plane. Since the derivative of f is 


1 0 
Dix.) = [ <a} 


the singularity set, that is where the derivative has rank less than 2, is the parabola x = 3y’. This 
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is the projection “downwards” of the fold lines in Fig. 2, together with the origin which 
corresponds to the pleat itself. The image of the singularity set is a curve with a cusp point whose 
equation is 4u? = 27v’, and it is this that has given the mapping the name of the Whitney Cusp. 
Now let us look at the full inverse image of the cusp curve, that is all the points taken by f to 
points on the cusp curve. This will help us to see more explicitly how f behaves geometrically. In 
algebraic terms it will provide insight into the configurations of solutions of the equations 


f(x,y) = (4,5) 
for various choices of a and b, because the number of solutions to those equations changes 
precisely where (x, y) crosses the inverse image of the cusp curve. Note that this inverse image 
includes the singularity set with some other points besides. 

To see those other points imagine parallel rays of light shining horizontally from all of the 
(u,v)-plane except the inside of the cusp curve. If the pleated surface is transparent, then a 
portion of that surface will be in darkness. Looking up from the (x, y)-plane to the surface, we see 
an area in shadow on the plane. The boundary of that area is the other part of the inverse image 
of the cusp curve. This is shown in Fig. 3. 


Fic. 3 


Our interest in the Whitney cusp mapping arose, in fact, from studying the motion of a 
symmetric top spinning under gravity. The top can perform several different kinds of peculiar 
motion depending on its energy E and angular momentum J about the vertical. These two can be 
expressed in terms of four variables, namely the angle between the top’s symmetry axis and the 
vertical axis, the time derivative of that angle and the angular velocities around those two axes. In 
this way we get a mapping from R* to the (E, J)-plane, which close to its singularity set turns out 
to be equivalent to a generalisation of the Whitney cusp mapping that we looked at above. So 
information about the inverse image of the generalised cusp map gives us insight into how 
different values of the variables will affect the motion of the top. Further details can be found in 
[1]. 

Now we need some formal definitions and notation. We will consider differentiable mappings 
from R” to R”, where n > p. Such a map f has a derivative Df(x) at each point x of R”. The 
derivative is a p X n matrix whose ijth element is the partial derivative of the ith component of 
the mapping with respect to the jth variable. It necessarily has rank less than or equal to p. We 
classify singular points of such a map by the amount that the derivative loses rank at that point. 


DEFINITION. The singularity set S,(f) is the set of points in R” where the derivative of f has 
rank exactly equal to p — k. 
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Next we will say what we mean by a stable mapping. To start with, we will call two maps from 
R” to R? equivalent when there are suitable coordinate changes in both the domain R” and the 
codomain R?” which convert one map into the other. These coordinate changes will be differentia- 
ble maps with differentiable inverses; such maps are called diffeomorphisms. More precisely: 


DEFINITION. Two maps f and g from R” to R? are equivalent if there are diffeomorphisms h 
from R” to itself, and k from R” to itself, such that g = ko foh. 


We are more often concerned with /ocal equivalence where we require g = kof oh to hold 
only in the neighborhood of a particular point: 


DEFINITION. A map f from R” to R?” is stable if every map g sufficiently close to f is 
equivalent to f. Local stability at a point is defined in the same way but using local equivalence. 


To explain the term “sufficiently close” we need a topology on the set of differential maps 
from R” to R”. Such topologies with reasonable properties exist, but it is beyond the scope of this 
article to go into the details of them. Suffice it to say here that for two maps to be considered 
“close” their values and their derivatives to some high order must all be close throughout the 
region under consideration. Full details can be found on page 42 of [5]. 

We are going to be looking exclusively (until Section 5) at mappings into the plane and so the 
only singularity sets that can exist are S,(f) and S,(f). However, it is implicit in the work done 
by Whitney in [10] that: 


THEOREM. If f is a mapping from R” to the plane and S,(f) is non-empty, then f is unstable. 


In other words, if we have a singular point belonging to S,(f), then by perturbing the mapping 
we can either change the singularity to one belonging to S,(f) or destroy it altogether. So we only 
need to study singularities belonging to S,(/). 


Sketch of the proof of the Theorem. Since Df(x) is given as a2 X n matrix, we look at the space 
of all 2 X n matrices. In that space the set of all matrices of rank 0 is just one point, the zero 
matrix, while the set of matrices with rank 1, which is given by n — 1 equations, is a “hyper- 
surface” or manifold T of codimension n — 1. We can think of x > Df(x) as a mapping from R” 
into the space of matrices, giving an n-dimensional manifold M parametrised by x. Typically 
such a manifold M will meet a codimension n — 1 manifold T in a 1-dimensional curve but will 
not contain the zero matrix point at all. If S,(f) is not empty, then the manifold M does contain 
the zero point and a small perturbation will, by the remark above, cause the manifold to miss that 
point and thus destroy the S,(f) singularity. In contrast, were M meets the manifold T of rank 1 
matrices and does so in general position, then a small perturbation will change the point, but not 
the fact, of intersection. This shows that singularities belonging to S,(/) are unstable, so that 
stable mappings can have singularities belonging only to S,(/). | 


In the case of a mapping into the plane the set S,(f) will consist of curves and in the more 
general case of a mapping from R” to R?” it will be a p — 1 dimensional manifold. 

Now in order to scrutinize the singulaxities of f more closely we restrict f to S,(f), on which it 
has rank at most p — 1, and see at what points, if any, f restricted to S,(f) drops rank by 1. (We 
are in fact using one form of the implicit function theorem which tells us that we may perform 
these operations.) The set of such points is denoted S, ,(f). This idea can be generalised; for 
details of how see pages 174-190 of [4]. 

After this groundwork we go back to our example and examine the cusp mapping in more 
detail. 


2. The Whitney cusp mapping from the plane to itself. In 1955 H. Whitney [9] proved that the 
most complicated singularity that can arise in a stable mapping of the plane to itself is locally 
equivalent to the mapping which we have already met, that is: 


f(x,y) = (x, xy — y*). 
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From our previous discussion we can see that the set S,(f) is just the parabola x = 3y” and 
the set S, ,(/) is the origin itself. The image of S,(f) under f is the cusp curve 4u’ — 270? = 0 
and the image of S, ,(f) is the actual cusp point u = v = 0. 

Points inside the cusp curve are the image of three distinct points in the (x, y)-plane, while 
those outside have only one inverse image point. Those on the cusp curve, except for the origin, 
have two distinct inverse image points one of which lies on the singularity set and the other on the 
second curve we found in the Introduction. 

Finding the equation of this second curve is straightforward. We replace u by x and v by 
xy — y° in the equation of the cusp curve, to get 


4x? - 27( xy — yy = 0, 
that is 
(x - 3y?)"(4x — 3y”) =0. 
Hence the equation of this second curve is 4x = 3y* and it too is a parabola. 
Thus the complete inverse image of the cusp curve consists of these two parabolae. We next 
investigate an analogous cusp mapping from R° to the plane, and we shall find that these two 


parabolae embed in the new inverse image of the cusp curve in a most interesting and unexpected 
way. 


3. The cusp mapping from R°? to the plane. Again for a stable mapping from R? to R? the 
most complicated singularity that can exist is a so-called cusp point. In his paper of 1958 [10], 
Whitney gave a standard form for this mapping, namely 


f(x,y,z) =(x,xy — y? — 27), 
where now the cusp point is at the origin. 
By means of a similar calculation as was done for the mapping from the plane to itself, it is 
easy to see that 


Si(f) = {(x,¥,0); x = 3y?} 
and 


Sii(f) — {(0,0,0)}. 


As before, the image of S,(f) is the cusp curve 4u? — 27v* = 0. Since we are dealing with a 
mapping from R° to the plane, the inverse image of a point will in general be a curve. This will be 
true for points on the cusp curve itself as well, although we might expect the curves we get from 
such points to be “less well behaved” in some sense than those that come from points not on that 
curve. 

We will start by finding the inverse image of the origin. A simple calculation shows that this is 
given by 


{(0, 952); y° = —z*}, 


which is a cusp curve lying in the ( y, z)-plane. 

If we were to restrict the mapping to the plane z = 0 we would be back in the situation we 
discussed in Section 2. Hence there will be two parabolae lying in that plane and mapping onto 
the cusp curve, namely S,(/) and 


{(x,y,0); 4x = 3y*}. 


We can write down the equation of the full inverse image of the cusp curve, just as we did in 
Section 2. It is: 


(3.1) 4x3 — 27(xy — y? — 2?) =0. 
To get some idea of what this might look like it is helpful to take slices parallel to the 
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y>O y=0 y<0 
(A double isolated point exists.) 


whisker 


swallow tail point 


cusp lines 


h 


(a) 


(b) 


Fic. 5 


(x, z)-plane. Choosing different constant values of y, we get pictures of the kind shown in Fig. 4. 
These look like cross sections of the swallowtail as shown in Callahan [3] or on page 177 of Poston 
and Stewart [8] and as sketched in Fig. 5(a). In the two references given above the swallowtail is 
described as the bifurcation set of the real swallowtail catastrophe. However it is a classical object 
extensively studied in the last century as a set of points in (a, b,c) space for which the quartic 
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equation 


x* + ax? + bx+c=0 
had repeated roots, real or complex. 

There is a slight difference between the two uses—in the second case there is an isolated 
whisker (corresponding to repeated complex roots) which together with the line of self intersection 
forms a parabola. In the bifurcation set picture this whisker does not appear, because these 
complex roots are irrelevant to real bifurcations. For a fuller discussion of the whisker we refer the 
reader to pages 130-132 of Poston and Stewart [7]. (The whisker will in fact turn out to be half of 
S,(f), the other half being the line of self intersection when y < 0.) 

The double isolated point which occurs in Fig. 4 when y > 0 corresponds to this whisker, so 
we suspect that the inverse image of the cusp curve as given by Equation 3.1 will be equivalent to 
the set of points in (a, b, c) space where 


x* + ax? + bx +c¢=0 


has repeated roots. This set is given algebraically by the vanishing of the discriminant A, where 


A = S$? - 27T’, 
2 
a 
(3.2) =p t® 
ac b? a? 
Y= 6 16 216 


See for example page 120 of [7]. In fact we can prove: 


PROPOSITION. There is an origin preserving diffeomorphism from R° to itself which takes the set 
given by Equation 3.1 to that defined by Equation 3.2. 


Proof. A comparison of the two equations shows us that we may define the mapping by the 
bijection: 


— gis3e 

y=4 6” 
7-2 
1° 


A quick calculation shows that this map has non-vanishing J acobian, hence is a diffeomor- 
phism, and that it converts Equation 3.1 to Equation 3.2 and thus the proposition is proved. | 


We now know that the inverse image of the cusp curve is a swallowtail like that shown in Fig. 
5(a) but bent upwards slightly to the shape shown in Fig. 5(b). However, we would like more 
detail as to which points of the inverse image correspond to individual points on the cusp curve. 
To get this, using the fact that under the mapping wu is just equal to x, we intersect the cusp curve 
with a line u = k*. The inverse image of this line is just the plane x = k”, and so the curves in 
which that plane intersects the swallowtail will be the inverse images of the two points on the cusp 
curve and the line u = k*. Fig. 6(a) shows the intersection of the plane and the swallowtail 
consisting of two curves and a double isolated point. 

The coordinates of the two points where the line u = k* meets the cusp curve are (k?, 2k? /3/3) 
and (k*, —2k?/3,3), (where k° is taken to be positive). So the inverse images of the two points 


are respectively 
YB) NE 3 
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ges _kY _ 2k 
Jy 2B Jy B . 


These curves are sketched in Fig. 7, and it is easy to see that the inverse image of the point b 
with coordinates (k*, —2k?/3,3) is the curve with a point of self intersection whilst that of the 
point a with coordinates (k?,2k? /3,3) is the other curve together with the double isolated point. 

Having found the inverse image of points on the cusp curve we turn our attention to points 
lying inside and outside of that curve. Again we take the line u = k? and look at points on the 
line. Their inverse images will be in the plane x = k* as before. Consideration of Figs. 6(a) and 7, 
and a little bit of simple algebra, enables sketches of the inverse images to be made easily. These, 
together with the inverse images of points on the cusp curve, are shown in Fig. 8. Points outside 
the cusp curve will have only one curve as their inverse image while points inside will have two. 


and 


FIG. 6(a) 
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FIG. 6(b) 


One of those will be a closed curve around the whisker and the other an open curve in the “tail 
portion” of the space enclosed by the swallowtail. 

Restricting to z = 0, the behaviour described in Section 2 of points outside having an inverse 
image of one point and points inside having three points in their inverse image can be clearly seen. 
Furthermore, Fig. 8 shows that the inverse images of points not on the cusp curve are regular 
curves, but for points on the cusp curve the inverse images are curves with singular points, either a 
double isolated point or a point of self-intersection. (Again we are using the implicit function 
theorem here.) This illustrates our earlier remark about expecting the inverse image of the cusp 
curve points to be “less well behaved” in some way. 

We can summarise all of this by looking back at Fig. 6 and imagining a tide rising around the 
swallowtail. Before the tide reaches the swallowtail, that is when x = —k?’, the intersection of the 
plane and the swallowtail is empty as is that of the line u = —k* and the cusp curve. This is 
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u= —k? 


4 


FIG. 6(c) 


shown in Fig. 6(c). The tide rises until the water just starts to lap at the feet of the swallowtail, 
then the cusp curve on which the swallowtail sits is clearly seen as the inverse image of the cusp 
point in the (u, v)-plane, see Fig. 6(b). The inverse images of the two half lines v > 0 and v < 0 
are the two regions of the plane x = 0 separated by the cusp curve. 

As the tide rises further we get the picture shown in Fig. 6(a). The two points where the cusp 
curve intersects the line u = k” have the two curves with the isolated point that we described 
above as their inverse image. These two points divide the line u = k* into 3 segments labelled A, 
B and C in Fig. 6(a). The inverse image of the line segment A is the region of the plane labelled L 
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U 


FIG, 7 


while that of the line segment C is the region R. The two regions shown shaded form the inverse 
image of the central Section B of the line. 

Finally notice how as the rising tide changes Fig. 6(b) to Fig. 6(a) the cusp point in the plane 
x = 0 bifurcates into four separate points, two regular and two singular, while the cusp curve 
becomes two separate curves. 


4. The cusp mapping from R” to the plane. The general form for the Whitney cusp mapping 
from R” to the plane, as given in [10], is 


F(X Ys 25-065 Z_—2) = (x, xy —y? _ zy + z5 bee +z? 4). 


In fact in Whitney’s paper there is a “+” sign in front of z7 as well, but we choose the minus sign 
to maintain the convention used in Section 3. There the two possible choices give an equivalent 
geometric picture of the inverse image (by reflection in the xz plane and the wu axis). In this 
section what is important is whether the signs in front of the z* terms are all the same. 

If all the signs are the same then the cusp mapping can be put in the form: 


F(X, Vs 2 5+++5Zy-2) = (x, xy -y a zy ~— tt —z2_). 


We refer back to Fig. 8. The inverse image of the point 4 on the cusp curve will consist of an 
isolated point (k7, k/,3,0,...,0) and a “manifold of revolution” formed by revolving the curve 
shown in Fig. 8(d) as the inverse image of the point 4 around the y-axis in the hyperplane x = k?. 
Similarly the inverse image of the points 1, 2, 3 and 5 in Fig. 8 will be obtained by revolving 
around the y-axis their inverse images as given in Fig. 8, again in the hyperplane x = k*. Hence 
no essentially new features appear in the geometry of the inverse image set in this case. 

However, if some of the signs in front of the z* terms are plus and some minus then, by 
renumbering the coordinates if necessary, we can write the mapping as 
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Hence we can regard the mapping as being the composition of a cusp mapping g from R* to 
the plane and a mapping A from R” to R* given by 


2(x,y,p.q) =(x,xy— y? — p? + q’) 
and 


A(X, Y5215+-+52Z,-2) = (x,y,p,q). 
The inverse image under h of any point in the subset K of R*, where K is given by: 


K = {(x,y,p.9)3 p,q #0}, 


is just the “torus” S’~' x S"~"~?. However, for a point in R* with either p or q = 0, the inverse 
image degenerates into a sphere, S"~"~? or S’~', respectively. Hence, once we have found the 
inverse image of a point under the cusp mapping g, we take the Cartesian product of points that 
lie in the intersection of the inverse image and K with the “torus,” but for those points not in K 
we take the product with the appropriate sphere to generate the inverse image under f. (Of course 
if a point in R* has p and gq = 0, then its inverse image under h will just be one point.) We will 
refer to this process as forming a “generalised manifold of revolution”. 

It is straightforward to show that the inverse images under the mapping g of the points 1, 2, 3, 
4 and 5 on the line u = k? are as shown in Figs. 9(a) to (e). 

These pictures have several interesting features. Firstly, inverse images of a point are un- 
bounded in the positive y direction in contrast to the position in Section 3, where the inverse 
images always had y less than or equal to a particular value that depended on that point. 
Secondly, notice that the inverse images of all points (including the points 3 and 4) are now 
connected sets. 

We can see too that there is a duality in the pictures in the following sense: if we take the 
inverse image of the point 1 and rotate it through 90° in the ( p, g) plane and then through 180° 
in the (y, p) plane, we arrive at the inverse image of the point 5. The inverse images of the points 
2 and 4 are dual in the same way, while that of 3 is self dual. These geometric facts of course 
correspond to the equivalence between replacing y by —y and interchanging p and g in R* and 
replacing v by —v in R* under the mapping g. 

Lastly, let us see what happens if we, intersect each of the sets with planes g = constant. 
Because of the symmetry of the pictures we can restrict our attention to qg > 0. 

For the set in Fig. 9(a), when g = 0 we have a curve like that shown in Fig. 8(a). As q 
increases, the curve fills out its “waist” and expands, but remains a simple open curve. In part (b) 
of Fig. 9, for g = 0 we have the curve shown in Fig. 8(b), but for g > 0 we obtain a curve like that 
shown in part (a) of that figure. In the case of Fig. 9(c), for g = 0 we have the two curves shown 
in Fig. 8(c). As qg increases the open and closed curves approach each other until they collide to 
form the curve in Fig. 8(b), then as g continues to increase we again get curves like those seen in 
Fig. 9(a). This pattern holds true for all the pictures until finally for the inverse image of point 5 
as q increases from 0 all the different curves shown in Fig. 8 appear in order. (Of course the two 
curves that are the inverse images of points 2 and 4 appear only instantaneously.) Fig. 10 
illustrates these curves appearing by showing a tide rising around the surface of Fig. 9(e). 
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To see algebraically why this happens, compare the map g with the cusp map from R°? to the 
plane (Section 3) and take various different values of q’. 

In what we have discussed above we have covered the main features of the real geometry that 
can occur in the inverse image of a Whitney cusp mapping from R” to the plane. The inverse 
image of the cusp curve will consist of either a “manifold of revolution” of one of the curves in 
Fig. 8(b) or (d) or a “generalised manifold of revolution” of one of the surfaces in Fig. 9(b) or (d). 


5. Further directions. The cusp is the most complicated stable singularity that occurs in 
mapping from R” to the plane. However, if we look at maps from R° to itself another more 
complicated singularity can occur, namely the swallowtail. This is given up to equivalence by 


f(x, y,z) = (x,y, xz + yz? + 24), 


Staying with the equidimensional case, as the dimension increases by one there is precisely one 
new type of stable singularity which appears. If we now increase the dimension of the domain but 
leave that of the codomain fixed, then the number of different types of singularities remains the 
same. An explicit formula for a general form for all these mappings can be found in the paper by 
B. Morin [6]. 

A possible next step then would be to examine the inverse images of the singularity sets of 
these higher dimensional singularities. For the swallowtail mapping from R? to itself the approach 
that we have used so far is quite feasible, but once we go beyond that case the algebra involved 
becomes fairly forbidding and a more general approach might offer more chance of success. 

So we will conclude by giving a very brief description of what does happen in the case of the 
swallowtail mapping from R° to itself. With f as given above, the singularity set S,(f/) is the 
cubic surface with equation 

4z7>+2yz+x=0 
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shown in Fig. 11(a), and its image under f is the swallowtail set we saw in Fig. 5(a) but without 
the whisker. (This set is redrawn in Fig. 11(b).) 

When it comes to finding the inverse image of this singularity set, the easiest way to proceed is 
to take v = constant slices and look at the inverse images of the curves that this gives. For v > 0 
no new points at all appear. 
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fold line 


fold line 


image of the fold line 


image of the fold line 


FIG. 12. (The reader is encouraged to take a piece of acetate and draw on it the ellipse. Then by experimenting with 
folding the sheet and looking down from above he will see how the double duck billed loop is formed.) 


1985] 


THE ANATOMY OF LOW DIMENSIONAL STABLE SINGULARITIES 


Si(¢) S1($) 


$(S1(¢)) $(S(¢)) 
Fic. 13 


Si(f) 


Fic. 14 


199 


200 JONATHAN BRITT [March 


Fic. 15 


Fic. 16 


1985] THE ANATOMY OF LOW DIMENSIONAL STABLE SINGULARITIES 201 


However, if we take a slice v = —k*/2 through the swallowtail and try to find the inverse 
image of the curve that this gives, we find that those points not belonging to S,(/) are given by 
the equations: 


x=k?r— 4r 


and 


Although these equations seem at first difficult to interpret geometrically, we can do so by 
rewriting the second equation as 


k? 
z*+2rz+ are =, 


which gives an ellipse in the (7, z) plane. 
Then if we consider a mapping ¢ from the plane to itself defined by 


o(r,z) =(k’r - 4r’,z), 


the set of points we are interested in is the image of the ellipse under ¢. If we analyse ¢ using the 
techniques discussed above, we find that ¢ has two lines of fold points, which together make up 
S,(), and no other singularities at all. The set S,(@) is given by 


Si() = {(r.2): r= +55}: 


Fig. 12 gives a visual interpretation of how the mapping ¢ behaves. In particular, notice that 
when the ellipse crosses the fold line obliquely the image of the ellipse is a line tangential to the 
image of the fold line, but where the ellipse crosses perpendicular to the fold line the image of the 
ellipse is a cusp (though not, however, a cusp arising from a Whitney cusp map!) Fig. 13 gives an 
accurate drawing of both the ellipse and its image under ¢, a double duck billed loop. 

The complete inverse image of the v = —k*/2 slice through the swallowtail is drawn in Fig. 
14. For details of how the duck billed loop maps to the swallowtail see Fig. 15. Putting this 
information together enables us to sketch the entire inverse image of the swallowtail set in Fig. 16. 


Acknowledgements and Dedication. I would like to thank David Chillingworth for all the help and advice he gave 
to me in the writing of this paper, which is dedicated to the memory of my father Albert Britt who died suddenly on 
27th January 1984. 


References 


1. J. P. Britt, Singularity theory and geometry in the motion of a top, Ph.D. Thesis, Southampton University, 
1982. 
J. J. Callahan, Singularities and plane maps, this MONTHLY, 81 (1974) 211-240. 
, Singularities and plane maps J], this MONTHLY, 84 (1977) 765-803. 
C. G. Gibson, Singular Points of Smooth Mappings, Research notes in Mathematics 25, Pitman, London, 


2. 
3. 
4, 
1979. 
5. M. Golubitsky and V. Guillemin, Stable Mappings and Their Singularities, Graduate Texts in Mathematics 
14, Springer-Verlag, New York, 1973. 
6. B. Morin, Formes canoniques des singularités d’une application différentiable, C. R. Acad. Sci. Paris, t260 
(1965) 5662-5665, 6503-6506. 
7. T. Poston and I. N. Stewart, Taylor Expansion and Catastrophes, Research Notes in Mathematics 7, Pitman, 
London, 1976. 
8. , Catastrophe Theory and its Applications, Pitman, London, 1978. 
9. H. Whitney, On Singularities of mappings of Euclidean spaces, I. Mappings of the plane into the plane, Ann 
of Math., 62(1955) 374-410. 
10. , singularities of mappings of Euclidean spaces, in Symposium internacional de topologia algebraica, 
285-301, Universidad Nacional Autonoma de Mexico and UNESCO, Mexico City, 1958. 


CHARACTERIZING MAXIMALLY LOOPED CLOSED CURVES 


F. J. CRAVEIRO DE CARVALHO* 
Departamento de Matematica, Universidade de Coimbra, 3000 Coimbra, Portugal 


1. Introduction. It is easy to convince oneself after a few minutes of experimental sketching 
that there are no analogous curves to those shown in Fig. 1 with an even number of double points. 


Fic. 1 


We shall prove that this is indeed the case as a corollary to Theorem 3.1. The relevant property 
of the curves of Fig. 1 is that they are “maximally looped”. We also calculate the rotation number 
of maximally looped curves, and compute the number of double tangents for those without 
inflection points. 


2. Definitions. A closed curve is a C' immersion f: S' > R?. A pair { x, y} of distinct points 
of S' is a double point, crossing, or self-intersection of f if f(x) = f(y). We assume that no three 
distinct points of S' have the same image and that f is se/f-transverse. By self-transverse we mean 
that if {x, y} is a double point, then the tangents to f at x and y are distinct. These restrictions 
are not very serious. In fact, with a suitable topology for the set of maps g:S' > R’, the 
immersions we are interested in form an open and dense subset. According to [1] and [14], they 
are called clean immersions and normal curves respectively. In this note the immersions are 
always non-injective. 

By composing with g: R > S' given by g(t) = (cos2at, sin2at), we can consider f to be a 
periodic map f: R — R?’ with period 1. 

A loop of f is a restriction of f to a closed interval [a,b] such that f|[a, b) is injective and 
f(a) = f(b). We shall not distinguish between two loops of f with the same image in R?. 

Let {x,, ¥,},---»{X,> y, } be the n pairs of points of [0,1) corresponding to the double points 
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of f. We assume that x, < x, < --: <x, and x, < y,, for every i. It is clear that f has at most 
2n loops. If they exist, they are given by f|[x;, y,], f|Ly,, x; + 1], for i= 1,...,. In this case we 
say f is maximally looped. For example, the curves of Fig. 1 are maximally looped. 

The word, w(f), associated with f is written as follows: 


Arrange the points x,;, y,, i= 1,...,”, in their natural order in R to obtain a sequence 
Z12Z5...Z5,. Then 


w(f) = 2425...25,- 
The idea of associating the word w(/) with f goes back to Gauss [4]. He conjectured that if f 


has n double points, then, for each 7, the number of symbols between x, and y, is even. This 
conjecture was proved in [10]. See also [8], [11], [13]. 


3. A characterization of maximally looped curves. 


PROPOSITION 3.1. Let f:.S' — R* have n double points. Then f has 2n loops if and only if 
Wf) = XyXq--- Xp, ViY2-+ + Vn 


Proof. If w(f) = X,X5..-X, 1 )2---Y,, then f has 2n loops. 
On the other hand, suppose that w(/) is not of this form. Then at least one of the y’s must be 
out of order. This can occur in two ways: 


(1) there is a pair {x;, y,} such that x, < y, < x,, or 
(2) there are 2 pairs {x,, y,;}, {x,, y,}, with i <j, such that y appears before y, in w(f), Le., 
Xx, SX, SV SY 


In case (2) we do not have 2n loops since f|[x,, y,] is not a loop (x,, y, © [x,, y;] and yet 
f(x,) = f(;)). As far as case (1) is concerned, i < n (since x, < y,) and y, <x, <y, <x, + 1, 
thus f|[ y,, x; + 1] is not a loop. 


THEOREM 3.1. If f: S' > R? has n double points and more than n loops, then n is odd. 


Proof. Since the number of loops is greater than the number of double points, at least one 
double point gives rise to two loops. We may assume without loss of generality that { x,, y,} gives 
rise to the two loops f|[x,, y,] and f|[.y,, x, + 1]. Then 


(1) y, # x, (otherwise y, < x, < y, < x, + land f|[y,, x, + 1] is not a loop) and 
(2) y, € y, (otherwise x, < x; < y, < y, and f|[x,, y,] 1s not a loop). 


Thus w(f) = X1...X,Vi2Zn42+++Zon, that 18 2,4552Z)435--+5Zo, ATE Yo, ¥3,---,y, IN Some 
order. 
Successive arcs in the sequence 


(*) F\(15Zn42)9 FV (Zna2s nea) 00+ ofl (Zon—19 Zan) of (Zan + 1) 


must lie in different components of R* — f([x,, y,] since f|(z,;,z,,) crosses f([x,, y,]) transver- 
sally at f(z;,,),fori=n+ 1,...,2n‘— 1, and Zon41 = X, + 1 (See Fig. 2). 

Also by transversality f|(),,2Z,,,5) and f|(z,,,x, + 1) must lie in the same component of 
R* — f({x,, y,]) (up to orientation, the two possibilities are shown in Fig. 3). 

Thus the arcs 


(+ *) AW Zn42) Al Ene 3s Znt adore of (Zan—25Zan—1) FI (Zon %1 + DV) 
must lie in one component of R* — f([x,, y,]) and the rest of the arcs 
(* * «) P\(2n4292n43) of (Ene as Znesdo<+ oF (Zan Z2n) 


must lie in the other component. 
Since there is one more arc in (* *) than in (* * *) the total number, n, of arcs in (*) must be 


odd. 
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REMARK. Theorem 3.1 is, in fact, a corollary to Nagy’s result. It follows immediately after 
having established w(f) = x,x,...X,)1Z)42--+-29,- However we have preferred to give an 
independent proof. 


An immediate consequence of Theorem 3.1 is 
COROLLARY 3.1. A maximally looped curve must have an odd number of double points. 


It is natural to ask the following question. Suppose k is an integer such that 2 < k < n,if n is 
even, and 2 < k < 2n, if n is odd. Is there a curve with k loops? 

We do not know the complete answer to this question but something can be said. If k = n > 2, 
then Fig. 4 shows that there are curves with n double points and n loops. 

From experimental sketching evidence it seems reasonable to conjecture that if 2<k <n, 
then there are closed curves with k loops. However, if k > n that is not the case. For instance, for 
n = 3,if k > n and f has k loops, then k = 6. In fact, if k > n there must be a double point 
giving rise to 2 loops. We shall assume that it is {x,, y,}. Therefore w(f) = x,x,x3y,252Z¢. 
Nagy’s result implies z, = y, and z, = y,;. Consequently f has 6 loops. Similarly, if n = 5, then 
the only integers k > 5 for which there are closed curves with k loops are k = 6 and k = 10. The 
words are 


XyX_X4XqXsVyV2VsV4V3 OL XyX_XzX4Xs Vy V4V3V2Vs, ANA x, X_X3X4X5 V1 V2 V3 V4 Ys - 


Notice that there is no f with w(f) = x1%4%3X4X5 V1 4 Vs Yo V3- 


4. Rotation numbers and double tangents of maximally looped curves. For some of the ideas we 
deal with in this section references [3] and [9] are suitable. We start by recalling the notion of 
rotation number of a closed curve. Roughly speaking, it is the total net angle through which the 
tangent turns while traversing the curve. If g: R > R? with period L is an orientation-preserving 
reparametrization by arc-length of f, then the rotation number of f is given by {j-k(s) ds, where 
k(s) is the curvature of g at s. 

If we look again at Fig. 1, we see that for the curves in the top row the rotation number is 47 
or —4m, depending on the orientation, and is zero for the curves in the bottom row. The next 
theorem shows that for maximally looped curves these are the only possibilities. 


THEOREM 4.1. If f: S' — R? is such that f has 2n loops, then 
|rot f| = 0 or 4z, 


where rot f stands for the rotation number of f. 


Proof. Without loss of generality we may assume that f(R) lies entirely in one of the closed 
half-spaces determined by the tangent to f at 0. We shall use a result due to Whitney [14]. He has 
shown that 


rot f= 27[t+1+(n*-7 )], 


where n™ (resp., n_) is the number of positive (resp., negative) double points. We recall that a 
double point {x, y},x < y, is negative (resp., positive) if the vectors f’(x), f’(y) determine the 
same orientation as (1,0),(0,1) in R? (resp., the opposite orientation). 

We modify slightly the loop f:[x,,y,] > R? near x, and y, so that we obtain a C’ 
embedding g: S' > R* (see Fig. 5). 

Next we consider the two possible orientations of ¢(S‘). 


(1) S' is given the usual counterclockwise orientation and g(S') inherits the orientation 
induced by g. 
(2) g(S') is oriented as the boundary of a closed disc with the standard orientation in R* [7]. 


If the two orientations of g(S') agree, then each time f:[y,,x, + 1] > R* crosses g(S') from 
the outside (resp., inside) to the inside (resp., outside) of g(S') we have a negative (resp., positive) 
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double point. If the two orientations disagree, then we have positive and negative double points, 
respectively. Using Whitney’s formula above we conclude that |rot f| = 0 or 47. 


Note. The reader familiar with the notion of regular homotopy and the Whitney-Graustein 
theorem will see that up to orientation and regular homotopy the only maximally looped curves 
are those we show in Fig. 1 [14]. 


The curves in the top row of Fig. 1 have exactly n double tangents. Theorem 4.2 states that this 
is always true for maximally looped C* curves without inflection points. Such curves are examples 
of double ovaloids [12]. 

Let us suppose that f: R > R* is C*. Denoting the tangent to f at x € R by T., we say that a 
pair {x,y} C [0,1) of distinct points is a double tangent if 7, = 7). It is regular if the curvature 
of f at x and y is nonzero. If a double tangent is regular, then it is of type (a) or (b) (see Fig. 6). 


(a) (b) 
FIG. 6 


We say the double tangent is external in case (a) and internal in case (b). 

An inflection point x of f is regular if the curvature has nonzero derivative at x. 

Suppose that f: R > R? is self-transverse, and that the double tangents and inflection points 
are regular. Denote by E, J, n and F the numbers of external double tangents, internal double 
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tangents, double points, and inflection points, respectively. It is known ((2], [5]) that 


E=Ttn+5F 


THEOREM 4.2. If f: S' > R* has 2n loops and no inflection points, then f has exactly n double 
tangents. 


Proof. If f has no inflection points, then its curvature is always positive or always negative. 
Since the curvature of f is the same as the curvature of any orientation-preserving reparametriza- 
tion of f by arc-length, it follows that rotf is either > 0 or <0. Therefore |rot f| = 47. 
Halpern has shown in [6] that 


4n7I < (\rotf| — 47)(2|rot f| — 27). 
Consequently J = 0. The result now follows from the equality E = +n + $F. 


5. Final comments. There are a few directions in which we could try to pursue the ideas we 
dealt with in this note. We mention two. 


(1) What results hold if triple points, quadruple points, etc., are allowed? 

(2) Every closed curve without double points separates R? in 2 regions. This result was used in 
our work but it is not true for an arbitrary surface. What happens if R* is replaced by a torus, for 
instance? The situation is then quite different. While, for example, there is no f: S' > R* such 
that w(f) = x,X y,y>, it is possible to obtain f: S' — Torus with word x,x,y,y). Fig. 7 shows 
how. 


at 


FIG. 7 


We end up this note by calling the interested reader’s attention to the article by J. W. Bruce 
and P. J. Giblin, “Generic Geometry”, which appeared in this MONTHLY, vol. 90, 529-545. Their 
techniques and results are a different approach to global problems in the theory of smooth plane 
curves. 
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Numerical Analysis, T(16-17: 1, 2), L. Sparse Matrix Technology. Sergio Pissanetzky. Academic Pr, 
1984, xiii + 321 pp, $55. [ISBN: 0-12-557580-7] An introductory survey of the special numerical 
techniques available for sparse-matrix problems. Covers the solution of linear algebraic equations, 
eigenanalysis, and related topics. FORTRAN implementations of some algorithms are given. AQ 


Numerical Analysis, P. Lecture Notes in Mathematics-1066: Numerical Analysis. Ed: David F. Grif- 
fiths. Springer-Verlag, 1984, xi + 275 pp, $14 (P). [ISBN: 0-387-13344-5] Proceedings of the Tenth 
Dundee Biennial Conference on Numerical Analysis held June 28-July 1, 1983. Contains fifteen papers 


by the invited speakers that cover a broad range of topics. AO 
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Analysis. Ed: M. Cwikel, J. Peetre. Springer-Verlag, 1984, 239 pp, $11 (P). [ISBN: 0-387-13363-1] 
Proceedings of the conference held in Lund, Sweden, August 21-September 1, 1983. JAS 


Functional Analysis, $(16-17). Teoremi e problemi dell’analisi funzionale. Aleksandr Kirillov, 
Aleksej GviSiani. MIR, 1983, 358 pp. Italian translation of Russian problem book on functional 
analysis. Summary of major definitions and theorems, 851 problems, hints or solutions for all prob- 
lems. Covers measure and integration, linear spaces and operators, Fourier transforms, spectral 
theory. KS 
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Functional Analysis, S(16-18). Uniform Convexity, Hyperbolic Geometry, and Nonexpansive Mappings. 
Kazimierz Goebel, Simeon Reich. Pure & Appl. Math., V. 83. Dekker, 1984, ix + 170 pp, $35. [ISBN: 
0-8247-7223-7] From lectures by Goebel in a nonlinear functional analysis seminar at U.S.C. Relates 
nonexpansive and holomorphic mappings on complex, infinite-dimensional Banach spaces. Uses only 
metric methods and assumes only modest background in functional analysis and complex analysis. DA 


Analysis, T*(18: 4), S*, P. Groups and Geometric Analysis: Integral Geometry, Invariant Differential 
Operators, and Spherical Functions. Sigurdur Helgason. Pure & Appl. Math., V. 113. Academic Pr, 
1984, xix + 654 pp, $39.50. [ISBN: 0-12-338301-3] Treats harmonic analysis, integral geometry, and 
differential operators on homogéneous spaces: inversion theorems for general Fourier transforms 
(assumes the ordinary one), Radon transforms in any dimension, and spherical transforms. Local and 
global solvability, and eigenfunctions of invariant differential operators. Representations of Lie 
groups. Gives a few exercises, with solutions. Mentions applications. Claims to be a reference on 
integral geometry but ignores Gel’fand’s recent work where, e.g., the difference in Radon inversion 
formulae for even or odd dimensions is related to the different homotopy types of the unit sphere. 
Nevertheless, contains a lot of good mathematics. YN 


Analysis, P. Lecture Notes in Mathematics-1061: Séminaire de Théorie du Potentiel Paris, No. 7. F. 
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Hirsch, G. Mokobodzki. Springer-Verlag, 1984, iv + 281 pp, $14 (P). [ISBN: 0-387-13338-0] 


Analysis, P. Nondiscrete Induction and Iterative Processes. F-A. Potra, V. Ptak. Research Notes in 
Math., V. 103. Pitman, 1984, 207 pp, $21.95 (P). [ISBN: 0~273-08627-8] Surveys papers devoted to 
nondiscrete mathematical induction and sketches the present state of the method. Applications to 
numerical analysis, factorization theorems, transitivity in operator algebras, stability of openness 
for linear mappings and exactness of sequences, interpolation spaces and iterative construction of 


solutions to nonlinear partial differential equations. DA 


ee 


tions. Annie Cuyt. Springer-Verlag, 1984, ix + 138 pp, $8 (P). [ISBN: 0-387-13342-9] Generalizes 
the notion of Padé approximation for real functions to nonlinear operators between Banach spaces. 
Includes existence theorems, covariance properties, recurrence relations, comparison with other gen- 
eralizations, and applications. to solutions of nonlinear equations. RM 


Analysis, P. Lecture Notes in Mathematics-1065: Padé Approximants for Operators: Theory and Applica~ 


Analysis, P. Non-linear Integrable Systems--Classical Theory and Quantum Theory: Proceedings of RIMS 
Symposium. Ed: M. Jimbo, T. Miwa. World Sci (US Distr: Heyden & Son), 1983, vi + 289 pp, $29. 
[ISBN: 9971-950-32-4] Proceedings of a May 1981 symposium held at Kyoto University. LAS 


Differential Geometry, S**(18), P. Global Riemannian Geometry. Ed: T.J. Willmore, N.J. Hitchin. 
Ser. in Math. & Its Applic. Halsted Pr, 1984, 213 pp, $64.95. [ISBN: 0-470-20017-0] Twenty-two 
brief research-expository articles, forming an admirably well-written account of recent advances and 
open problems in Riemannian geometry. Sample: M.F. Atiyah, on Yang-Mills equations; §. Kobayashi, 
on Hermitian vector bundles; R. Penrose, on general relativity; S.T. Yau, on minimal surfaces. 
Definitely requires a good background in algebraic topology and differential geometry. Should bring 
a stimulating overview of the field to graduate students in quest of a dissertation topic. Rela~ 
tively expensive. YN 


and Submanifolds of Finite Type. Bang-yen Chen. 
World Sci (US Distr: Heyden & Son), 1984, xi + 352 pp, $19 (P); $37. [ISBN: 9971-966-03-4; 9971- 
966-02-6] First, surveys Sard’s, Morse”’s, de Rham’s, and Frobenius” theorems, Hodge decomposition, 
etc., sometimes without purpose (says on p. 165 that tight immersions are important, but never 
defines nor uses them). Good review, but not an introductory course. Second, relates the spectrum 
of eigenvalues of the Laplacian on a compact Riemannian manifold to geometric notions such as 
volume, mean curvature, and minimal area. Interesting mathematics for the reader who doesn’t mind 


dangling grammar and is already quite familiar with differential geometry. YN 


Differential Geometry, T(18: 1), S**, P**®,. Differentiable Manifolds: Forms, Currents, Harmonic 
Forms. Georges de Rham. Transl: F.R. Smith. Grund. der math. Wissenschaften, B. 266. Springer- 
Verlag, 1984, x + 167 pp, $28.50. [ISBN: 0-387-13463-8] Historical introduction by §8.S. Chern. 
Excellent English translation of a classic that fills the gap between graduate textbooks and 
research papers in differential geometry. Discusses topics on "currents," some still difficult to 
find elsewhere, but underlying numerous recent developments. Direct and inverse images of currents 
(fundamental to double fibrations); homology of currents on manifolds (applies to minimal surfaces); 
double currents (yield smoothing kernels and fundamental solutions to partial differential equa- 
tions); regularity of Green”’s operator and harmonic currents. Culminates with de Rham’s proof of 
Hodge”s decomposition. An enriching supplementary reading. YN 


Algebraic Topology, $(18), P*. Instantons and Four-Manifolds. Daniel S. Freed, Karen K. Uhlenbeck. 
Math. Sci. Res. Institute Pub., V. 1. Springer-Verlag, 1984, 232 pp, $15. [ISBN: 0-387~96036-8] The 
outcome of a seminar at the new Berkeley Institute which studied S. Donaldson’s proof that certain 
4-—dimensional manifolds, constructed by M. Freedman in his solution to the 4-dimensional Poincaré 
conjecture, are not smoothable. The proof of Donaldson’s result in topology uses the Yang-Mills 
equations of "ultra-modern" physics! Audience: the professional researcher. RB 


Algebraic Topology, S(18), P. The Smith Conjecture. Ed: John W. Morgan, Hyman Bass. Academic Pr, 
1984, xv + 243 pp, $49.50. [ISBN: 0-12-506980-4] Is the fixed point set of a periodic orientation 
preserving self-homeomorphism of S”’ always an unknotted circle? The 1978 affirmative answer for 
diffeomorphisms combined work of mathematicians in diverse areas within topology, geometry, algebra. 
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This volume includes presentations of the various pieces, as given by the principals at a 1979 
Columbia symposium, plus an introduction and generalizations, RB 
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Negligibility. J.J. Dijkstra. CWI Tract, V. 2. Math Centrum, 1984, iii + 109 pp, Dfl. 16,70 (P). 
[ISBN: 90~6196-268-4] An investigation in infinite-dimensional topology which looks at separable 
metrizable spaces that share many topological properties with £7, but yet are not homeomorphic to 
it. List of references. No exercises. CEC 


Topology, P. Lecture Notes in Mathematics-1060: Topology. Ed: L.D. Faddeev, A.A. Mal’cev. 
Springer-Verlag, 1984, vi + 389 pp, $18.50 (P). [ISBN: 0-387-13337-2] Proceedings of the Interna- 
tional Topological Conference held in Leningrad, August 23-27, 1982. Contains papers on general 
topology, algebraic topology, and "applications" to related mathematical areas. JAS 
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Operations Research and Mathematical Economics. Ed: G. Hammer, D. Pallaschke. Springer-Verlag, 
1984, ix + 478 pp, $27.50 (P). [ISBN: 0-387~-12918-9] Proceedings of the Eighth Symposium on Opera- 
tions Research held at the University of Karlsrtihe from August 22-25, 1983. Includes thirty-seven 
papers on optimization theory, control theory, mathematical economics, game theory, graph theory, 
fixed point theory, statistics, measure theory, and applications. AO 


Operations Research, $(17~18), P. Markovian Control Problems: Functional Equations and Algorithms. 
A. Federgruen. Math. Centre Tracts, No. 97. Math Centrum, 1983, vii + 212 pp, Dfl 31 (P). [ISBN: 
90~6196-165-3] Author’s 1978 doctoral dissertation on Markovian control and optimization. Treats 
Markovian decision theory and many~person stochastic games, emphasizing successive approximation 
methods and algorithms. RM 


Operations Research, P, L. A Model-Management Framework for Mathematical Programming. Kenneth H. 
Palmer, et al. Wiley, 1984, xix + 392 pp, $42.50. [ISBN: 0-471~80472-X] Describes a support system 
for mathematical programming applications developed at Exxon. All activities associated with linear 
programming (matrix generation, report writing, data management, model solution) are integrated into 
a single system. The system described is a model for the design of similar systems. AO 


Optimization, T(18), P. Explicit Methods of Optimization. Jean-Pierre Aubin. Gauthier-Villars (US 
Distr: Heyden & Son), 1984, 287 pp, $44. [ISBN: 2-04-015575-9] Theoretical introduction to optimi- 
zation with primary motivation applications to microeconomic theory. Discusses aspects of optimal 
control theory, quadratic and convex programming and applications to quadratic economic models. 
Appendix on the MODULECO package for macroeconomic models. RM 


Optimization, T(16-17), P. Optimization: Theory and Applications, Second Edition. S&.S. Rao. Wiley, 
1984, xiv + 747 pp, $24.95. [ISBN: 0-470-27483-2] Unaltered reprint of the original 1978 edition 


(TR, May 1981). LAS 
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zation. Ed: P. Thoft-Christensen. Springer-Verlag, 1984, ix + 934 pp, $49.50 (P). [ISBN: 0-387- 
13185-X] Ninety papers from the Eleventh IFIP Conference held in Copenhagen, Denmark, July 25-29, 
1983. AO 


Optimization, P. Progress in Combinatorial Optimization. Ed: William R. Pulleyblank. Academic Pr, 
1984, xi + 374 pp, $43.50. [ISBN: 0-12-566780-9] Twenty-one papers presented at a conference held at 
the University of Waterloo during the summer of 1982. Represents a broad cross~section of current 
work in the field. AO 


Optimization, T(16-17: 1), S, L. Introduction to Dynamic Programming. Leon and Mary W. Cooper. 
Intern. Ser. in Modern Appl. Math. & Comp. Sci. Pergamon Pr, 1981, ix + 289 pp, $19.50 (P). [ISBN: 
0-08-025064-5] Relatively self-contained, readable introduction to dynamic programming. Nice bal- 
ance among theory, methods, and applications, illustrated by interesting computational examples. 
Exercises, final chapter surveys some real applications. RM 

Probability, FP. Stochastic Functional Differential Equations. S~E A Mohammed. Research Notes in 
Math., V. 99. Pitman, 1984, vi + 245 pp, $21.95. (P), [ISBN: 0-273-08593-X] Existence of solutions 
and dependence on the initial process; Markov trajectories; the infinitesimal generator; regularity 
of the trajectory field; examples; further developments, problems and conjectures. Almost entirely 
new results. A first chapter provides background. DA 


Probability, P. Probabilistic Analysis and Related Topics, Volume 3. Ed: A.T. Bharucha-Reid. 
Academic Pr, 1983, x + 260 pp, $47. [ISBN: 0-12-095603-9] Five survey articles on topics pertaining 
to analytic properties of random functions, the third volume in a series of occasional collections 
of papers on this topic. LAS 


Probability, P. Lecture Notes in Mathematics-1064: Probability Measures on Groups VII. Ed: H. 
Heyer. Springer-Verlag, 1984, x + 588 pp, $27.70 (P). [ISBN: 0-387-13341-0] Proceedings of a 
conference held in Oberwolfach, West Germany, April 24-30, 1983. JAS 


Probability, T(15-17), S, P. Statistical Analysis of Stationary Time Series. Ulf Grenander, Murray 
Rosenblatt. Chelsea, 1984, 308 pp, $17.95. [ISBN: 0-8284-0320-1] A corrected version of the 1957 


edition. A “rigorous mathematical discussion" of the topic. FLW 
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Probability, P. Lecture Notes in Mathematics-1059: Séminaire de Probabilités XVIII: 1982/83. Ed: J. 


Azéma, M. Yor. Springer-Verlag, 1984, iv + 518 pp, $26 (P). [ISBN: 0-387-13332-1] Proceedings of 
the Paris Seminar for the academic year 1982-1983. JAS 


Statistics, T(15~17: 1, 2), S, L. Statistical Inference. Vijay K. Rohatgi. Wiley, 1984, xiv + 940 
pp, $44.95. [ISBN: 0-471-87126-5] Presupposes calculus. Introduces statistical ideas near the 
beginning and integrates parametric and non-parametric inferences. Considers the usual probability 
distributions and statistical tests, large sample theory, estimation, measures of association, con- 
tingency tables, and analysis of variance. No regression. Many exercises. FLW 


tion. Thomas H. and Ronald J. Wonnacott. Appl. Prob. & Stat. Wiley, 1984, xxii + 746 pp, $28.95. 
[ISBN: 0-471-09716-0] This new edition of this careful text (that presupposes only high school alge- 
bra) includes exploratory data analysis, the jackknife, path analysis, Box-Jenkins ARIMA forecast- 
ing, decision trees, and robust estimation as well as nonparametric texts, multiple regression, and 
Bayesian inference. (Second Edition, TR, March 1978.) FLW 


Statistics, S(14-18), L. Statistical Modelling. Warren Gilchrist. Wiley, 1984, xv + 339 pp, 
$29.95. [ISBN: 0-471-90380-9] Attempts to "bring together and present in a structured fashion those 
statistical and related ideas that are needed for the construction of quantitative models." 
Includes chapters on identification, estimation, validation, and iteration. No exercises. FLW 


Statistics, P. Proceedings of the Twenty-Ninth Conference on the Design of Experiments. US Army 
Research Office (PO Box 12211, Research Triangle Park, NC), 1983, xx + 356 pp, (P). Papers from an 
October 1983 conference held at the Uniformed Services University of Health Sciences in Bethesda, 


Maryland. LAS 


Statistics, T7(15~-17: 1, 2), S. Probability Theory and Mathematical Statistics for Engineers. V.S. 
Pugachev. Transl: I.V. Sinitsyna, P. Eykhoff. Pergamon Pr, 1984, xvii + 450 pp, $90. [ISBN: 0-08- 
029148-1] Probability, random variables and vectors, estimation of parameters and distributions, 
regression, analysis of variance, and brief treatments of some multivariate topics. No problem 
sets. Note the price! FLW 
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Social Research. John Fox. Wiley, 1984, xx + 449 pp, $39.95. [ISBN: 0-471-09913-9] Presupposes 
calculus, matrix algebra, and an introduction to probability and statistics. Tries to blend theory 
and applications, with informal derivation or intuitive justifications given for most results. 
Examples are real data. Takes up linear regression, analysis of variance, linear-model problems, 
structural-equation models, and log and log-linear models. FLW 


Computer Literacy, S*(13-18), L*. The RS-232 Solution. Joe Campbell. Sybex, 1984, xiv + 194 pp, 
$16.95 (P). [ISBN: 0-89588-140-3] A very thorough yet highly readable tutorial and reference for 
anybody who has experienced frustration when trying to connect computers or peripherals. It pro- 
vides a description of the RS-232C "standard," instructions for making a few simple but very effec- 
tive tools, and a number of case studies which are presented with remarkable good humor in the face 
of misinformation and poor design by manufacturers. Indeed, one of the aims of this book is to 
enable a run-of-the-mill user to get around the missing or wrong information that is usually pro- 
vided. This book is a jewel. JAS 


Computer Programming, T(13: 1). The Basics of BASIC. Alfredo C. Gomez. Holt, Rinehart & Winston, 
1983, xii + 303 pp, $17.95 (2). [ISBN: 0-03-063069-xX] Concise, no-nonsense, machine-independent, 
business-oriented introductory text. Includes chapters on strings, matrices, files. Discusses 
flowcharts and structured programming. Short exercise sets at ends of chapters. Mentions features 
of advanced versions in text. Quick reference guides in appendices for BASIC-PLUS, Applesoft II, 
TRS-80 Level II, Microsoft. KS 


Computer Programming, T(13: 1), S. A Guide to Programming in Applesoft, Second Edition. Bruce Pres- 
ley. D Van Nostrand Reinhold, 1984, x + 232 pp, $16.50 (P). [ISBN: 0-442-27249-9] Excellent text 
for beginning Applesoft programming. Many applications included but will need supplementary 
material if used for graphics. Written in clear, understandable language. TR 


everreneereets = GnDOERORIGDIaIAIRARIEOSS: «—««SSLALDEEEDEEEPEREEIQSOTRSSTRENORAOREED CN ee Sy 


Len Lindsay. Reston, 1984, xii + 467 pp, (P). [ISBN: 0-8359-0784-8] COMAL is a language designed to 
teach the elementary principles of computer programming. It is intended to be the “ideal" first 
language, by incorporating the best facilities of BASIC, Pascal, and LOGO. The language actually 
exists and can be run on a number of microcomputers, including the COMMODORE-64. MS 


Computer Programming, T(13: 1), S, L. Graded Problems in Computer Science. Andrew D. McGettrick, 
Peter D. Smith. Addison-Wesley, 1983, vi + 314 pp, $14.95 (P). [ISBN: 0-201-13787-9] Language- 
independent text for first programming course. Covers programming constructs, good programming 
practice, some algorithm design strategies. Examples in readable pseudocode. Extensive, interest- 
ing problem sets range from easy to challenging. Majority of problems is numerical; some string 
processing. KS 


Computer Programming, 1T(13: 1). Problem Solving and Structured Programming in BASIC. Elliot B. 
Koffman, Frank L. Friedman. Addison-Wesley, 1979, xvi + 444 pp, $21.95 (P). [ISBN: 0-201-03888-9] 
Text for introductory programming course using BASIC. Describes minimal BASIC, BASIC-PLUS and Dart- 
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mouth BASIC. Emphasizes designing algorithms using stepwise refinement. Most sample problems 
include data table, flow diagram and computer program. Uses if-then-else and while loops. Chapters 
on string processing, matrix commands, files. KS 


Computer Programming, T(13: 1). This is BASIC: An Introduction to Computer Programming. Robert F. 
Sutherland. Macmillan, 1984, xv + 447 pp, $18.95 (P). [ISBN: 0-02-418370-9] Leisurely introduction 
to programming using minimal BASIC language (e.g., FOR-NEXT introduced in Chapter 12). Uses 
flowcharts. Some comments on good programming practice and debugging procedures. Numerous examples 
and exercises; most require no more than elementary algebra. Appendices on files and structured 


programming. KS 


Computer Programming, S(13). Applied Basic for Microcomputers. Roy A. Boggs. Reston, 1984, x + 277 
pp, $16.95 (P). [ISBN: 0-8359-0042-8] This book is a good resource for elementary file manipulation. 
Contains primarily business applications. Designed for use with Apples, TRS-80°s, and IBM-PC’s. 
Attempts to control for differences in each specific version of BASIC with varying degrees of suc- 
cess. TR 


VAX-11. James F. Peters, III, Hamed M. Sallam. Reston, 1985, xviii + 684 pp, (P). [ISBN: 0-8359- 
0549-7] Focus is on business computing (especially accounting applications) and BASIC, with the 
intent of teaching both the language and how to use it to solve business problems. Many interest- 
ing, complete sample programs, with runs, but skimpy on exercises. RM 


Computer Programming, T(13: 1). PASCAL. Charles H. Goldberg, Walter S. Brainerd, Jonathan L. 
Gross. Boyd & Fraser, 1984, xii + 465 pp, $22.95 (P). [ISBN: 0-87835-140-X] Text for first com- 
puter science course. Emphasizes top-down programming. Comments on style, testing, debugging, and 
common errors throughout. Introduces simple data structures; recursion, searching and sorting. 
Includes examples of simulation, graphics, computer-assisted instruction. No previous computing 
experience or college-level mathematics assumed. KS 


Software Systems, L. Inside CP/M-86: A Guide for Users. David E. Cortesi. Holt, Rinehart & Wins- 
xvi + 261 pp, $18.45 (P) [ISBN: 0-03-070671-8]; Inside Concurrent CP/M: A Guide for Users, 1984, 
xvii + 247 pp, $18.45 (P). [IESBN: 0-03~070669-6] These three books present a relaxed, even breezy, 
view of various branches of the CP/M operating system. The presentations are quite complete from 
the user’s point of view. However, a person wanting deeper system information (e.g., disk directory 
structure) will find nothing. JAS 


Software Systems, P. Lecture Notes in Computer Science-174: EUROSAM 84. Ed: John Fitch. Springer- 
Verlag, 1984, xi + 396 pp, $18 (P). [ISBN: 0-387-13350-X] Proceedings of the third of the interna- 
tional computer algebra conferences held in Europe every 5 years. The 37 papers span topics ranging 
from languages and algorithms for simplification, integration, factorization, number theory and com- 
putational group theory, to applications including differential equations, algebraic number computa~ 
tions and Groebner bases. RB 


Software Systems, L. The CP/M Plus Handbook. Alan R. Miller. Sybex, 1984, xii + 248 pp, $15.95 
(P). [ISBN: 0-89588-158-6] A straight-forward tutorial introduction and reference to CP/M-3, also 
known as CP/M Plus. The book is primarily aimed at the absolute novice though it does have a brief 


chapter on the directory structure and BDOS (system) calls. JAS 


Software Systems, P. Approaches to Prototyping. Ed: R. Budde, et al. Springer-Verlag, 1984, xi + 
458 pp, $23 (P). [ISBN: 0-387-13490-5] Edited papers from a working conference on "user-oriented 
software construction" held in Namur in West Germany in October 1983. JAS 


Computer Science, P. Lecture Notes in Computer Science-169: Feedback Shift Registers. Christian 
Ronse. Springer-Verlag, 1984, 145 pp, $9 (P). [ISBN: 0-387-13330-5] Develops a general algebraic 
theory of feedback shift registers or “clocked switching circuits" over a finite field, using alge- 
bras of operators on sequences over the field, in which the identity, the shift, and the constants 
play a central role. Randomness properties are investigated. RM 


Computer Science, T(16-17: 1), P. Distributed Computing. Ed: Fred B. Chambers, David A. Duce, Gil- 
lian P. Jones. Academic Pr, 1984, xii + 327 pp, $22.50. [ISBN: 0-12-167350-2] This text describes 
research activities in the area of distributed, or parallel, computing architectures. The new non- 
von Neumann architectures described in this text include the dataflow model, and high speed local 
networks. The text also describes research work in the area of programming languages for parallel 
systems (e.g., PROLOG), and distributed operating systems. MS 


Computer Science, P. Dataflow Computation. A.P.W. B&éhm. CWI Tract, No. 6. Math Centrum, 1983, iii 
+ 208 pp, Dfl. 29,80 (P). [ISBN: 90-6196-272-2] A study of dataflow computation (a particular paral- 
lel processing architecture) focusing on issues of computational complexity and program correctness. 
AO 


Computer Science, T(15-17: 1, 2), $, L. Computability, Complexity, and Languages: Fundamentals of 
Theoretical Computer Science. Martin D. Davis, Elaine J. Weyuker. Comp. Sci. & Appl. Math. 
Academic Pr, 1983, xix + 425 pp, $35. [ISBN: 0-12-206380-5] Readable text covering elementary recur- 
sion theory, formal languages and automata, propositional and predicate logic, computational com- 
plexity, classification of unsolvable problems. Assumes substantial programming experience. Flexi- 
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ble organization makes this suitable for many courses. Short exercise sets at end of most sections. 
KS 


Computer Science, P, L*. Advances in Computers, Volume 23. Ed: Marshall C. Yovits. Academic Pr, 
1984, xiii + 377 pp, $48. [ISBN: 0-12-012123-9] Seven expository papers on supercomputers and VLSI, 
information and computation, videogames, decision support systems, digital control systems, informa- 
tion privacy, and parallel sorting algorithms. LAS 


Computer Science, P. Lecture Notes in Computer Science-176: Mathematical Foundations of Computer 
Science 1984. Ed: M.P. Chytil, V. Koubek. Springer-Verlag, 1984, xi + 581 pp, $25 (P). [ISBN: 0- 
387-13372-0] Papers contributed to the 11th Symposium on Mathematical Foundations of Computer Sci- 
ence in Prague, September 1984. Emphasis on complexity and computability questions related to 
logic, formal languages, and automata. RM 


Computer Science, P. Information Technology and the Computer Network. Ed: Kenneth G. Beauchamp. 
NATO ASI Ser. F: Computer & Systems Sci. Springer-Verlag, 1984, viii + 271 pp, $34.50. [ISBN: 0- 
387-12883-2] Collection of papers presented at the NATO Advanced Study Institute on Information 
Technology and the Computer Network, 1983. Intent is to provide a study of the current achievements 


in network technology. JRG 


Control Theory, $(16-18), P*. Control and Dynamic Systems, Volume 19: Nonlinear and Kalman Filtering 
Techniques, Part 1 of 3. Ed: C.T. Leondes. Academic Pr, 1983, xvii + 373 pp, $37.50. [ISBN: 0-12- 
012719-9] Five papers on nonlinear dynamic systems, applications of Bayesian estimation techniques, 
and computational techniques. AO 


Control Theory, P. Control and Dynamic Systems: Advances in Theory and Applications, V. 20: Non- 
linear and Kalman Filtering Techniques, Part 2 of 3. Ed: C.T. Leondes. Academic Pr, 1983, xix + 
410 pp, $44. [ISBN: 0-12-012720-2] Seven papers make up the second volume in a trilogy on nonlinear 
and Kalman filters. LAS 


Control Theory, T(17: 1), P. Oscillation Theory of Optimal Processes. George M. Smirnov. Wiley, 
1984, x + 154 pp, $29.95. [ISBN: 0-471-89399-4] The set of all control functions that satisfy the 
Pontryagin maximum principle is classified on the basis of an analysis of oscillatory properties. 
Growing out of results first obtained in 1971-1977, this book includes additional material proved in 
1980-1983. AWR 


Control Theory, P. Lie Groups: History, Frontiers and Applications, Series B: Systems Information 
and Control, Volume II. Ed: L.R. Hunt, C.F. Martin. Math Sci Pr, 1984, vi + 450 pp, $50. [ISBN: 
0-915692-37-6] Proceedings of the Berkeley-Ames Conference on nonlinear problems in control and 
fluid dynamics held in Berkeley, May 1983. JAS 


Applications, P*. Ocean Tides: Mathematical Models and Numerical Experiments. G.I. Marchuk, B.A. 
Kagan. Transl: E.V. Blinova, L. Ya. Yusina. Pergamon Pr, 1984, xii + 292 pp, $65. [ISBN: 0-08- 
026236-8] Account of recént (1960’s and 1970’s) work by and views of Russian authorities on tides. 
Much material not hitherto available in English. For researchers in oceanography and solid earth 


geophysics. JK 


Applications, P. Lecture Notes in Computer Science-168: Methods and Tools for Computer Integrated 
Manufacturing. Ed: U. Rembold, R. Dillmann. Springer-Verlag, 1984, xvi + 528 pp, $20.80 (P). 
[ISBN: 0-387-12926-X] Lectures from a September 1983 advanced course held at the University of 


Karlsrithe. LAS 


Applications (Astronomy), L. Cosmic Discovery: The Search, Scope, and Heritage of Astronomy. Martin 
Harwit. MIT Pr, 1984, xi + 334 pp, $9.95 (P). [ISBN: 0-262-58068-3] With respect to the discipline 
of astronomy, the author addresses the questions: "What do we know now?", "What was the process of 
discovery?", and "How should we plan for further discovery?" Interesting and lucid, but rather 
sophisticated. JRG 


Applications (Biology), P. Lecture Notes in Biomathematics~54: Mathematical Ecology. Ed: S.A. 
Levin, T.G. Hallam. Springer-Verlag, 1984, xii + 513 pp, $24.50 (P). [ISBN: 0-387-12919-7] Proceed- 
ings of the research part of a December 1982 conference held at the. International Center for 
Theoretical Physics in Miramare-Trieste, Italy. Papers deal with autoecology, population biology, 
ecosystem theory, diffusion models, and applications to fisheries and epidemiology. (Expository 
lectures from the first part of the conference are scheduled to be published separately as a text- 
book.) LAS 


Applications (Chemistry), P. Lecture Notes in Mathematics-1063: Orienting Polymers. Ed: J.L. Erick- 
sen. Springer-Verlag, 1984, vii + 166 pp, $9.50 (P). [ISBN: 0-387-13340-2] Proceedings of a 
workshop held March 21-26, 1983 at the University of Minnesota. These papers provide an introduc~ 


tion to the problems of interest and a survey what is known about them. AO 


Beckmann. Ed: H. Hauptmann, W. Krelle, K.C. Mosler. Springer-Verlag, 1984, xi + 378 pp, $28.50. 
[ISBN: 0-387-13652-5] A hard-to-classify range of papers running from air pollution reductions in 
Munich to a discussion of whether the ellipsoid method can ever be an efficient way to solve linear 
programming methods. AWR 
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Decision Analysis. Ed: M. Graver, A.P. Wierzbicki. Springer-Verlag, 1984, viii + 268 pp, $15.50 
(P). [ISBN: 0-387-13354-2] Proceedings of an international workshop on interactive decision analysis 
and interpretative computer intelligence held in Laxenburg, Austria, September 20-23, 1983. JAS 


Applications (Kconomics), P*. Lecture Notes in Economics and Mathematical Systems-230: Macro- 
Economic Planning with Conflicting Goals. Ed: M. Despontin, P. Nijkamp, J. Spronk. Springer- 
Verlag, 1984, vi + 297 pp, $17.50 (P). [ISBN: 0-387-13367-4] The principal novelty lies in quite new 
approaches to solve macro-economic problems mathematically, rather than in new mathematics. Util- 
izes mainly game theory, mathematical programming, and fuzzy sets in order to rank the policy mak- 
ers” preferences, resolve conflicts between competing goals, and compare outcomes. Reports genuine 
case studies, for instance interactions between European countries, the Tasmanian economy, and water 


treatment on San Francisco Bay. YN 
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tal. Ed: Ginter Bamberg, Klaus Spremann. Springer-Verlag, 1984, vi + 306 pp, $18 (P). [ISBN: 0- 
387-12923-5] Proceedings of the second international workshop on risk and capital held at: the 
University of Ulm in June 1983. LAS 


Applications (Economics), P. Economics Consequences of Population Change in Industrialized Coun- 
tries. Ed: Gunter Steinmann. Stud. in Contemp. Econ., V. 8. Springer-Verlag, 1984, x + 415 pp, 
$23.50 (P). [ISBN: 0-387-13513-8] Papers from a June 1983 conference held at the University of 
Paderborn assessing with mathematical and economic models the impact of low fertility rates. All 
papers are in English, by German authors. LAS 


Applications (Engineering), P. Problems of Randomness in Communication Engineering. Ed: Kenneth W. 
Cattermole, John J. O’Reilly. Math. Topics in Telecommunications, V. 2. Wiley, 1984, 342 pp, 
$32.50. [ISBN: 0~471-80763-X] A tutorial introduction to the mathematics of stochastic systems and 


processes together with applications of these ideas in communications theory. AO 


Applications (Engineering), T(17-18: 1), P. Elements of Digital Satellite Communication, Volume I. 
William W. Wu. Computer Sci Pr, 1984, xv + 607 pp, $44.95. [ISBN: 0-914894-39-0] An introduction to 
the design, planning, and operation of digital satellite communication systems. AO 


Applications (Engineering), T(16-17: 1), S. The Finite Element Method Displayed. Gouri Dhatt, Gil- 
bert Touzot. Wiley, 1984, xv + 509 pp, $39.95. [ISBN: 0-471-90110-5] Very attractive text for 
engineering students. Many, many examples. FORTRAN programs. Main topics: approximation of the 
unknowns, integral formulation, matrix formulation, numerical methods, programming techniques. 
Translated from French by Gilles Cantin. DA 


Applications (Engineering), P. Numerical Methods in Coupled Systems. Ed: R.W. Lewis, P. Bettess, E. 
Hinton. Ser. in Num. Methods in Engin. Wiley, 1984, xiv + 618 pp, $74.95. [ISBN: 0-471-90122-9] 
This volume grew out of the International Conference on Numerical Methods for Coupled Problems, 
Swansea, Wales, held September 1, 1981 in honor of 0.C. Zienkiewicz in his 60th year. JAS 


Applications (Engineering), T(18: 1), P. Variational Inequalities and Flow in Porous Media. M. Chi- 
pot. Appl. Math. Sci., No. 52. Springer-Verlag, 1984, vii + 118 pp, $16 (P). [ISBN: 0-387-96002-3] 
Contents of one-semester graduate course given in 1981-82 at Brown. Two-chapter introduction to the 
variational inequality and obstacle problems. Chapter 3 is on regularity theory. Chapter 4 is on 
the dam problem. No exercises or problems. References. JK 
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Models of Fluctuating Growth. Ed: R.M. Goodwin, M. Kriiger, A. Vercelli. Springer-Verlag, 1984, 
xvii + 277 pp, $17.50 (P). [ISBN: 0~387-13349-6] Revised papers from an international symposium held 
in Siena, Italy, March 24-27, 1983. JAS 


Applications (Management), P. Mathematical Methods in Management. Geoffrey Gregory. Handbook of 
Applicable Math. Wiley, 1983, xii + 154 pp, $31.95. [ISBN: 0-471-90131-8] A volume in a series 
addressed to professional people who discover that their career demands more mathematics than their 
academic training provided. It is designed to accompany the six core volumes covering basic 


mathematical concepts. AWR 


Applications (Medicine), $(15-16), L. Mathematical Methods in Medicine. Richard Bellman. World 
Scientific (US Dist: Heyden & Sons), 1983, xiv + 252 pp, $18 (P); $33. [ISBN: 9971-950-21-9; 9971- 
950-20-0] Mathematical modelling of physiological phenomena and the attendant analytic difficulties, 
using methods from differential equations, control theory and dynamic programming. Bibliography. 
JIRG 


Applications (Medicine), P. Applications of Optimal Control Theory in Biomedicine. George W. Swan. 
Pure & Appl. Math., No. 81. Dekker, 1984, xi + 285 pp, $55. [ISBN: 0-8247-7192-3] Two chapters on 
mathematical background, followed by self-contained chapters on diabetes, thyroid disease, adminis~ 
tration of drugs, cancer therapy, and the circulatory systems. AWR 
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Equation. Ed: C. Cercignani. Springer-Verlag, 1984, vii + 243 pp, $13 (P). [ISBN: 0-387-12899-9] 
Lectures from the Centro Internazionale Matematico Estivo held in Montecatini, Italy, June 10-18, 
1981. TAS 
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to the Quantum Theory of Irreversible Processes. Ed: L. Accardi, A. Frigerio, V. Gorini. 
Springer-Verlag, 1984, vi + 411 pp, $19 (P). [ISBN: 0-387-12915-4] Proceedings of a September 1982 
workshop held at Villa Mondragone, Italy. LAS 


Applications (Physics), S(18), P. Advances in Nonlinear Waves, V. 1. Ed: Lokenath Debnath. 


Research Notes in Math., V. 95. Pitman, 1984, 328 pp, $25.95 (P). {[ISBN: 0-273-08609-x] A diverse 
collection of highly technical papers. MU 


Applications (Physics), S(16), P, L. Concepts of Particle Physics, V. I. Kurt Gottfried, Victor F. 
Weisskopf. Oxford U Pr, 1984, xv + 189 pp, $22.50. [ISBN: 0-19-503392-2] This, the first of two 
volumes, is a descriptive survey of modern particle physics, virtually devoid of mathematics. MU 


Applications (Physics), P. Monopoles in Quantum Field Theory. Ed: N.S. Craigie, P. Goddard, W. 
Nahm. World Scientific (US Dist: Heyden & Son), 1982, xxi + 440 pp, $21 (P); $49. [ISBN: 9971-950- 
29-4; 9971-950-28-6] Proceedings of the Monopole Meeting held in Trieste, Italy, December 1981. JAS 


Applications (Physics), P. Lecture Notes in Physics-195: Trends and Applications of Pure Mathematics 
to Mechanics. Ed: P.G. Ciarlet, M. Roseau. Springer-Verlag, 1984, 422 pp, $20.50 (P). [ISBN: 0- 
387-12916-2] Proceedings of a symposium (fifth in a series) held in Palaiseau, France from November 
28 to December 2, 1983. LAS 


Applications (Physics), T(16-17: 1, 2), S, L. Mathematical Methods with Applications to Problems in 
the Physical Sciences. Ted Clay Bradbury. Wiley, 1984, ix + 702 pp, $37.95. [ISBN: 0-471-88639-4] 
Primarily for majors in physics. Prerequisites include calculus, differential equations and basic 
physics. Classical mechanics, electromagnetism and quantum mechanics are developed along with the 
mathematics. Emphasis on linear algebra. Good chapters on vectors and tensors and on curvilinear 
coordinates. Nice treatment of computer-oriented numerical methods. Wide coverage is sacrificed 
for completeness. Numerous problems-—routine and otherwise. JK 


Applications (Physics), P*. Applications of the Monte Carlo Method in Statistical Physics. Ed: K. 
Binder. Topics in Current Physics, V. 36. Springer-Verlag, 1984, xiv + 311 pp, $32. [ISBN: 0-387- 
12764-X] Nine papers that survey recent developments in the use of Monte Carlo techniques in sta- 


tistical physics. AQ 


Applications (Physics), P*, L? Sound Propagation in Stratified Fluids. Calvin H. Wilcox. Appl. 
Math. Sci., No. 50. Springer-Verlag, 1984, ix + 198 pp, $19.80 (P). [ISBN: 0-387-90986-9] Presents 
a theory of the propagation of transient sound waves in fluids whose densities and sound speeds are 
functions of the depth. Highly mathematical with unusual clarity. The numerous proofs are care- 
fully presented. MU 


Applications (Psychology), P, L. Computation and Cognition: Toward a Foundation for Cognitive Sci- 
ence. Zenon W. Pylyshyn. MIT Pr, 1984, xxiii + 292 pp, $25. [ISBN: 0-262-16098-6] A systematic 
analysis of the science of “informavores"--consumers and processors of information--that is to the 
foundations of cognitive science "what a hole in the ground is to the foundation of a house.” 
Pylyshyn-~-a psychologist and computer scientist-~argues that cognition is a species of computation, 
that the semantic contents of mental states are encoded in the same general way as Computer 
representations are encoded. LAS 


Applications (Psychology), T(16), S, P. Mathematical Models of Attitude Change, Volume 1: Change in 
Single Attitudes and Cognitive Structure. John E. Hunter, Jeffrey E. Danes, Stanley H. Cohen. 
Human Commun. Res. Ser. Academic Pr, 1984, xv + 339 pp, $59. [ISBN: 0-12-361901-7] Translates 
existing verbal models of communication and information processing into mathematical models; sug- 
gests alternate models, and develops new theories of changes in attitude toward the source of infor- 
mation. Reports statistical analyses of field studies to fit the models to the data, often a diffi- 
cult task. Assumes only a familiarity with calculus and matrix algebra, but then uses regressions 
to compute multivariate correlation matrices, and swiftly shoves the particulars of a linear system 
of difference equations into a quoted comphter program (Euler’s method?). A book for gifted stu- 
dents in psychology, communication, marketing, and related fields. YN 
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ON GELBAUM’S ALGORITHM FOR COMPUTING 
THE MINIMAL POLYNOMIAL OF A MATRIX 


RONALD J. LEACH 
Department of Mathematics, Howard University, Washington, DC 20059 


B. Gelbaum [1] recently gave an algorithm for computing the minimal polynomial of an n X n 
matrix A by embedding the matrix in row-major form in C n’ and then applying the Gram-Schmidt 
process. The purpose of this note is to compare the complexity of his algorithm with existing 
algorithms. 

We use Gelbaum’s notation as follows. Let ¢(A) denote the embedding of A in Cc”, by 
spreading out the n-rows into one long row, and let 


Oy = o(1)/Vn, 


P 
Xp +1 = o( A?**) » (6(A?**),O,)O,, 
k=0 


On 41 _ X,41/|| Xp +ills 


be the vectors in C” determined by the Gram-Schmidt process. The minimal polynomial 
Ay + AA + +:: +A,” has A, = (6(A”),O,). The degree m is the first integer for which 
X,, = 0. To compute the inner product of two n?-vectors we may use n? multiplications and 
(n* — 1) additions. Note that all operations are on complex numbers. As is common in arithmetic 
complexity theory (see for example S. Winograd [3]), we ignore the number of additions. To 
compute (¢(A?*"), O, )O, we must again compute n* multiplications of numbers. Computation 
of the norm of an n?-vector requires n* additions and the taking of a square root, which will be 
done by Newton’s method and whose complexity is independent of n. Finally there is a division 
step. Hence the multiplicative complexity of the minimal polynomial is bounded above by: 


[ n2(multiplications for each inner product) | 
+ | n?(multiplications of the entries O, by the inner product)| 
x [ n(number of terms in worst case) | 
y ,{ number of squarings to compute norm + 
number of divisions of n?-vector by norm 
x C(constant for computing square roots) 
= O(n?) 
with a constant essentially equal to twice that for computing square roots. 

There are a number of methods for reducing the number of multiplication steps in computing 
the inner product. The best results, due to Winograd, reduce the number of multiplications to 
approximately 3n*/4 at the expense of increasing the number of additions and complexity of the 
program. Since we are only considering orders of magnitude, these refinements will not be used. 

The traditional method [2, p. 286] requires computing the determinant of D = AJ — A and 


dividing by the greatest common divisor of the (n — 1) minors of D. The classical computation of 
a determinant by row reduction requires n comparisons in the first column with (n — 1) 
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multiplications of (n column) rows by scalars for a total of n* — n multiplications. After the 

n* — n subtractions, we consider the second column which has (n — 1) multiplications of (n — 2 

column) rows by scalars or (n — 1)(n — 2). (It is sufficient to transform A to triangular form.) 
In general we have 


n(n —1) +(n-1)(n- 2) + +++ +2(1) 
= 4n° + lower order terms 


for the number of multiplications to get D in triangular form. There will then be n multiplica- 
tions (of diagonal entries) to compute det(D), requiring O(n*) multiplications. 

As before, we do not consider the more sophisticated algorithms for computing determinants. 
We remark that there is an algorithm of V. Strassen that uses divide-and-conquer to reduce the 
number of multiplications but with a similar order of magnitude. 

A similar situation holds for the computation of each of the (n — 1)-minors of D. Since there 
are n — 1 such minors, the multiplicative complexity is (n — 1)O(n*) = O(n?) with constant no 
more than 1/3. We compute the gcd of the polynomial which are the (n — 1)-minors of D using 
the Euclidean algorithm on the first two, then on their gcd and the third, etc. An upper bound for 
the number of multiplication /division steps when dividing two polynomials of degree n — 1 is n 
divisions and O(n”) multiplications. Since there are n — 1 polynomials, there are O(n*) such 
computations. 

Thus the classical method requires 


O(n*) + O(n?) + O(n*) = O(n?) 


multiplication /division steps with the coefficient of n° at most 1. This is theoretically of the same 
order as Gelbaum’s algorithm, but with a smaller constant. However, the analysis of Gelbaum’s 
algorithm assumed a worst case of degree n. If for example the minimal polynomial is linear, then 
Gelbaum’s algorithm takes O(n*) steps while the classical one requires O(n°) steps. Exactly the 
opposite situation occurs if the minimal polynomial is the characteristic polynomial. 

There is one final factor to be considered. With the rapid increase in computing power and 
decrease in cost, a major factor in cost is programmer time. We invite the interested reader to 
design and program an algorithm for the traditional procedure. 
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CONTINUOUS FUNCTIONS WITH A DENSE SET OF PROPER LOCAL MAXIMA 


\ VLADIMIR DROBOT 
Department of Mathematics, University of Santa Clara, Santa Clara, CA 95053 


MICHAL MORAYNE 
Instytut Matematyczny, Uniwersytet Wroclawski, Wroclaw, Poland 


We say that a real valued function f: [0,1] — R has a proper local maximum at x € [0,1] if 
there exists e > 0 such that f(y) < f(x) for all y € [0,1] satisfying 0 < |y — x| < e. 

The sets of such proper local maxima have attracted some attention; see the references. For 
any function f, the set of its proper local maxima is known to be at most countable. A strong 
converse is due to Z. Zalcwasser [8] and states that for any two disjoint countable sets 
A,B Cc [0,1], there exists a differentiable function f: [0,1] R such that A and B are, 
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respectively, exactly the sets of its proper maxima and minima. For a new proof of this theorem 
see [3]. In a recent paper [5] in this MONTHLY, E. E. Posey and J. E. Vaughan constructed an 
explicit example of a continuous function which assumes a proper local maximum at each point of 
a dense set. The construction is very reminiscent of a construction of a continuous nowhere 
differentiable function. This analogy goes further. As in the case of nowhere differentiable 
functions (see [4], p. 45), the existence of a continuous function with a dense set of proper local 
maxima can also be proved by the Baire category method. More precisely, let C[0,1] denote the 
space of all continuous real valued functions defined on [0,1] with the metric 


d( f,g) = sup{|f(x) — g(x)|: x €[0,1]} 
(this is a complete metric space) and let D be the subset of C[0,1] consisting of all those functions 
whose set of proper local maxima is dense in [0, 1]. 

Recall that a subset of a metric space is said to be of first category if it is the union of a 
countable collection of sets each of which has a closure with empty interior. A subset of a metric 
space is called residual if its complement is of first category. The Baire Category Theorem states 
that in a complete metric space the empty set is not residual. In this note we give a straightforward 
proof that D is residual in C[0,1], thus showing that D is nonempty. Our proof is certainly 
simple enough to be used as an exercise in any course which discusses the Baire Category 
Theorem. The result itself is not really new; for example it can be derived from Theorem 2.1, 
p. 212 and Lemma 3.1, p. 215 of [1]. We will also show that D is a Borel subset of C[0, 1]. 

We introduce some notations. The letter J will always denote a closed subinterval of [0,1], 7° 
will denote the interior of J, and |J| its length. A rational interval is an interval with rational 
endpoints. Given any interval J c [0,1] define 


A(I) = { f € C[0,1]: there is x € 7° such that f(x) > f(y) forall ye I\{x}}. 
Let J c J° and put 
B(I,J) = { f € C[0,1]: sup{ f(x): x EJ} > sup{ f(x): x ET\I°}}. 


THEOREM. The set D is a residual Borel subset of C[0, 1}. 


Proof. We first show that for every interval J, the set A(J) is dense in C[0,1]. Let f © C[0,1] 
and let e > 0. Let z = sup{ f(y): y € J}. Choose a nontrivial open interval J C J such that 
f(x) > z — « for every x € J. Now define g € C[0,1] as follows: g equals z + e at the midpoint 
of J, g equals f at both endpoints of J; on J the graph of g consists of two straight line segments 
connecting the three points already defined; on I\ J, g equals f. Now it is clear that g © A(J) 
and d(f,g) < 2. Thus A(/) is dense in C[0,1], as desired. 

Next we show that A(J) is a Gs set. For each J c I° C [0,1], the set B(I,/) is open. Indeed if 
f © BU, J), then the open ball with center f and radius 


1(sup{ f(x): xeJ}— sup{ f(x): xe T\J°}) 


is entirely contained in B(J, J). We shall show for each closed interval J ¢ [0,1] that 


ie. @) 

(1) A(1) = 1) 

Suppose then that f belongs to the nght hand side of (1). This means that for each i = 1,2,3,..., 

there is a closed interval J; such that f © B(J, J.) and |J,| < 1/i. From the definition of B(J, J) it 

is clear that (\72,J; contains all points where f attains its maximum on J. Since |J,| > 0, 1% J; 

consists of a single point x. Because each J, C I°, we see that x € J° and that f attains its 
maximum on 7 at x and only at x. Thus f € A(J). The opposite inclusion is obvious. 
Thus the set A(J) is a dense G, subset of C[0,1], so it is residual. Therefore the set 


{ UB(I, J): J is a closed interval, Jc?r, yi< >}. 


X= N{ A(T): 1c [0,1], 7 rational} 
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is also residual. It is also quite clear that X C D so that D is residual too. 
To see that D is a Borel set it is enough to notice that 


D= () U{A(I): Ic T, I rational}. 
TC{0, 1], T rational 


That finishes the proof. 


It is perhaps worth mentioning that it we define a set E to consist of all functions with a dense 
set of proper minima, then E is also residual by a similar argument. Thus the set DN E 
consisting of all functions which have their sets of proper minima and maxima dense in [0,1] is 
again residual and thus nonempty. To construct a specific example of a function f with such 
property along the lines of [5] would probably be quite complicated. 


We are indebted to Professor K. M. Garg for helpful discussions concerning this paper. 
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THE DIAMETER OF A GRAPH AND ITS COMPLEMENT 


FRANK HARARY 
Department of Mathematics, University of Michigan, Ann Arbor, MI 48109 


ROBERT W. ROBINSON 
Department of Mathematics, Southern Illinois University, Carbondale, IL 62901 


Dedicated to Gerhard Ringel and Horst Sachs 


The most active mathematician in the world [1] is well known for his personalized, picturesque 
terminology for kinship relations. He has also proposed what might be called “The Divine Book 
of Proofs”, thereby extending significantly the scope of the Platonist philosophy of mathematics. 
In this apocryphal book all theorems which can ever be discovered are included, together with the 
best possible, most elegant proof of each. We offer a candidate for such a proof of the result, 
independently discovered by Ringel [3] and Sachs [4], that every nontrivial self-complementary 
graph has diameter 2 or 3. This will follow as an immediate corollary of an easy theorem on the 
diameter of a graph and its complement. We follow the notation of [2], where definitions not 
included here may be found. 

A few concepts about graphs are needed. The complete graph K, has p points with every two 
points adjacent (joined by a line). Let G be a graph having the same points as K,. By definition, 
the complement G of graph G also has this point set and contains just those lines of K, which are 


212 MARY L. BOAS AND R. P. BOAS [March 


not in G. Then G is self-complementary when G and G are isomorphic. The distance d,(u, v) is 
the minimum length of a u-v path in G. The diameter d(G) is the maximum distance which occurs 
in G, with the convention that d(G) = oo when G is disconnected. 


THEOREM. If d(G) > 3, then d(G) < 3. 


Proof. As the diameter d(G) > 3, there are two points uw and v of G at distance d,(u, v) > 3. 
Then in the complement, dag(u, v) = 1 at once. Now let x and y be any two points of G. By the 
hypothesis u and v cannot both be adjacent in G to x, so that da(u, x) = 1 or da(v, x) = 1, say 
the former (without loss of generality). Similarly ds(u, y) = 1 or dg(v, y) = 1. Thus in G there is 
a path xuy or xuvy, and dca(x, y) < 3 as required. 


COROLLARY. Every nontrivial self-complementary graph G has diameter d(G) = 2 or 3. 


Proof. Every nontrivial graph G has diameter d > 1, with equality only if G is complete. If G 
is also self-complementary, then it cannot be complete, so d(G) > 2. As the diameter is a 
graphical invariant, d(G) = d(G). Now there are two possibilities: either d(G) = 2 or d(G) > 3. 
In the latter case, the theorem guarantees that d(G) < 3 and hence that d(G) = 3. 


Note that the two self-complementary graphs with five points, the pentagon C, and the graph 
shaped like the letter A, have diameter 2 and 3, respectively. 
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THE TEACHING OF MATHEMATICS 
EDITED BY MARY R. WARDROP AND ROBERT F. WARDROP 


Material for this department should be sent to Professor Robert F. Wardrop, Department of Mathematics, Central 
Michigan University, Mount Pleasant, MI 48859. 


SIMPLIFICATION OF SOME CONTOUR INTEGRATIONS 


Mary L. Boas 
Department of Physics, DePaul University, Chicago, IL 60614 


' 
R. P. Boas 
Department of Mathematics, Northwestern University, Evanston, IL 60201 


In introductory courses on complex analysis we learn to evaluate interesting real integrals by 
integrating around suitable contours. Common examples are {Sx 'sinxdx, for which we 
integrate z~‘e’” around a semicircle (center at 0, diameter along the real axis); or the Fresnel 
integrals f° sin x* dx and {S°cos x* dx, for which we integrate z~‘/*e’? around a quarter-circle. In 
such problems we end up having to show that 


n/2 . 
lim Rf" 85 d8 = 0, 
R-> © 0 


where A = 1/2 for the Fresnel integrals, and A = 0 for integrals of the form {[~, e’*R(x) dx, 


1985] THE TEACHING OF MATHEMATICS 213 


where R is a rational function with the degree of the denominator greater than the degree of the 
numerator. The integral involved in the limit is not elementary, and has to be estimated by some 
device before we can say that its limit is 0. The naive argument that R*e~ ®°™° — 0 for each 0 
except 0 is inadequate, since it would purport to show equally well that R{7/*e ®*"° dé > 0, 
which is false. Indeed, if this were true it would follow that fe’? dz along |z| = R from 8 = 0 to 
a/2 tends to zero as R > oo. This would lead to [@e'* dx = {e-” dy, although the first integral 
does not converge. 

The usual device is to apply the inequality sin@ > 20/7 in order to replace the integral by one 
that can be evaluated explicitly. Students often find this step difficult because they do not know 
the inequality, and one has to digress in order to prove it. 

An easier device is to break the integral into {j/° + 7/7. The second integral causes no 
difficulty; in the first, since 2cos@ > 1 for 0 < @ < 7/3, we have 


R [7 Ran dd < 2B" e- Rsimlcos 60. 
0 0 
The new integral can be evaluated explicitly and evidently approaches 0 as R > oo. 

Another approach is to give up the traditional use of circular-arc contours. For the Fresnel 
integrals (the more difficult case), take the contour to be the square with vertices at 0, R, R+ ik, 
and iR (initially with a small indentation at 0). The integrals along (R, R + iR) and (iR, R + iR) 
are 


R 1 wpe: R 1 ee; 
/ (R + iy)* eR dy and [ (x + IR) eM FIR) dy, 
0 0 


The sum of their absolute values does not exceed 
R _ R _ 
/ ( R? + y?)° 7? oy dy +f (x? + R?)" D7? oR dy. 
0 0 


Since R* + y? > R? and x* + R? > R? (and A — 1 < 0), the sum does not exceed 


RI [ "o> dy + eR" dx = R*"'(1-—e 8) + Re ® 5 0. 
0 0 


For integrals e'*R(x) dx, the rectangular contour is used in [1] and [3]. 

Still another method [2] is to use the triangular contour with vertices iR, 0, and R. This gives a 
slightly more complicated equation for the part of the integral that approaches 0, but has only one 
integral instead of two to estimate. 
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E. F. ASSMUS, JR. 
Department of Mathematics, Lehigh University, Bethlehem, PA 18015 


Most (and perhaps all) mathematicians involved in the teaching of calculus are asking 
themselves these days about the standard “techniques of integration” chapter: How much should 
be retained now that symbolic antidifferentiation is so readily available on home computers? 
Herbert Wilf described a more general question in his “The disk with the college education” [this 
MONTHLY, 89 (1982) 4-8], and I would like to suggest a general point of view by giving a 
particular answer to the “techniques of integration” question. 

The point of view should, it seems to me, be that the home computer can free the student from 
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tedious drill and leave the teacher free to discuss the important conceptual aspects of integration. I 
take the consensus to be that change of variable, integration by parts, and the use of partial 
fractions must be taught: change of variable because of its theoretical importance and because it 
gives the student a chance to see how a hard problem becomes easy when looked at in the right 
way; integration by parts because of its constant use in both pure and applied mathematics as a 
theoretical tool; and partial fractions because one can show how only two transcendental 
functions (the logarithm and the arctangent) are necessary for integrating all rational functions. 

Having agreed to this, what problems do we discuss to make our points? It seems to me they 
should be conceptual problems that the student can appreciate. I offer one example that I have 
found useful. 

The students know how to compute the area, zr”, and the perimeter, 277, of a circle of radius 
r. The good students may even have wondered why this same mysterious 7 appears in both 
formulae. For those who have not, this question gets their attention. 

Assuming the students know how to use the definite integral to find area and arclength, one 


has 
_ r 2_ 3 _ r r 
A=4f vr x* dx, P ‘(S dx. 


Now, the change of variable, x = ru, gives immediately that 
= (4fovi — x? dx r? 
0 


and 


1 1 
po 2{2f’—— ar} 


So, it is a question of relating 


[vax dx t0 [a 


But the identity 1 = (1 — x”) + x? and integration by parts yield 


f) i ae 2 fie dx. 


We've actually proved the existence of a number, 7 say, with A = wr* and P = 2ar; we can 
calculate its approximate value by numerical integration; and we’ve done so using change of 
variable and integration by parts. Who could ask for anything more? 


142. MISCELLANEA 


Beauty is perfect, and perfection (such is human nature) holds our attention but for a little 
while. The mathematician who after seeing Phédre asked: “Qu’est-ce que ca prouve?’ was not 
such a fool as he has been generally made out. 


—W. Somerset Maugham, Cakes and Ale. 


ANSWER TO PHOTOS ON PAGE 182 


They are the brothers Alfred and Richard Brauer. 
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Let g(x) be a polynomial with positive degree and integer coefficients which is irreducible over 
the rationals. Prove the following: (a) there exists an integer n such that g(7) is not a square; (b) 
for every positive integer r there exists a positive integer n such that p||g(n) for at least r+ 1 


primes p. (Here p||g(n) means p|g(n) but p-/g(n).) 


SOLUTIONS OF ELEMENTARY PROBLEMS 
Permutations and Derangements 


FE 2947 [1982, 334]. Proposed by Edward T. H. Wang, Wilfrid Laurier University, Waterloo, 
Canada. 


Let n and k be integers such that 0 < k < n. Show that 
n—k 
k —k , 

© (FP Ev" Yor 

i=0 i= 
where D,, = m! U7" )(—1)’/r! denotes the derangement number of (1,2,..., m). 

Solution 1 by Michael Woltermann, Washington and Jefferson College. Let S = {1,2,...,n} 

and T = {1,2,...,n—k}. Then (“)D, -, is the number of permutations of S which fix i 


n—-l 


permutations of S which derange JT. But by the principle of inclusion-exclusion, 


elements of S\T and derange the remaining elements. Thus rk (*)D _, is the number of 


=i (- 1)( " ; ‘Non — j)! is the number of permutations of S fixing none of 1,...,” — k, ie., the 
number of permutations of S which derange T. 


Solution II by C. S. Karuppan Chetty, Regional Engineering College, Tiruchirapalli, India. Let 
f(x) = (1 — x) 1e~*. The coefficient of x” in f(x) is D,,/m! and hence D,, = f‘”(0). By the 
Leibniz rule we have 


(1) | ferro) | -5 (*) a, 
Since 


rove Bea (gests 


(1 _ x)" k—-j+l1 
the right-hand side of (1) is also equal to eo (— 1y(" ; ‘on — j)! and the result follows. 


Solution UI by O. P. Lossers, Eindhoven University of Technology, Eindhoven, The Nether- 
lands. Integration by parts yields D,_; = {, (x — 1)""'e-* dx, so 


k ok _ 
» (*) D,., = i » (K\(x —1)" ‘e*dx = i (x - 1)" *xke-* dx. 
=o\! 0 j=0\! 0 

On the other hand, since (n — j)! = fx” Je™* dx, 


Seal Noone Le Ben jens 


jz 
= [1 1)" “xke-* dx, 


as desired. 
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Also solved by M. ASié (Yugoslavia), K. L. Bernstein, J. C. Binz (Switzerland), L. Carlitz, I. Gessel, S. H. Greene, 
V. Hernandez (Spain), J. G. Heuver, T. B. Kraus, C. Levesque and G. Lord (Canada), R. B. Nelson, I. Paasche 
(West Germany), U. Peled (Canada), O. G. Ruehr, J. Schwaiger (West Germany), K. Seyffarth (Canada), University 
of South Alabama Problem Group, J. Suck (West Germany), A. Varecza (Hungary), K. Williamson, P. Y. Wu 
(Republic of China), and the proposer. 


ADVANCED PROBLEMS 


Solutions of these Advanced Problems should be mailed in duplicate to Professor D. H. Mugler, Department 
of Mathematics, University of Santa Clara, Santa Clara, CA 95053, by July 31, 1985. The solver’s full post-office 
address should be on each sheet. 


6491. Proposed by Jon Borwein, Dalhousie University, Canada. 
(a) Show that 


41s 


(b) More generally, show that for each positive integer k 


» kr,{n)e—"” 
4 = ——__ 


» nr(n)eo"” 


n=0 


where r,(7) is the number of distinct representations of n as a sum of k integral squares. 


6492. Proposed by Anatole Beck, University of Wisconsin- Madison. 
For which a is f(x) = fo |t|*|sin(1/t) |?" dt differentiable at x = 0? 


SOLUTIONS OF ADVANCED PROBLEMS 
A Probability Problem 


6433 [1983, 402]. Proposed by Edmund Butler, New Carrollton, MD. 


Let X,, X5,...,X,,4, be a sequence of independent random variables uniformly distributed on 
[0,1]. For any sequence ¢,,€,,...,¢€, Of +1’s let 
P(e, E4>- "99 ey, = Pr{ sign( X; ~~ X,) = €,, Sign( X, ~ X;) = Eo >- ‘. , Sign( X,, ~~ X41) = é, }. 
(a) Show that M, = Max, _.,P,(&, &,..-,&,) 18 assumed when the -e’s alternate in sign. 
(b) Find lim Mi" " 


> © 


Composite solution based on solution by L. E. Mattics, University of South Alabama, Mobile, 
Alabama to part (a); and solution by Alain Tissier, Montfermeil, France to part (b). Let 
E., = (€5 €)5---5&)> & = £1, and let A,,,,(£,,) be the set of rearrangements y,, yy,..., Y.+41 Of 
1,2,...,2 + 1 such that sign(y,,, — y,) =e, i= 1,2,...,n”. Then 


|A,41(E l= |A,41(-£,)| and P,(E,) = |A,.,(£,)|/(n + 1)!. 
Denote ton )| by C,,, when ¢,,€,,...,¢, alternate in sign and set C) = C, = 1. 
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Assume it has been shown that C,,,, > |A,,.,(E,,)| for all sequences E,, with 1 < m <n (the 
initial case is trivial). To calculate the number of elements in A, ,5(E,,,,) UA,42(- £41) = K; 


note that there are at most ("*C_iGar-) rearrangements y,, y>,.--5 Vn4> Of 1,2,...,n2 + 21in 


K with n + 2 in the ith position, and there are precisely that many if ¢,,¢€,,...,¢, alternate in 
sign. Hence 


n+2 
+ 1 
2|An.2( Ens iI< iu ("* 1 


JOG = 2C,42> 
i=1 


and this establishes (a) by induction. 
To establish (b), note that M, = C,,,/(n + 1)! and set 


f(x) =14+ % Mx" 
n=0 
Then the identity above yields 2 f’(x) = f(x)? + 1 so that 
f(x) = secx + tan x. 


It is known that, for |x| < 7/2, 


— lEonl on 5 Bank (yan _ gn) 20-1 
secx = xX (Qn) and tanx = X ny 4 — 4") xert, 
where the £’s are Euler numbers and the B’s are Bernoulli numbers. It is also known that 
Maal 9f 2)" ggg Baal, 2 
(2n)! 7 (2n)! (20)°" 
Since 
M),-2 = (oi — 4") and M),_, = on ; 


it follows that M, ~ 2(2/7)"** and hence that lim, ,,,M}/" = 2/m. 
Also solved by Ira Gessel, Lajos Takacs, and the proposer. 


Ira Gessel pointed out that (a) is an immediate consequence of a result of I. Niven, A combinatorial problem of 
finite sequences, Nieuw Arch. Wisk. (3) 16 (1968), 116-123; and that D. André, Développements de séc x et de 
tang x, C. R. Acad. Sci. Paris, 88 (1879), 965-967, showed that f(x) = tan x + secx. 


A Periodic Sequence 


6439 [1983, 569]. Proposed by Morton Brown, University of Michigan. 


Let {a,,} be a sequence of real numbers satisfying the relation a,,, = |a,|— a,_,. Prove that 
{a,,} is periodic with period 9. 


4 
Solution by James F. Slifker, Florida International University, Miami, Florida. The sequence 
contains some nonpositive terms, for if a, > 0, a,,, > O and a,,, > 0, then a,,, = —a, < 0. 
In fact, the sequence contains two consecutive nonpositive terms, for if a, > 0, a,,, < 0 and 
a,+2 > 0, then a,,5 =4,,, +a, < Oand a,,5 = —a, < 0. 
Suppose therefore that a,, = —a < 0 and a,,,, = —b< 0. Then the segment of the sequence 
starting at a,, and ending at a,,, 49 1S 


—a,—b,a+b,a+2b,b,—-a- b,a,2a+b,a+ b,—-a,—b. 


Since 4,,,9 =4,, and 4,,.49 = 4,41, it follows that the sequence has period 9. Further, since 
Amsa = ~4m+ 1 and 4,45 = —4,,,>, the period cannot be less than 9 except when every a,, = 0. 


Also solved by the proposer and sixty-one others. The problem turned out to be rather elementary. 
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A Rational Function Which Is a Polynomial 


6441 [1983, 569]. Proposed by F. W. Schmidt and R. Simion, Southern Illinois University. 


Let a, < a, < -:: <a, be n distinct positive integers. Show that the rational function 
x8 — xX 
l<i<j<n x! —_— x/ 
is actually a polynomial. 


Solution by R. H. Jeurissen, Mathematics Institute, Nijmegen, The Netherlands. It suffices to 
show that every zero of the denominator is also a zero of the numerator with at least equal 
multiplicity. Clearly 0 is a zero of the numerator with multiplicity a,(n — 1) + a,(n — 2) 
+ +++ +2a,_, + a,_ ,, and this is at most its multiplicity in the denominator since a, > i for 
i= 1,2,...,n —1. Moreover all other zeros of the denominator are roots of unity. For r= 
1,2,...,n — 1, let z, be a primitive rth root of unity. Then z, is a zero of x* — x” if and only if 
a; = a; (mod r). Suppose, for s = 1,2,...,7, that k, of the numbers a,,a,,...,a, are in the 
residue class mod r of s. Then the number of pairs (i, 7) with 1 <i <j <n for which a, = a, 


(mod r) equals 
s(')- 


s=l1 


and so this is the multiplicity of z, as a zero of the numerator. The above expression, as a function 
of k,,k,,...,k,, has minimum value when |k, — k,| < 1 for 1 <s <t<_r, which is precisely 
what happens when a, = i, since then 


k, =[n/r]+1forl<s<n-—r{n/r] and k, =[n/r] otherwise. 


Also solved by C. Bandt & W. Tefera (Ethiopia), Mihaly Bencze (Romania), Aage Bondesen (Denmark), F. W. 
Dodd & L. E. Mattics, Ira Gessel, Robert Gilmer & Warren Nichols, Tadeusz Januszkiewicz & Ryszard Szwarc 
(Poland), O. P. Lossers (Netherlands), J. G. Mauldon, Pei Yuan Wu (Republic of China), D. Richman, Allen J. 
Schwenk, C. Wildhagen (Netherlands), and the proposers. 


REVIEWS 


EDITED BY ALLAN L. EDMONDS AND JOHN H. EWING 
COLLABORATING EDITOR FOR FILMS: SEYMOUR SCHUSTER 


Neyman: From Life. By Constance Reid. Springer-Verlag, New York, 1982. 298 pp. $19.80. 


SAMUEL KOTZ 
Department of Management Science & Statistics, University of Maryland, College Park, MD 20742 


Sir Robert Giffen begins one of the earliest textbooks on the subject, entitled simply Statistics* 
with the following comment: “I propose to write a handbook on Statistics without giving a formal 
definition of the word. There are perhaps too many definitions in existence.” He continues: 


*As indicated on the front page, the volume was written about 1898-1900 and edited by Henry Higgs with the 
assistance of G. U. Yule and published by Macmillan, London, in 1913. G. U. Yule and M. G. Kendall’s classical 
textbook, Introduction to the Theory of Statistics, originally authored by Yule in 1911 and then joined by Kendall in 
the 11th edition in 1937, may be familiar to many older readers of this MONTHLY. 
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“There has been much debate also on the question of whether Statistics is a distinct science or 
not... . It seems to be quite unnecessary to debate whether the whole field of Statistics thus dealt 
with or a portion of it can be treated as a distinct science... .” Giffen explains: ““The main fact 
always is that... Statistics undoubtedly constitutes a branch of knowledge; that in the use of 
them, scientific method is indispensible; and that...the knowledge which is derived from 
Statistics in each study is frequently so separate that a distinct account can be given of each.” 

How relevant are these words, written some 85 years ago, to the debate on the role of statistics 
still evident in scientific circles? The number of definitions has increased quite substantially since 
Giffen’s days. For a comprehensive collection of definitions of statistics, see the article by Walter 
F. Willcox (1935), a prominent American statistician. This list was extended by Nalimov in 1974, 
containing over 100 definitions in Russian, and was retranslated into an English edition in 1981. 
A comprehensive discussion of definitions of statistics is given in T. A. Bancroft’s 1970 Presiden- 
tial Address published in the Journal of the American Statistical Association in 1971. 

Some 25 years after Giffen*, in response to a suggestion by E. R. Hedrick, Chairman of the 
American Mathematical Association’s Dictionary Committee, to submit sample definitions for the 
Dictionary, a prominent mathematician, H. L. Rietz of the State University of lowa,** presented 
a brief paper “to explain the meaning of Statistics and of the associated expressions: statistical 
data, statistical methods, theory of statistics, mathematical statistics, statistical probability and to 
suggest a list of terms and expressions from statistics, probability and insurance that should 
probably be included in the dictionary.” Some quotes from his paper are very illuminating, both 
from the aspect of their substance as well as from an historical perspective. “Statistics is a 
comparatively new word. Its first occurrence in English seems to be in J. F. von Bielfeld’s, The 
Elements of Universal Erudition, translated by W. Hooper, London 1770.” *** “Statistics... gradu- 
ally came to mean an exposition of the attributes of the state by numerical methods... . The word 
next came to denote the figures used in such descriptions... . This use of the word prevails at the 
present time, but the data may refer to the state or to any other subject... .” “The expression 
statistical methods means methods which are suitable for the description and characterization of 
statistical data... . A set of mathematical propositions that relate to statistical methods is often 
called mathematical statistics... . The general problem of mathematical statistics in its ideal form 
is to determine a system of drawings to be carried out with urns of fixed composition, in such a 
way that the results of the sets of drawings lead, with a high degree of probability, to a table of 
values identical with statistical data... .” ““Mathematical statistics is thus one branch of the 
theory of probability... .” “The concept of statistical probability is involved whenever the 
properties of an aggregate are predicted or inferred by observations of a sample taken from the 
aggregate. Many such inferences are drawn by persons unfamiliar with mathematical statistics, 
and there is practically no doubt that many conclusions thus obtained are invalid.” (Even though 
it was written over 60 years ago, how relevant this last sentence is to the current state of affairs in 
quantitative scientific practice.) “Mathematical statistics aims to establish criteria that give 
numerical values to the degrees of confidence to be placed in such inferences.” 

In Current Problems of Mathematical Statistics, a paper delivered at the International Congress 
of Mathematicians in 1954 in Amsterdam (over 30 years after Rietz’s definitions), Jerzy Neyman! 
assesses the status of mathematical statistics in the 50’s: 


* This MONTHLY, vol. 29 (October 1922) 333-337. 

**H, L. Rietz’s seminal works on fundamental aspects of mathematical statistics appeared in, among other 
journals, Ann. Math. Statist. (1931-1939), Bull. Amer. Math. Soc. (1932) and Biometrika (1931-1932). (See Ann. 
Math. Statist., 1944, 15, p. 105, for a complete bibliography.) Rietz is also the author of Carus Mathematical 
Monograph No. 3, Mathematical Statistics. 

*** See, however, G. U. Yule (1905) for a different initial source. 

As pointed out in the book under review: “In 1954, Neyman and Tarski— Berkeley’s “anti-Poles”— were two of 
five mathematicians from the U.S. who were invited to give featured addresses at the International Congress of 
Mathematicians.” 
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In general, the present stage of development of mathematical statistics may be compared with that of 
analysis in the epoch of Weierstrass. During the preceding decades, a very considerable number of special 
problems were solved without particular care for generality and rigor. The workers in the field were in too 
great a hurry to broaden the domain of research to bother with what they considered troublesome details. At 
the present moment, we observe the inevitable reaction. The basic concepts of the theory are being revised, 
the commonly accepted assertions are being examined in order to determine the exact domain of their 
validity and, just as in the times of Weierstrass and immediately thereafter, counterexamples are being 
constructed in order to show the insufficiency of the old proofs.” 


This perceptive assessment of mathematical statistics in 1954 should be a source of great 
satisfaction to a mathematical statistician in 1983. In less than 30 years, we have progressed from 
the era of K. Weirstrass (1815-1897), through the period of the last universalist, H. Poincaré 
(1854-1912), to the times of N. Bourbaki. By now, the rigor and generality of arguments in 
mathematical statistics have beeen well and routinely established; asymptotic results are proved 
under precise and verifiable “regularity conditions.” “Troublesome” mathematical details do not, 
in general, present insurmountable difficulties to modern day mathematical statisticians, who can 
also resort to simulation and computer-oriented methodology. Whether the characterization of 
mathematical statistics as a branch of probability, valid in 1922, is fully justified today is 
questionable. Many branches of modern statistics, such as experimental design, regression 
analysis, sample surveys, classification and grouping techniques (in particular, clustering), multidi- 
mensional scaling and factor analysis, are at best weakly related to probability theory but are 
substantially mathematically oriented and can be appropriately considered as topics in mathe- 
matical statistics. 

This substantial leap forward is primarily due to the efforts and accomplishments of Jerzy 
Neyman, a Russian-born Pole, teaching in London in the late thirties. On August 12, 1938, he got 
off the train at the Southern Pacific Station in Berkeley and for over 40 years, charted and 
substantially dominated the course of the rapid development of mathematical statistics, not only 
at Berkeley, but on the international scene as well. It should be noted that prior to his arrival at 
Berkeley, among other achievements, the indefatigable Jerzy Neyman originated the famous 
Neyman-Pearson Lemma which even today is considered the fundamental tool of hypothesis 
testing despite more recent appearances of competitive approaches. He put forward the theory of 
confidence intervals, one of the most important (albeit controversial) approaches to data analysis, 
contributed substantially to the theory of contagious distributions, which is very useful in 
biological and ecological applications, paved the way for a new dimension of modern sampling 
theory by developing sampling from stratified populations, and pioneered a new approach in the 
field of experimentation by devising a model for randomized experiments. 

In her biography of Jerzy Neyman (1894-1981), who received the National Medal of Science 
for “laying the foundations of modern statistics and devising tests and procedures that have 
become essential parts of the knowledge of every statistician,’ Constance Reid, with relatively few 
technicalities and scientific minutiae, succeeds remarkably well in portraying Neyman’s strong 
and colorful personality and his contributions to statistical methodology and applications. As 
W. H. Kruskal, past President of the American Statistical Association, writes in a letter to the 
author: “Your biography of Jerzy is a delight and a source of instruction; what a tour de force to 
present so clear a picture with so few technical excursions.” The book is based on extensive 
interviews the author conducted with Neyman in the years 1978-1979, as well as personal 
interviews and written communications that the author diligently carried out with persons closely 
associated with Neyman in the U.S. and abroad. Notable among them was Egon Pearson whose 
joint groundbreaking work with Neyman in the thirties propelled and charted the course of 
Neyman’s “complicated but not uninteresting” and highly productive life. 

The skillful author presents a panorama of Neyman’s life and activities for 85 years as she 
relates his birth on April 16, 1894, in Bendery, “the gate of Bessarabia,” of Polish parents, 
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completion of his undergraduate studies in pure mathematics in 1917 in Kharkov in the Ukraine, 
his first job as statistician at the Polish National Institute in Bydgoszcz (Bromberg) from 
1921-1923, his subsequent employment in Warsaw as an assistant at the University (1923), and as 
a special lecturer at the Central College of Agriculture. It was during his tenure in the College of 
Agriculture in 1925 that he received a fellowship to study statistics at University College in 
London with the “father of modern statistics,” the famous Karl Pearson (1857-1936). We read 
with great interest the thrilling description of the development of his long friendship with Karl 
Pearson’s son Egon, and the subsequent tension between them and their rival, the brilliant but 
apparently obnoxious Sir Ronald A. Fisher, his visits to Paris and his encounters with Borel and 
Lebesgue, his return to Poland and continued association and elaborate correspondence with 
Egon Pearson which resulted in the publication of their classical “joint statistical papers” on 
Hypothesis Testing during the years 1933-1938, and his return to England in early 1934 where he 
remained until 1938. In this year he accepted the offer from the University of California “to 
coordinate the work in Statistics in the University and especially to develop the subject on the 
mathematical side.” The rest is history, presented by Ms. Reid in a most informative and 
entertaining manner, with great affection, enthusiasm and talent. 

I would, however, hesitate to endorse this book as the sole, authoritative source for the history 
of modern statistics for the uninitiated. There is an obvious and sometimes unjustified overempha- 
sis of Neyman’s contributions (undoubtedly groundbreaking) at the expense of other American 
statisticians. As W. H. Kruskal points out in a private letter, “I see no mention of John Tukey 
anywhere, nor of George Box, although Joan Box’s biography of Fisher has three index entries. 
The late Jimmie Savage appears twice in the Index but the text entries are brief and superficial.’ 
When asked in 1979 about his attitude toward the currently popular Bayesian view of statistical 
inference, Professor Neyman’s reply to the author (page 274) was: “It does not interest me. I am 
interested in frequencies.” This minor episode may serve as an indication of Neyman’s “occa- 
sional refusal to pay attention to important scientific work done by other statisticians” (in the 
words of Professor Kruskal). The author’s justifiable fascination with her “subject,” especially his 
gallantry, charm, progressive and liberal thinking, devotion to colleagues and students causes her 
to underplay this aspect of his personality. 

Unfortunately, a list of Neyman’s contributions is not included in the book and the reviewer 
found somewhat annoying the author’s habit of interrupting the smooth and absorbing chrono- 
logical narration with light anecdotes and personally witnessed episodes in Neyman’s life and 
routines in 1978 and 1979. Some of what Reid gives us in these excursions may seem suspiciously 
like padding. 

Nevertheless, I recommend that anyone even remotely associated with statistics join the author 
in her interesting and enthusiastically sympathetic description of the life of a remarkable person 
and great scientist, whose dedication and unyielding determination elevated statistics to the status 
of a bona fide scientific and academic discipline and who was a major contributor to its 
foundation. To acquire a concurrent but somewhat different view of the course of the develop- 
ment of modern statistics, I would recommend J. Box-Fisher’s biography [2] of her father, R. A. 
Fisher, to supplement Ms. Reid’s volume. 

A more accurate, broader and perhaps deeper history of statistics, with an emphasis on 
American contributions still awaits its author. However, we should all be grateful to Ms. Reid for 
stepping off into unchartered territory and laying the foundation for such a work. 


Personal Note. Since 1978, the reviewer, with Professor N. L. Johnson of the University of 
North Carolina, has been engaged in editing an 8-volume Encyclopedia of Statistical Sciences. In 
1980, when the editors approached the late Professor Neyman to contribute to the ESS, his 
response was most encouraging; regrettably, the transitory nature of human life prevented him 
from fulfilling his commitment. We were gratified, however, to receive from him several unsoli- 
cited letters advising us on the work and suggesting entries with the names of suitable authors 
appropriate for our endeavor. We are proud of this association with Neyman during his last years. 
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Differential Equations: Classical to Controlled. By Dahlard L. Lukes. Mathematics in Science and 
Engineering, vol. 162. Academic Press, New York, 1982. x11 + 322 pp. $37.50. 


EMILIO ROXIN 
Department of Mathematics, University of Rhode Island, Kingston, RI 02881 


The subject of differential equations always played a central role in the development of the 
infinitesimal calculus. Indeed, an obvious generalization leads from the problem of “antideriva- 
tive’ x = dx/dt = f(t) to the differential equation (E)x = dx/dt = f(t,x). From the very 
beginning, Isaac Newton made differential equations an almost universal tool to model the 
physical world, and the success was so impressive that it opened the doors for a great development 
and understanding of the laws of Nature. Differential equations have been applied not only to the 
physical world (say, physics and astronomy), but also to much more “human” sciences such as 
economics, physiology, population dynamics, epidemiology and management sciences. They have 
even had an impact on philosophical questions like causality and determinism. 

It would be interesting to make a thorough study of the emphasis and style of the most 
representative books on differential equations over the years. This would show, besides the 
technical advances, the changing attitudes of mathematicians and the different expectations of 
what this theory was supposed to accomplish. This review certainly cannot present such an 
exhaustive study, but will contribute just a few remarks of this nature and address, in particular, 
the introduction of control theory into the subject of differential equations. 

In the beginning, the naive approach to the problem posed by the differential equation (E) was 
“to find a function x = x(t) which satisfies (E) identically in t”. From this starting point many 
useful methods have been developed, ineluding classification of the equations into different types, 
and also numerous tricks, to be used skillfully, in finding the unknown function. 

The next step was to discover that the correct question to ask was to solve the “initial value 
problem,” and to give conditions assuring existence and uniqueness of the solution. These 
questions found their proper place within the theory in the early 19th century work of Cauchy. 
They were significantly improved within the present century by the introduction of the 
“Carathéodory conditions,” which allow f(t, x) in (£) to be just measurable (hence discontinu- 
ous) in ¢. This is of utmost importance in the theory of optimal control, where “bang-bang” 
controls are in some sense “best.” 

Linear equations of the forms x = Ax and x = Ax + b, and in particular equations and 
systems of equations with constant coefficients, are an ideal ground where with relatively modest 


224 EMILIO ROXIN [March 


effort one can obtain many explicit results. Physically speaking, linearity means the principle of 
superposition, which is basic for a simple formulation of the laws of elasticity, vibrations, 
electromagnetic circuits and so on. Its successful use was the basis on which the growing 
technology of the early 20th century flourished. It was the time of “linear engineering,” when 
nonlinearities, if too strong to be neglected, were taken into account by tables, diagrams, or, as a 
last resort, by a conveniently generous safety factor. That it is possible to achieve a lot with such 
methods is shown by the power plants, electrical networks and machinery, bridges, skyscrapers 
and so on, which were built within this period. 

Meanwhile, the ‘“‘mathematical” interest expanded into the analytic theory. Ordinary differen- 
tial equations were considered within the framework of (real or complex) analytic functions. This 
gave uS power series expansions and the study of singular points. Periodic systems of 0.d.e.’s 
inherit some of the nice properties of the systems with constant coefficients. Floquet theory is 
already classical, but the interest in periodic systems and periodic solutions carried over into the 
area of optimal control theory, where it is a problem of current research interest. 

Solutions of o.d.e.’s were characterized as “dynamical systems,” a name which may carry some 
application-oriented flavor, and soon a bridge was built into the most highly abstract regions of 
topology (“topological dynamics’’). 

The impossibility of “solving” most differential equations in closed form made it imperative to 
look for other methods to attack the problem. Numerical methods were developed early and, in 
the present era of the computer, have received a tremendous boost. Limitation of space is the 
reason for not addressing this subject here and for leaving out several other important areas such 
as boundary value problems, eigenfunctions, etc. 

For mildly nonlinear equations, a representation of the form x = linear part + e times 
perturbation, and a corresponding development of the solution in powers of e proved very useful. 
We find these in the work of H. Poincaré. These power series also gave birth to the very curious 
“asymptotic series,” which can be very good for numerical computations even in the case when 
they are divergent! Related to this approach is also the “method of averaging,” where a nonlinear 
oscillating system can be treated as a linear one, using some “average” parameters. 

The methods above still try to obtain approximations to the solution of a differential system. 
The “second” or “direct”? method of Lyapunov, in contrast, uses inequalities in order to obtain 
upper bounds needed to ascertain stability properties of the solutions. This method was intro- 
duced at the turn of the century and revived in the thirties. This led to the consideration of all 
kinds of differential inequalities. 

The post-World War II era saw a growth in interest in nonlinear differential equations in 
several directions. In particular stability theory and nonlinear phenomena were in the spotlight. 

Topological properties of the solutions, an idea going back to I. Bendixson, became suddenly 
of interest. This subject, which includes limit sets, invariant sets, and index theory, became known 
as “geometric theory of differential equations” in the work of Lefschetz which had a great impact. 
It included the birth of modern optimal control theory. 

The genesis of control theory can be traced back in history. James Watt’s centrifugal regulator 
is usually cited, but earlier regulating systems can be found in Antiquity and in the Middle Ages. 
In World War II this became the subject 'of “servomechanisms.” Controls acted upon a system via 
“feedback.” The systems considered were mostly linear, but a “linear theory” of a sustained 
oscillation (as is the case in any radio oscillator circuit) is a self-contradiction: if the system were 
really linear, the oscillations would necessarily grow to infinity or die out to zero. Hence 
nonlinearities are essential. Extremely nonlinear feedback, implemented by on-off relays 
(=“bang-bang”’ controls), proved to be the cheapest, and in some mathematical sense, the 
optimal control for many systems. Bushaw proved this for the harmonic oscillators, x = y, 
y = —x + u(t), with control u bounded —1 < u < 1 but otherwise arbitrary; the state (x, y) is 
supposed to be brought from a given initial state to the rest position (0,0) in minimum time. The 
optimal solution turns out to be a control of the “bang-bang” type: it only takes values +1, and 
the whole problem reduces to finding the switching points. 
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The subject of optimal control mushroomed in the fifties and became well established and 
rounded off in the sixties. The famous “Pontryagin’s principle” was established. Its roots, of 
course, were in the classical calculus of variations. Parallel developments led R. Isaacs to create 
the “differential games,” which are control systems with two controllers with opposed objective, 
and R. Bellman to develop “dynamic programming,” which is the discrete counterpart of optimal 
control. 

The present book of D. L. Lukes brings a new and innovative look to several areas of the 
theory of ordinary differential equations. Matrix manipulations for solving linear systems are 
developed systematically (the “ABC” method). Time varying linear systems of a particular kind 
(““commutative” systems) are shown to share many nice properties of the systems with constant 
coefficients. Finally, control systems get a respectable place. In particular, questions like controlla- 
bility and stabilizability by linear feedback, which are of practical importance, are well treated. 
Many of the given results are due to the author or at least show his personal flavor. 

While this book does not claim to be the last word in research accomplishments on the subject, 
represented by other more encyclopedic works, it certainly is in line with an up-to-date outlook. It 
is a very refreshing addition to the existing literature. 


From the Calculus to Set Theory, 1630-1910: An Introductory History. Edited by I. Grattan- 
Guinness. Duckworth, London, England, 1980. 306 pp. 


ROBIN E. RIDER 
History of Science and Technology Program, The Bancroft Library, University of California, Berkeley, CA 94720 


For many years the standard guide to the history of the calculus was Carl Boyer’s 1939 work, 
republished in 1949 as The History of the Calculus and Its Conceptual Development, and reissued in 
a popular paperback edition by Dover in 1959. Since that date, the contours of scholarship in the 
history of mathematics have begun to change. An encyclopedic guide to the previous literature, 
edited by the late Kenneth O. May and published in 1973, identified areas of strength and 
weakness in the historiography and offered guidance to those undertaking research in the field. 
New journals for the history of mathematics have been founded; the most accessible to English- 
speaking readers is Historia mathematica, also a product of May’s energy and enthusiasm for the 
subject. In addition to the traditional genres of encyclopedic histories and collections of bio- 
graphical anecdotes, readers can now choose among intellectual biographies rich in both personal 
and mathematical detail’, analyses of the institutional milieux and social contexts in which 
mathematicians operated’, detailed studies of topics in mathematics’, and new editions of the 
works and papers of individual mathematicians*. The authors of these recent studies have taken 
advantage of both published and unpublished sources, and have in some cases expanded their 
purview to include minor figures as well as major heroes in the history of mathematics. 

Six historians of mathematics, all of whom have participated in this resurgence of interest and 
activity, have joined forces in From the Calculus to Set Theory, 1630-1910. The book covers more 
than the standard territory of the history of calculus. It is intended as an historical introduction to 
a set of interrelated developments in mathematics: the origins of the differential and integral 
calculus in the 17th and 18th centuries; the elaboration of mathematical analysis in the 19th and 
early 20th centuries; and the reexamination of the foundations of mathematics, especially the 
progress of set theory and mathematical logic in the latter part of the 19th century. The level of 
exposition, sometimes heavily mathematical, presupposes considerable mathematical knowledge. 
As the editor cautions, “this is not the place from which the mathematics can be learnt” (p. 7). 
The book is thus directed at readers with some training in mathematical analysis and set theory, 
especially upper-division undergraduates and graduate students. 
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The six chapters are each sufficiently self-contained to stand alone; any can profitably be read 
by those with interests in a particular mathematical domain. Chapter 1, by Kirsti Moller Pedersen, 
guides the reader through the thicket of techniques and special methods for solving problems of 
rectification and quadrature, tangents, and maxima and minima of curves during the period 
1630-1660. Chapter 2, by H. J. M. Bos, explores the work of Newton, Leibniz, and Leibniz’ 
18th-century followers in the creation and application of a coherent, unified differential and 
integral calculus. Bos concludes with an admirable summary of the influential work of Euler, the 
starting point for the next chapter. In Chapter 3, Ivor Grattan-Guinness traces the route by which 
mathematical analysis developed from Euler through Cauchy and the Weierstrass school to about 
1880. Chapter 4, by Thomas Hawkins, goes over the same chronological ground, but with special 
attention to the development of the integral and its theory by such mathematicians as Cauchy, 
Riemann, and Lebesgue. In Chapter 5, Joseph Dauben summarizes his research on Georg Cantor 
and the genesis of set theory. General questions of the foundations of mathematics from 1870 to 
1910 are discussed by Robert Bunn in the final chapter. 

To survey mathematical research over the course of nearly three centuries demands selectivity, 
and the contributors to this book have in general chosen to focus on the well-known names in the 
history of mathematics. What is too often left out is information and interpretation regarding the 
historical context in which the great mathematicians operated. For example, both Bos and 
Dauben treat briefly related philosophical questions and the importance of personality. Readers 
interested in pursuing these questions further, however, will have to turn elsewhere—to such 
works as A. Rupert Hall’s book on the prolonged and vitriolic dispute engendered by competing 
claims to the invention of the calculus, and Dauben’s book-length biography of Cantor, which 
interweaves psycho-emotional, mathematical, and philosophical themes>.. Likewise, Bos and 
Grattan-Guinness both mention in passing the links between mathematical analysis and other 
sciences, especially the rational mechanics of the 18th century and the mathematical physics of the 
19th. Physical problems stimulated investigations in mathematical analysis and helped shape the 
form of results; many of the key figures in the history of mathematical analysis played important 
roles in the development of physics as well. Grattan-Guinness’ book (with J. Ravetz) on Fourier 
and the study of heat explores the connection between mathematics and physics in the work of 
one individual at a time when French mathematical methods profoundly influenced physical 
science’. A growing number of studies on mathematical physics in the 19th century consider the 
relation between Continental mathematical methods and British physics’. The book under review 
also gives little indication of the variety of opinions and the vigor of ongoing debate regarding the 
origins of a rigorous formulation of calculus. For example, Grattan-Guinness’ assessment of 
Cauchy’s work differs markedly from that of Judith Grabiner. To learn about this debate, 
however, one must turn to their books on the foundations of mathematical analysis’. 

Such shortcomings are more striking in light of the book’s announced purpose. The editor and 
contributors, who are conscious of the tendency in mathematics to stress “the accumulation of 
mathematical knowledge,” wish instead to explore “the growth of mathematical understanding, 
the appreciation of why a mathematical theory developed and took its form, and not merely that 
it does have its content” (p. 3). To downplay the historiographical issues, the social context, the 
philosophical and personal dimensions, the connections with other sciences, is to undermine the 
purpose of the book. Inclusion of developments in set theory and the foundations of mathematics 
is a commendable revision to the usual history of the calculus. But an attempt to balance this 
largely internalist account with considerations of the broader historical context would be more 
welcome. Mathematicians as well as other scientists (and historians of science) should know, and 
would like to know, more about the evolution of so important a concept as the calculus. 


Notes 
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2. Paul L. Rose, The Italian Renaissance of Mathematics: Studies on Humanists and Mathematicians from 
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LETTERS TO THE EDITOR 


Material for this department should be prepared exactly the same way as submitted manuscripts (see the inside 
front cover) and sent to Professor P. R. Halmos, Department of Mathematics, University of Santa Clara, Santa 
Clara, CA 95053. 


Editor: 


Since the appearance of my note [Le] on Weierstrass’ Theorem, I have been informed of several 
sources of proofs that are very similar to mine. These proofs appear in [C], [F], [Lah], and [Lam]. 
In addition, I have found a short discussion of how the proof can be obtained in [Gl]. 

R. Streit points out that a generalization of the Bernstein proof appears in [H], Theorem 10.4.3 
and (19.2.14). 

I find it interesting that this proof of the most important theorem in approximation theory 
seems to be better known to probabilists than approximation theorists. 
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Editor: 


Recently, in a discussion of estate tax law in a publication by Commerce Clearing House (a 
well-known and highly respected organization, whose writings on the tax laws are second in 
authority only to the Federal Code itself), I encountered the following remarkable passage: 
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Suppose, it is hypothesized, that a decedent has specified in his will that after certain specific 
bequests are made from his estate, the balance is to be used to pay taxes and the residue given to 
charity. 

The author notes that, since charitable bequests are deductible from the taxable estate, the 
amount of the charitable contribution will affect the tax; but that the amount of the tax will in 
turn determine the size of the charitable contribution. Neither can be determined until the other is 
known. This, he concludes, is a mathematical problem of great difficulty. 

Great difficulty indeed! Problems of this type‘ appear routinely in the Rhind papyrus no less, 
and techniques for solving them have been known for over 38 centuries! Even the most elementary 
algebra is not really required—such problems can be solved by a clever method known as “false 
positioning”. (For details, consult J. R. Newman, “The Rhind Papyrus” in The World of 
Mathematics, Vol. I, pp. 170-178.) 

The CCH author goes on to suggest Trial and Error as an acceptable method of solution and 
even cites an Internal Revenue Service form on which the requisite arithmetic can be reported! 

To the average person, even the average educated person, a problem such as the one above 
involving two unknown quantities that are mutually dependent on each other presents a challenge 
of overwhelming magnitude. The sad part is that many of these people have probably studied 
elementary algebra and know how to solve the problem—they just don’t know they know. 


F. W. Luttmann 
Department of Mathematics 
Sonoma State University 
Rohnert Park, CA 94928 
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Some More Clerihews 


Evariste Galois 

Was far from bourgeois; 

His ideas political and mathematical 
Verged on the radical. 


Georg Cantor 

Would often wander 
In the general vicinity 
Of infinity. 


Kurt Godel 

Cleared many a hurdle 

To show that mathematicians 
Cannot have omniscience. 


Nicolas Bourbaki 

Is quite a mystery to me; 

It is difficult to write a verse on 
A pluralistic person. 


—Samuel Bates 
Department of Mathematics 
Northwestern University 
Evanston, IL 60201 
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down design, coding, and testing. 
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The American High School Mathematics Examination, given each 
year to over 400,000 high school students, helps to identify young 
people with an interest in and a talent for mathematics problem solv- 
ing. This volume contains the Examinations for 1973-1982, and is a 
continuation of Contest Problem Book |, Il, and III, published as vol- 
umes 5, 17, and 25 of the New Mathematical Library. The three 
earlier books contain the first twenty-five annual examinations along 
with solutions. 

The problems are designed so that they can be solved with a 
knowledge of precalculus mathematics, with emphasis on intermedi- 
ate algebra and plane geometry. The book includes subject classifica- 
tions that indicate which questions relate to a particular topic in 
mathematics. 

The solutions have been carefully constructed, but students are 
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BULGARIAN SOLITAIRE 


ETHAN AKIN* AND MORTON DAVIS 
Department of Mathematics, CUNY, The City College, New York, NY 10031 


Blast Martin Gardner! There you are, minding your own business, and Scientific American 
arrives like a virus. All else forgotten, you must struggle with infection by one of his fascinating 
problems. In the August 1983 issue he introduced us to Bulgarian solitaire: 


Form a pile of 45 cards, then divide it into as many piles as you like, with an arbitrary 
number of cards in each pile. You may leave it as a single pile of 45 or cut it into two, three 
or more piles, cutting anywhere you want, including 44 cuts to make 45 piles of one card 
each. Now keep repeating the following procedure. Take one card from each pile and place 
all the removed cards on the table to make a new pile. The piles need not be in a row. Just 
put them anywhere. Repeat the procedure to form another pile, and keep doing it. 

As the structure of the piles keeps changing in irregular ways it seems unlikely you will 
reach a state where there will be just one pile with one card, one pile with two cards, one 
with three and so on to one with nine cards. If you should reach this improbable state, 
without getting trapped in loops that keep returning the game to a previous state, the game 
must end, because now the state cannot change. Repeating the procedure leaves the cards in 
exactly the same consecutive state as before. It turns out, surprisingly, that regardless of the 
initial state of the game, you are sure to reach the consecutive state in a finite number of 
moves. 


He then quotes an amazing theorem of Jorgen Brandt that Bulgarian solitaire always terminates 
in the consecutive sequence {1,2,...,A} providing, of course, that the number of cards in the 
deck is the triangular number ¢, = 1+ 2+ --- +k. But what if the number of cards is not 
triangular? As we later discovered, Brandt answered this question as well. Since a deck has only 
finitely many layouts, the play of Bulgarian solitaire must eventually cycle (we include equilibrium 
as a cycle of period 1). In his elegant, but terse, paper [1] Brandt characterizes and counts all 
cycles for any given deck size. The result quoted by Gardner— Brandt’s Equilibrium Theorem, we 
will call it—follows because the consecutive layout is the only cycle when the deck size is a 
triangular number. 

In elaborating Brandt’s results we hope to share with the reader our delight in Bulgarian 
solitaire viewed as a simple sort of dynamical system. 


In observing a game of Bulgarian solitaire we need a way of describing the state of the deck at 
any time. The simplest way to describe a layout of the deck is to list the sizes of the various stacks. 
Because we disregard the arrangement of the stacks we can make the listing decreasing (or more 
precisely, nonincreasing). In this way we get the rank order description: 


S, = the size of the kth stack listed in decreasing order of size. 


Ethan Akin: After my undergraduate work at City College of New York, I did my Ph.D at Princeton in 
piecewise-linear topology. Since returning to City College I have worked primarily on applications to biology and 
economics. I am also interested in philosophy, history and horses. But I never, never touch problems in 
combinatorics. (Well, hardly ever.) 

Morton Davis: I received my Ph.D. at the University of California (Berkeley) in 1961. My dissertation, most of 
my research papers, and an introductory book were all addressed to my primary interest: game theory. I have also 
worked in artificial intelligence (computer learning) and on financial models. I am now finishing my 19th year at the 
City College of New York. 
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We get the frequency distribution by counting, for each whole number /, how many stacks 
contain exactly j cards: 


N, = the number of stacks of size j. 


Finally, the residual distribution counts, for each whole number i, how many stacks contain at 
least i cards: 


Q, = the number of stacks of size > 7. 


The latter two modes of description are related by the equations: 


(1) 0,= DN, N=Q,- Q)41- 
, 

The residual and rank order descriptions are related by a useful sort of geometrical duality 
which Gardner mentions. Notice first that Q is an infinite sequence of natural numbers. It is 
nonincreasing, with finitely many nonzero terms followed by an infinite string of zeroes. We can 
state this somewhat formally by: 0 € &, where our “state space,” #, is the set defined by: 


L={P=(P,,Py,...): Pi >P>--:, P. € Z* for 
all i and P, = 0 for some k}. 


Here Z*= {0,1,2,...} is the set of natural numbers. 

We can also regard the rank order sequence S as a member of & by attaching a terminal 
string of zeroes. In any example we will describe the S or Q sequence by listing the initial nonzero 
part and omit the zeroes. 

For example, with S = 8,3,3,2,1,1 we can picture the layout of the deck as in Fig. 1 (upper 
left). Instead of stacks of cards we see stacks of dots with each dot marking a point whose x and y 
coordinates are whole numbers. So a point with whole number coordinates (k, i) is marked with a 
dot precisely when S, > i. 

In general, for any sequence P in & we define the shadow of P by 


(2) Sh( P) = {(i, /): i, 7 € W and P >j}, 


where W = {1,2,3,...} is the set of whole numbers. Clearly, we can recover P by observing its 
shadow. For P, we count the shadowed points with x-coordinate i, or alternately look at the 


I 


y-coordinate of the highest point with x-coordinate i: 
(3) P, = max{ j: (i,j) € Sh(P)}. 
By convention, we define the maximum over the empty set to be zero (max @ = 0). 

Now suppose S is the rank order sequence for some layout, as in Fig. 1. To say that (k, 7) is in 
the shadow of S means that S, > i. Because the sequence is nonincreasing, this means that all of 
the first k stacks have size at least i, i.e., 0, > k. This argument goes the other way as well and 
shows that S, > i (.e. (k,7) © Sh(S)) if and only if Q, > k (i.e. (i,k) € Sh(Q)). In other words, 
that shadow of the residual distribution Q is obtained from the shadow of the rank order 
sequence S-by just transposing the x and y coordinates. Thus, in Fig. 1 (upper right) we see that 
shadow of the residual distribution Q = 6, 4,3,1,1,1,1,1. 

For any sequence P in & we can define the dual sequence P* in ¥ exactly this way. For P# 
we count the P-shadowed points with y coordinate k or look at the x-coordinate of the rightmost 
point with y-coordinate k: 


(4) P* = max{i: P >k}, 


where, again, max @ = 0. It is clear that in general Sh( P*) is obtained from Sh( P) by transposing 
coordinates. This shows that P** = P. 

We can summarize all this by saying that the rank order sequence S and the residual sequence 
QO are dual to one another, where duality in & is defined via (4). This duality is well known to 
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Fic. 1 


those who count things other than cards. For example, in [3] May exploits this duality in 
discussing abundance of biological species on islands. 


Now let’s play some Bulgarian solitaire. We denote by a the map which associates to the rank 
order sequence of a layout the rank order of the new layout after a round of play. Fig. 1 (left) 
illustrates what happens. Pull one card from each stack—we use the American riverboat 
convention of extracting it from the bottom of each stack—and fit the resulting new stack in an 
appropriate place among the remains of the old stacks. We get S’ = a(S). When S = 8,3,3,2,1,1, 
S’ = 7,6,2,2,1 as shown. The size of the new stack is the number of old stacks, in this case: 6. In 
general, for any layout: 


(5) max{ k: S, > 0} = }|N, = Q, = the number of stacks. 
J 


We denote by B the dual of a which associates to the residual sequence of a layout the residual 
sequence of the new layout after one round. The first column is pulled off, rotated 90° to get a 
row of dots and then this new row is pushed underneath the other columns after they are moved 
left one place. This describes Q’ = B(Q). In Fig. 1 (right) with Q = 6,4,3,1,1,1,1,1, QO’ = 
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5,4, 2, 2,2, 2,1 which is dual to S’ = 7,6,2,2,1. 

By sufficient contemplation of Fig. 1 the reader may be able to convince himself that this 
messy description is indeed dual to a. But the question tends to arise at this point—if it hasn’t 
earlier—why bother with the dual at all? The residual distribution is the least intuitive way of 
describing a layout so why introduce it in the first place? 

The answer is that while—as the descriptions suggest—the physical play of the game is best 
understood using S, it turns out that the analysis is best done using Q. This is because the formula 
for the map a is somewhat messy. But the formula for the map B is easy to work with. Let’s 
derive it by looking first at the frequency distribution N. 

After one round of play the number of new stacks of size 7 is the number of old stacks of size 
j + 1, except that there is one additional new stack of size Q, (= the old number of stacks by (5)). 
This is because we have taken one card from each old stack and put them together to form one 
new stack. Notice that the old stacks of size 1 are gone. 

We can write this as: 


(6) N/ = Nai + 8(Q,),. 
The second term in (6) is the Kronecker delta defined by 
\ fl, t=, 
8(1), = (0 i #7. 
If we now define 
_ _fl, i<k, 
x(<h),= D3(k),={47 SF 
je! 
then we can get the formula for Q’ = B(Q), by summing (6) on the j’s > i (see (1)): 
(7) B(Q),=Q41+x(< Q,),. 


So, for example, the new stack number is 6(Q), = Q, + 1: one more than the number of old 
stacks of size at least 2. Notice that formula (7) corresponds to the description we gave before: the 
columns of Sh(Q) are shoved left one place after the first column is removed and then the first Q, 
columns are jacked up by 1 as the Q, row is shoved underneath. 

Instead of a formula for a, we can just use duality to say: 


(8) a(S) = B(S*)*, 
1.e., for analytic purposes, we go from the rank order description to the dual residual sequence, 
apply 6 to get the new residual sequence and take the dual to get the new rank order. 


We emphasize again that we will keep going back and forth: @ for intuition and B for analysis. 
(“Heaven for climate, Hell for company’”—M. Twain.) 


As we play, the total number of cards in the deck doesn’t change. This number is given by the 
equations: 


(9) S, = LIN, = LQ, = the size of the deck. 
k J l 
The first two formulas are easy to check. The third can be derived from the second using (1) but it 


is easier to use duality and note that for either S or Q the sum is just the size of the shadow. In 
general, for any P € & we define the size of P: 


(10) \|P|| = }> P. = the number of points in Sh( P). 
As an exercise, we prove that 8 preserves size by using the analytic definition, (7): 


WBCQ)Il= DX Qar+ XE x(< Q,),. 
i=1 i=1 
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The first sum is ||Q|| — Q, while the second consists of Q, ones and so sums to Q,. Hence, 


BC Q) Il = lle. 


Our original set of sequences ¥ can be cut up into different levels by defining 
L={ PEL: \|Pl| =r}. 


The finite set Y is exactly the set of all rank order sequences (or alternatively the set of all 
residual frequency sequences) of layouts for a deck of size r. 

This raises another objection. In discussing Brandt’s paper we mentioned that it was the 
finiteness of the set of layouts of a deck which made the cycling inevitable. In a roundabout way, 
we introduced the infinite set “& and then sliced it back up into finite sets. Why not just fix the 
deck size r and restrict attention to “&? The answer to this is the moral of our mathematical story. 

The representation of some operation by a map from a set to itself, e.g., a: “> £ or B: 
L— L, is the prototype of a dynamical system model. Iterating the mapping corresponds to 
applying the operation repeatedly, starting from an initial point. The first step in studying such a 
dynamical system is to look for structures on the set, 1.e., ways of measuring, combining or 
comparing elements of the set, which are preserved by the map in some fashion. Why “? Because 
¥ is rich in structures which are only partly apparent in each separate ¥.. This explains why our 
analysis—once we get to it—all proceeds by comparing sequences at different levels of &. 

To illustrate this idea of comparing different levels we examine the fate, under £, of an extra 1 
included on top of some column of Q: 


(11) 

B(Q + 8(1)) = B(Q)+8(Q, +1), B(Q+8(k))=B(Q)+8(kK-1) GER > 1). 
The proof of (11) is an exercise for the reader in the use of (7) and the Kronecker delta notation. 
To see what it says go back to Fig. 1 (right) and put your finger on top of some column of Q. If 
you start over some column beyond the first (k > 0), then after the operation your finger has 
moved along with its column one step to the left (to the k — 1 place). If your finger started out on 
the first column, then it now ends up underneath the new Q, + 1 (in this case = 7) column, lifting 
it up one place. A technical point for the wary: adding on an extra 1 may take the sequence out of 
YL because Q + 6(k) might not be monotonic. However, 8 can be defined via (7) for arbitrary 
sequences of natural numbers and its consequent (11) is true in general. 

In addition to the duality relation and the measurement of size, the structures on # that we 
use are a partial ordering and a closely related metric. 

For P,Q in & we define a partial ordering (i.e., a reflexive, antisymmetric and transitive 
relation) by: 


(12) P>Qe2P>Q 


So P > Q means for each i, P, is at least as high as Q,. From this it is clear that 


I 


(13) P > Q = Sh( P) contains Sh( Q) as a subset. 
We define the distance between P and Q by 


(o6) 


(14) IP-Ol= LIP - OI. 


i=1 


i=1,2,.... 


I 


We will see below that the metric, too, can be described using shadows. 
Let us collect together the relations among the four pieces of structure (size, duality, order and 
metric) that we have defined on &%. 


PROPOSITION 1. Let P,O €&%. 

(a) /P*l| =|IPI, IP — Ql =|P* — Q*I, and P > Q if and only if P* > Q* 

(b) ||P — Q|| > ||PI| — ||O|| with equality if and only if P > Q. 

(c) ||P — Q|| = minimum {||P || — Oll}, where P and Q vary over all members of & such that 
P>P,QandOQO < P,Q. 
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Proof. Recall that the shadows of P and P* are just transposes of one another. ||P|| and ||P*|| 
are the sizes of the respective shadows and so they are equal. Similarly, Sh( P) contains Sh(Q) if 
and only if Sh( P*) contains Sh(Q*). The distance result also follows by looking at shadows as we 
shall see. 

For (b), notice that |P — QO,| > P — Q, with equality if and only if P > Q,. Now sum on i 
and notice that equality between the sums requires equality at every i. 

For (c) we build two new elements of #&: P V Q and P A Q by defining for each /: 


(15) (Pv Q), = max(P,,Q,), 
(P AQ), = min(P,,Q,). 


Clearly, PV OQ > P,Q and PA Q< P,Q. Because |P, — Q,| = max(P,,Q,) — min(P,, Q,), we 
can sum on 7 to get: 


(16) IP — QI =P Vv Qll— IP A QI 
It is easy to check that 


Sh( P v Q) = Sh( P) U Sh(Q), 
Sh( P A Q) = Sh( P) 9 Sh(Q). 
From these equations and (16) we get the shadow interpretation of distance: 
(17) ||P — Q|| = the number of points in Sh( P) or Sh(Q) but not in both. 


Because this count is preserved when we transpose coordinates, it follows that ||P* — Q*|| = 
||P — Q||, completing (a) 

(c) also follows because P > P,Q implies P > PV Q, and so \|P|| > ||P Vv Q||. Similarly, 
Q < P,Q implies ||Q|| < ||P A Q||. Hence, || P|] — ||Q|] > ||P — Ql| by (16). QED 


REMARK. This result shows the relationship between the ordering and the metric. Given the 
concept of size we can get the ordering from the metric by (b) or go the other way by (c). 


Because duality preserves order, we can say that a layout of some deck is greater than or equal 
to a layout (of a possible different deck) if the corresponding rank order sequences are related by 
> or, equivalently, if the corresponding residual frequency sequences are related by >. This 
leads to a nice picture of the ordering which we formalize as: 


PROPOSITION 2. Start with a layout of cards. Now deal out additional cards onto existing stacks 
or deal out some new stacks or both. The new layout is greater than the old layout. Conversely, we 
can get to any greater layout by dealing out new cards in this fashion. 


Proof. The new deal does not decrease the number of stacks of size > i. So P, > Q, for all i, 
where P and Q are the residual sequences for the two layouts. 

Conversely, if one layout is greater than another, then Sh(7) contains Sh(S), where T and S$ 
are the rank order sequences. So we can build Sh(7) from Sh(S) by adjoining new points, i.e., by 
dealing out cards. QED 


In particular, notice that the restriction of the ordering to a single level is trivial, i.e., two 
layouts of the same deck can be compared by the ordering only when they are equal. This is what 


we meant when we said that some of the rich structure of £ is lost when we restrict attention to 
L 


r' 


Now we are ready to relate the structure on # to our dynamical system: 


THEOREM 3. Let P,Q EY. 

(a) ||PIl = |BCP Il = llaCP IL 

(b) |la(P) — a(Q)|| = I|BCP) — BCQ)II < I|P — Ql 
(c) If P > Q, then a( P) > a(Q) and B(P) > B(Q). 
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Proof. The results for a follow from the results for 8 by equation (8) because duality preserves 
size, distance and ordering. 
We have already proved (a) and the formal proof of (b) is very similar. From (7) it is clear that 


WBC P) — BOQ) < Dd |Par - Qailt+ bd Ix(< Pi), -x(< Q,),|- 
i=] 1=1 


The first sum is ||P — Q|| — |P, — Q||. 

Now suppose that P, > Q,. Then the terms of the second sum are zero except when 
QO, <i< P,, because outside this range x(< Q,), and x(< P,), are both 0 or both 1. Inside the 
range 


Ix(< Pi), — x(< Q1),J=x(< P), =1. 


Thus, the second sum is P, — Q, = |P; — Q,| and (b) follows. 

For (c) notice that P > Q implies P.,, > Q,,, and x(< P,), > x(< Q,), for all 7. So from 
(7): BCP), > BC(Q), for all i. 

As (c) is our most important result we give a conceptual proof using Proposition 2. 

Suppose that the layout for Q consists of stacks of blue cards. By the Proposition you can get 
the layout for P by extracting some red cards from your sleeve and dealing them out onto the 
existing layout. Now in playing a round of solitaire make sure that from any stack which contains 
blue cards the one you pick to remove is blue (e.g., pull the card from the bottom of each stack). 
In the new stack put all the blue cards on the bottom. The new combined layout is the same as 
though you played a round with old blue and then dealt out some red cards on top. So by 
Proposition 2 again the new layouts are related by >. QED 


REMARK. While we proved (b) and (c) independently they are actually equivalent in the 
presence of (a), 1.e., if B: #&—- £ is any map preserving size, then B preserves order if and only if 
it does not increase distance. For example, suppose (a) and (b) are true and P>Q. By 
Proposition 1(b) and assumption (b) here we have: 


Pil — Oil =P — Qi > IBC P) — BOQ) > WBC PDI — WBC Q) Il = IP Il — QI. 


Hence, the inequalities are equalities and B(P) > B(Q) by Proposition 1(b) again. Similarly, (a) 
and (c) imply (b) via Proposition 1(c). 


At last, we are ready to release the energy stored up by all this work. To the cycles! 
Recall that if r=74,=k+k—1+---+1=%35k(k +1) and P lies in &, then Brandt’s 
Equilibrium Theorem says that a”(P) eventually equals the consecutive sequence B(k), where 


(18) B(k) =k,k —1,k —2,...,2,1,0,0,.... 


Because a( B(k)) = B(k), this implies that the operation terminates at B(k) after a finite number 
of iterates. 

Notice that B(k) is self-dual, i.e., B(k)* = B(k), because the shadow is symmetric (see Fig. 
2): 


Sh(B(k)) = {CGi, fj): ityp<k +1}. 


For the equilibrium in &%., B(k) is at once the rank order sequence and the residual sequence. 
So 6" (P) is eventually equal to B(k) as well, and B(B(k)) = B(k). 

Now suppose that P lies in Y but r is not triangular: t,_, < r < t,. We use what might be 
called the method of “two big guys hustling a little guy out the door”. 

We can increase P to get Q in Y with Q > P and ||Q|| = t, (deal out ¢, — r cards). Also we 
can decrease P to get Q in & with P > Q and ||Q|| = ¢,_, (lift up r — t,_, cards). 

By the Equilibrium Theorem, there exists a large enough n that «”(Q) and B"(Q) are B(k), 
and that a”(Q) and B"(Q) are B(k — 1). Now because a and B preserve order (Theorem 3(c)): 


a"(P) <a"(Q) = B(k), = a"( P) > a"(Q) = B(k — 1), 
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and similarly for B”(P). So we have: 
(19) B(k—1) < a"(P), B"(P) < B(k). 

These inequalities exactly describe the cyclic elements of & as we now show. 

THEOREM 4. With B(k), B(k — 1) the consecutive sequences defined by (18), let @\) = {PEL: 
Bik -1)< P< B(k)}. 

(a) The set ¢“* is closed under duality and the maps a and B, i.e., P in @ implies P*, a(P) 
and B(P) lie in @. 

(b) Every element of @‘* is cyclic. In fact, P € @‘* implies 
(20) P= a‘(P) = B*(P). 

Proof. (a) is true because B(k) = B(k)* = a( B(k)) = B(B(k)) and because *, a and B 
preserve > . For example, 

B(k) = B( B(k)) = BCP) > BC B(k — 1)) = B(k — 1) 

proves: PE @*) = B(P) Ee ¢?. 

Now we prove (b) and in the process describe ¢‘*) in some detail. 

Notice first that B(k), — B(k — 1), is 1 for i < k and is 0 afterwards, ie., 


B(k) — B(k - 1) =x(<k). 
Hence, if P lies between B(k — 1) and B(k), then the difference P — B(k — 1), is either 0 or 
1 for i up to k and is 0 afterwards. So if we define 
A =P, -— B(k-1),, i=1,2,...,k, 
then A is a function from {1,...,k} to the two-point set {0,1}. 


Conversely, if A: {1,...,4}— {0,1} is any such function and we extend the definition by 
A, = 0 fori >k, then P € 4“), where 


(21) P=B(k-1)+A. 


This exhibits a one-to-one, onto correspondence between the set ¢‘*) and this set of functions 
which we denote {0, 1}*, 
Now we define £: {0,1}* > {0,1}* by 
i<k, 


~ A _ Ai 41> 
(22) B( =| A, =k 


Notice that 1 is congruent to k + 1 modulo k. So (A), is obtained by evaluating A ati+ 1 
reduced back to {1,...,k} modulo k. Iterating n times, B”(A), is A evaluated at i + n reduced 
modulo k. In particular, i + k congruent to imodk implies: 


(23) Bk(A) =A. 
The proof is completed by showing that, under the correspondence between ¢? and {0,1}*, 
B corresponds to B, Le., 


(24) B(B(k —1) +A) = B(k-1) 4+ B(A). 
To prove this with P = B(k — 1) + A, we want to check that 
BCP), = B(k-1),41 +A. + x(< Py), 
is the same as 
B(k —1), + B(A),. 
For i< k, P; > B(k — 1), = k — 1 implies 
B(k-1),41,+ x(< P,), = BK -1),4, +1 = B(k -1),. 
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For i=k, 


B(k ~~ L) pat — Anat =0= B(k _ 1)x, 


while x(< P,), is 0 if P, =k-—1 and is 1 if P, =k, 1e, x(< P,), =A,. Finally, if i>k 
everything is 0. QED 


Fig. 2 illustrates all this. Think of A as a list of 0’s and 1’s placed on the k whole number 
points of the line x + y = k + 1. The 1’s mark the points of Sh(P) — Sh( B(A — 1)). If r = || PII, 


then there are r — t, _, such points. In particular, P = B(k — 1) corresponds to the list of all 0’s 
and P = B(k) corresponds to the list of all 1’s. 


S’ = 7,6, 4,3, 3, 1,1 = B(6) + (1100101) 


QO’ = 7,5,5,3,2,2,1 = B(6) + (1010011) 


Fic. 2 


The behavior of £ is illustrated by Fig. 2 (right). Think of Sh(B(A — 1)) upon which the 0’s 
and 1’s are standing as a Staircase, but one of the paradoxical sort that occur in the prints of M. C. 
Escher. As B is applied everyone moves up one step. The 0 or 1 which was originally on top 
(i = 1 column) takes a step and appears at the bottom (i = k column). Applying this cyclic 
permutation k times we get the identity, i.e., after k steps everyone (and everyzero) is back in his 
original place. 


For completeness we have illustrated a applied to the dual in Fig 2 (left). Notice that the dual 
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consists of the same list of 0’s and 1’s but in reverse order. a on @‘*) is the inverse of B. It 
corresponds to a step down the Escher staircase. 

Theorem 4 identifies certain elements of # as cyclic. We proved (19) by assuming Brandt’s 
Equilibrium Theorem. Thus, if r = ||Q|| is between ¢,_, and t,, then B”(Q) eventually lies in 
@‘*) In particular, the elements identified by Theorem 4 are all of the cyclic elements. We now 
reprove this result without assuming the Equilibrium Theorem and obtain that Theorem as a 
Corollary. 


THEOREM 5. Define @= @% UU @9 U ---, i.e, 
€={PEL: B(k-1) <P < B(k) forsomek = 2,3,...}. 


IfP EL, then B"(P) and a"(P) lie in @ for sufficiently large n. In particular, @ consists of all 
the cyclic elements of &. 


Proof. We prove that B”(P) eventually lies in @ by induction on ||P||. Duality implies the 
result for a, and the rest of Theorem 5 then follows from Theorem 4. 


Earlier we were able to bound P from above and below by elements whose fates were known. 
Now we only have information from below. One of the big guys is no longer there so the other 
gets a bit closer and uses some judo. 

If ||P|| = 1, then the result is trivial because B(1) is the only member of Y. 

Now if ||P|| = 7 with t,_, <7 < t,, we assume the result for r — 1 (> t,_,) and prove it for 
r. Choose Q with P > Q and ||Q|| = r — 1 (we., lift one card from the corresponding layout). By 
inductive hypothesis there exists n such that B"(Q) € ¢™ and so 


B'(Q) = B(k-1) +A, 
where A &€ {0,1}* as in the proof of Theorem 4. 
Because £ preserves the ordering B”(P) > B”"(Q). Because B preserves size ||B"(P)|| = r and 


\|B"(Q)|| = r — 1. So if we let R = B"(P), then R differs from B”(Q) by the addition of 1 in 
some place: 


R= B(k-1)+A+8(/) 
for some j. 

Think of B”(Q) = B(k — 1) + A asa line of 0’s and 1’s standing on the Escher staircase. R is 
built by the addition of a new 1 which hops aboard at position 7. To keep track of this intruding 1 
let us name it: Ralph. 

What happens now depends on the character of the place j at which Ralph has arrived. There 
are three kinds of places 1: 


i is empty (i< kand A, = 0): 
If j is an empty spot, then A + 8(/) is a member of {0,1}* and so R lies in ¢“). 
iis occupied (i<kand A, =1): 
If Ralph arrives at an occupied spot, he hops on top of the 1 who is already there. A + 6(/) takes 
on the value 2 at j and so is not in {0,1}*. R is not in ¢“*?. 
i is the waiting room (i=k+1): 


If Ralph arrives at j = k + 1, then again A + 8(/) is not in {0,1}* because it takes on a nonzero 
value (namely 1) at 7 = k + 1. Again R is not in ¢™?. 


Notice that if j were greater than k +1, then R, =1 and R,,, =0 would contradict 
monotonicity of R (R is in &). So these are the only possibilities. 

So if Ralph arrived at an empty spot, then R is in @‘*) and all is well. Otherwise, Ralph is 
sitting on top of another 1 on the staircase or is not on the staircase at all but on the k + 1 
position, just beyond the stairs, which we have labelled the waiting room. Because R is not yet in 
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@‘*) we must continue to apply B. The proof is completed by using the following: 


CLAIM. (waiting room) B<(B(k — 1) + A+ 6(k + 1)) = B(k —1)+ A+ 6(1). 
(occupied spot) If A, = 1, then BY(B(k —-1)+A+8(J)) = B(K-1D+A 
+8(j + 1). 

Suppose that R is not in ¢“*?, i.e., Ralph’s position j is not an empty one. The claim says that 
after A iterations of B the original underlying list of 0’s and 1’s, A, is restored, but Ralph has 
moved. If he was originally at an occupied position /, then he is now one step down at / + 1, 
including the possibility ( = k) that he has moved out to the waiting room. If he was originally 
in the waiting room (j = k + 1), then he is now at the top of the stairs (at position 1). 

Now if this new position is empty, then B*(R) is in @“*). If not, we apply B* again. Ralph 
keeps moving rightwards (or clockwise if we include the leap from k + 1 back to 1) one step at a 
time, sampling each position in turn until he finds an empty spot. Then the process terminates 
with entry into @‘*), Notice that there are certainly empty positions somewhere because 
|B"(Q)|| = r — 1 < t, and so A does not consist entirely of 1’s. 

To prove the Claim we begin with a single application of 8. By the exercise (11) (which we 
sneaked in for just this purpose): 


(25) B(R) = B(B(k — 1) +4 + 8(j)) = B(k - 1) + B(A) + 8(2), 
wherez=j-—lifj>1 and z=k+A, if j=1. 
Here we have used (24) to evaluate B( B(k — 1) + A) and we have used (B(k — 1) + A), +1=k 
+ A, for the 7 = 1 case. 
_ The first half of the Claim (j = & + 1) follows immediately by iterating (25) k times and using 
B*(A)=A. 
Now suppose j < k and A = 1. After j — 1 iterates of (25), note that z=j —(j-—1)=1: 
B/~*(R) = B(k — 1) + B/*(A) + (1) 
and £/~1(A), = A, = 1. So applying (25) again, we note that z = k + B/~1(A), = k + 1 and so: 
B/(R) = B(k — 1) + B/(A) + 8(k + 1). 
Now apply (25) A — j times again. This time z=k+1-—(k-j)=j+1: 
B‘(R) = B(k- 1) + B*(A) + 8(f + DV). 
Finally, B“(A) = A completes the proof of the Claim and so, of the Theorem. 


To see what is happening suppose that the 1 originally in position 7 is named Quincy. As 
Quincy climbs the stairs during successive applications of 8, Ralph rides along on his shoulders. 
At last, Quincy arrives at the top (position 1) with Ralph aboard. Then Quincy takes the magic 
Escher step which brings him back to the bottom of the stairs (position k) and in the process 
chucks Ralph out into the waiting room (position k + 1). With the next application of B, Quincy 
moves up one step (to position k — 1) and Ralph hops into the position just behind Quincy. If 
this place in line is empty, we are in @‘“). If not, then Ralph is now aboard the 1 behind Quincy 
and goes for another ride. QED 

From the proof it is easy to construct sequences which take a long time to stabilize. Let 


Q=k+1,k—-1,k —2,...,3,2,0,0. 
Q is B(k + 1) with k and 1 deleted and so 
HOI = tear ~K-L St. 
O = B(k — 1) + A+ 8&(1), where A = (1,1,...,1,0). Thus, B* will have to be iterated k — 1 
times before the incoming 1 reaches the unoccupied position at the end, i.e., Q requires k(k — 1) 


iterations of 6 to reach B(k). Gardner mentions the conjecture that k(k — 1) is the maximum 
number of iterations needed to reach B(k) from any initial P in ~,,. The recursive estimates 
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yielded by the proof above are much cruder because of the iterates needed to reach R first. 
However, it is reported that the conjecture has been proved by Igusa. 
The rank order sequence for the layout with residual sequence Q is given by: 


S=k—-—1,k—-1,k—-—2,k —3,...,3,2,1,1. 
So S requires k(k — 1) iterations of a to reach B(k). 


The Equilibrium Theorem follows from Theorem 5 by observing that if r = ¢,, thnY.N @= 
{ B(k)}. The only cycling element at level ¢, is the equilibrium B(k). 

If t,_,<r<t,, then wecan writer=¢t,_, +uwithO<u<t, —t,_, =k. By using the A 
representation we see that the cardinality of YO @, 1.e., the number of cycling elements of size r, 
is given by the binomial coefficient | “ ) counting the number of ways of choosing a subset of size 
u from a set of size k. The subset is the 1-positions on the staircase. 

More interesting is the question of the number of disjoint cycles. For example, if r = ¢,_, + 1, 
L. 1 € consists of k elements corresponding to A = (1,0,..., 0), (0,1, 0,...,0),...,(0,0,..., 0,1). 
But these all lie on a single cycle of period k. Similarly, if r = ¢, — 1, & M @ consists of a single 
cycle of period k. However, if ¢t,_, + 1<yr<t, —1,1e,1< u< k — 1 (which requires k > 4), 
then there is always more than one cycle. For example, if k = 4 and u = 2, then A = (1,0,1,0) 


~ 


and A = (1,1,0,0) correspond to two elements of ¥% M @ generating the two different cycles: 
QO = 4,2,2 (S = 3,3,1,1) 


and 


~ 


Q=4,3,1 (S =3,2,2,1). 
We now give a proof of Brandt’s formula for the number of disjoint cycles. Some familiarity 


with elementary number theory is needed for this sidetrip. 


THEOREM 6. Let r=t,_, + u with O<u<k so that t,_,<7r<t,. The number of disjoint 


cycles in &. is given by 
1 k/d 
7; > ay 04), 
k da. k) u/d 


where (u,k) is the greatest common divisor of u and k and $(d) is the Euler $-function, i.e., the 
number of positive integers at most d and relatively prime to d. 


Proof. lf P= B(k-1)+AE¥YO G™, then B*(P)=P and so the period of P= 
minimum{d > 0: B4¢(P) = P} is a divisor of k. Now let d|k and see what it means for 
B4(P) = P, or equivalently 84(A) = A. If A = {i: A, = 1} is the set of positions occupied by 1’s 
on the stair, then after d steps the list of 0’s and 1’s on the stair must look the same as the original 
list. This means that A is the union of k/d copies of A M {1,...,d} each moved d steps more to 
the right. Because wu is the cardinality of A this requires (k/d)|u, i.e., k|du. Conversely, if d|k 
and k|du, then such subsets exist and are determined by an arbitrary initial choice of AM 
{1,...,d} as a subset of size du/k. So if we define 


F(d) = the cardinality of {PP €¥,n @: B4(P) = P}, 
then we have proved the formula: 


d 
(26) F(d) = Lain) if d|k and k|du, 


0, otherwise. 
On the other hand, if we define 


f(s) = the cardinality of {P € YN @: period of P = 5}, 
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then B“(P) = P = period P|d shows 


(27) F(d) = Lf(s). 


s|d 


Finally, if N is the number of disjoint cycles, then because each cycle of length s contains s 
elements of period s, 


(28) N= Di f(s)/s. 


S|k 
Equation (27) implies that F and f are related by the Mobius inversion formula (see Niven 
and Zuckerman [4, p. 87]): 
f(s) = La(s/d) F(a). 


d\s 


Substituting in (28) we get: 


N=) Dim(s/d)(1/s) F(a) 


s|k d|s 


= [Xf w(s/a)(/s): djs and s|k}| F(d). 
d|k 
Let z = s/d; then s|k if and only if z\(k/d). Also, 1/s = (k/d)/z X (1/k). Hence, the sum 
in brackets is 
1 
Gd mz) k/d)/z. 
z\(k/d) 
By the Mobius inversion formula again, this is ¢(k/d)/k (see Niven and Zuckerman [4, p. 
88]). Letting d’ = k/d so that d = k/d’, we can substitute back into the equation above for N to 
get: 


1 
N= = L 6(d’) F(k/d’). 
d\k 
Now F(d) = F(k/d’) is zero unless k|du, i.e., d’|u and in the latter case F(d) = ( ve): So the 
nonzero terms occur when d’|k and d@’|u, 1.¢., d’|(u, k). QED 


REMARK. If uw and k are relatively prime (e.g., if & 1s prime), then d|(u, k) only when d = 1 
and the formula reduces to ( /k as expected since all cycles have period equal to k. 


As our story nears its end we feel the urge to leave some problems for the reader. So we 
introduce a new game called Austrian solitaire. It proceeds as follows: 

Start by laying out stacks from a deck, but now all stacks have size < L, where L is some fixed 
whole number. One special stack is reserved on the side. It is called the bank. 

A round of play consists of two steps. First, remove one card from each ordinary stack and put 
it in the bank. Now from the bank lay out new stacks of size exactly equal to L, continuing until 
the size of the bank is < L (including the possibility of exhausting the bank). 

As with Bulgarian solitaire, the deck size is preserved and the problem 1s to describe the cycles. 
In particular, prove or disprove the following: 


CONJECTURE. For any fixed deck size and fixed L there is a unique cycle. 


We have only limited results which we would be happy to share with any reader who wants to 
play. 
The idea for the game arose when we had returned to our usual pursuits and were reading a 
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discussion of the so-called Austrian school of capital theory. The economic interpretation shows 
that anything can become applied mathematics and also accounts for the name. 

Think of the ordinary stacks as machines. Each machine has, when new, a life of exactly L 
years. The size of a stack is the number of productive years left for a particular machine. Each 
year it ages one year (and so one card is removed from the stack). For each machine on line the 
company deposits 1/L of its cost into the bank as a sinking fund. Then it buys as many new 
machines as it can afford, and the remaining funds are left in the bank until next year. 

Once these problems are solved the reader will be able to say, when asked about business 
cycles, “Why, it’s all in the cards!” 
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SOME EXAMPLES OF COMBINATORIAL AVERAGING* 


HERBERT S. WILF 
Department of Mathematics, University of Pennsylvania, Philadelphia, PA 19104 


Many important combinatorial problems, including some very hard ones, concern the de- 
termination of the extreme values of functions that are defined on graphs, sets, algorithms, etc. In 
some of these cases it turns out to be useful to try to learn more about the distribution of values of 
the functions that are involved. The maximum value, for instance, is one aspect of that 
distribution, but so are the average, the mean square, etc. Therefore one reason for studying 
combinatorial averaging is to shed some light on optimization problems. 

Another motivation for studying discrete averages comes from the analysis of algorithms. In 
some computational problems the most natural measure of the running time might be the average 
time, averaged over a suitable collection of possible inputs, just as in many other computational 
problems we are interested more in the ‘worst case’, i.e., the maximum possible running time for 
the given set of inputs. In such situations the study of averages would be an end in itself rather 
than an aid to the solution of an extremum problem. 

Whatever the reasons might be for looking at combinatorial averaging, the problems present 
great variety and beauty, along with some possibly very surprising outcomes. In this article I'd like 
to present a small sample of the varied fare of the subject. 

My thanks go to Drs. E. Bender and D. E. Knuth for a number of helpful suggestions. 


1. A warmup: triangles in graphs. In this section we will do quite an easy averaging problem on 
graphs. We will do it in two different ways, to illustrate the slightly different approaches of the 
combinatorial model and the independent edge appearance model of probabilistic graph theory. 


n 


Consider a graph G of n vertices, the vertices being labelled {1,2,...,}. There are | 2) such 


graphs, since each of the ( ") possible pairs of vertices might or might not be joined by an edge of 
the graph. 

For each such graph G, let T(G) be its number of triangles, 1.e., the number of triples 7, 7, k of 
vertices such that (i, /),(i, k),(j, k) are all edges of G. 


*Presented, by invitation, at the Annual Meeting of the Association in Louisville, Kentucky, January, 1984. 
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n 


The maximum of 7T(G), over all graphs G of n vertices, is clearly ( 3 ); the minimum is 0. What 
is the average number of triangles in graphs of n vertices? 

First, the combinatorial model. We look at the number of triangles in each graph, add them up 
over all graphs, and divide by the number of graphs, to get 


(1) T(n) = oy ET(6) 
22) & 


n 
in which the sum is over all (2) graphs G of n vertices. 

(We have just carried out the first step in a rather general three step recipe for computing the 
average number of substructures within a structure, which is to express that average as a suitable sum 
over the structures.) 

Now we are going to introduce yet another summation sign, involving a function whose values 


n 


are only 0’s and 1’s. T(G), in this example, is the sum, over all ( ) triples of vertices, of a function 
whose value is 1 if a triangle of G lives at that triple and 0 otherwise. Hence define x(i, j,k; G) = 1 
if (i, 7), (J, k), (i, k) are all edges of G, x = 0 otherwise. Then we can express T(G) as 


(2) T(G)= 2 x. Jj,ksG), 


i,j,k=1 


and so from (1), 
(3) T(n) =2 ()s, xi, j,k G). 
G i,j,k 

(Now we’ve done the second step, in such problems, in which we look for an ‘indicator function’ 
x, whose values are just 0’s and 1’s, that allows us to express the average as a double sum of 0's and 
1’s.) 

This is the time to interchange the summation signs, which gives 
“() - 
(4) T(n) = 2 » (Ux. jG}, 

i,j,k’ G 

and we now face quite a different kind of sum inside the curly braces. There we find, for a fixed 
triple 7, 7, k, the number of graphs G that have a triangle at i, j,k. 


Well then, how many graphs G have a triangle at vertices 1, 2,3? If there is a triangle there then 
3 of the (3) possible edges of the graph G have been determined: they exist. There remain just 


n 3 
— 3 edges of G that might or might not exist, and so there are exactly 3 2) such graphs G. 
(And that was the third step: to interchange the order of the summations, and to try, as above, to 
deal with the innermost sum.) n 


nN 


2 


~3 
For ea¢h of the (3) triples i, j,k, then, the inner sum in (4) has the value 2 ; , and we find 
at once that an average graph of n vertices has 


(5) T(n) = =(3] 
triangles. 

The answer (5) for the average number of triangles just begs for a ‘one line proof’, because on 
probabilistic grounds it is obvious, and that brings us, as promised, to the independent edge 
appearance model, pioneered by Erdés and Rényi [12]. In this model we have a fixed edge 
probability p, 0 < p < 1. An event in our probability space is obtained by tossing a (biassed) coin 

") times, once for each possible edge in a graph of n vertices, where the two possible outcomes 
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of each toss are called ‘edge’ and ‘no edge’, respectively, and in which the probability of ‘edge’ is 
p. Whenever the outcome is ‘edge’ we draw an edge, else we do not, and this gives us the graph G 
that was sampled by the repeated tosses of the loaded coin. 

Now consider the number of triangles. If i <j < k are fixed vertices, the probability that all 
three edges (i, j),(i, k),(j, k) occur is p*, and immediately we see that the average number of 
triangles is p° 5 , in agreement with (5) when p = 4. The independent edge appearance model is 
usually easier to handle, as this example shows. There may be small differences in the answers that 
we get with the two models. To do the combinatorial model with ‘edge probability p’ we would 


sum over just those graphs of n vertices that have exactly | "| edges, instead of summing over 
all 3 2) of them. The average number of triangles will be slightly different from p*(;) (work it 
out!), but the differences will not be important for most applications. 


EXERCISE 1. How many complete graphs of k vertices, on the average, does a graph of n 
vertices (n > k) contain? (Use either model.) 


The familiar inequality of Tschebycheff, from probability theory (e.g., [14], p. 233) gives 
information about a full probability distribution from information about just the first two 
moments of that distribution. More precisely, if we know the mean p and the standard deviation 
o of the distribution of a certain random variable x, then the inequality tells us that x is unlikely 
to be found very far away from its mean value, in fact that for every positive f¢, 


1 
(6) Prob{ |x — p| > to} < >. 
t 
In view of Tschebycheff’s inequality, there is a special value in finding the first two moments of 
an unknown distribution, since we then get at least some information about the shape of the 
whole curve. If you would like to see this principle in action, then try 


EXERCISE 2. What is the standard deviation of the number of triangles in graphs of n vertices? 
What does Tschebycheff’s inequality tell us about the probability that a graph has a large number 
of triangles? (Use either model.) 


2. The fixed points of permutations. Here’s another example of the same technique. If n is 
fixed, what is the average number of fixed points that permutations of n letters have, and what is 
the standard deviation of that number? Let S, be the set of all permutations of n letters, and let 
o € S,. By f(o) we mean the number of fixed points of o, and let f(n) be the average that we 
seek. Then (step 1) 


- 1 
(7) A(n) == Y flo). 
‘ oES, 
Now (this is step 2 of the recipe) we introduce the indicator function x(i; 0) = lif o(i) =i; = 0 


otherwise, where 1 < i < n. Then (7) expands, just as in (3), this time to 


fn) = 4, OE xliso). 


o€ES, 1=1 


Once again (here comes step 3) we invoke the commutative law of addition to interchange the 
summation signs and find that 


_ 1 J . 
(8) f(n) =~ | » x(is0)} 
“1=1 \oeS, 
The inner sum is the number of permutations that have fixed point i, and that is clearly (n — 1)! 
Hence 
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(9) f(n) = 5p D (nD 


Since you probably already knew the answer (9) that average permutations have 1 fixed point, 
let’s try the second moment of the distribution, 1.e., the mean square of the number of fixed 
points. This is 


(10) A(n)=— ¥ flo)’. 


' ses, 


Again the strategy will be to boil the sum down to a ‘0 or 1’ summand, interchange the order of 
summations, and hope. We find that 


(11) ain) 2 E [Extiey}. 


The square of the sum is handled by writing down the sum twice, using two different indices of 
summation. In other words we use the fact that 


and then (11) becomes 


(12) any=+ 5 LY LY xliio)x(io). 


“oES, i=l j=l 


n 


If we interchange the summations in (12), 


(13) Q(n) = - x | » x(a) x(Js0)}, 


i,js=l \‘oeES, 


What sort of creature now inhabits the curly braces? For a fixed pair 7, 7 we are counting the 
permutations that fix bothi and j. The number of these is (m — 1)! if i = j and (n — 2)! if i ¥ /. 
Therefore, if n > 1 we have 


(14) Q(n) 


i,s=l 
ix] 


H{E@-0o r my 


= 2. 
EXERCISE 3. The mean is 1; the mean square is 2; what is the mean cube? Is it an integer? 


The standard deviation of the number of fixed points is therefore 1. By Tschebycheff’s 
inequality (6), the probability that a permutation has more than ¢ + 1 fixed points is less than 
1/t?. 

3. In search of the biggest determinant. If M and n are fixed, and if we know that no entry of 
a certain ” X n matrix A exceeds M, in absolute value, then how big can |det A| be? 

This is the famous (and unsolved) determinant problem of Hadamard. We might as well 
standardize things by taking M = 1. Then it’s easy to check that the maximum of |det A| is 
attained on some matrix whose entries are all + 1’s. Since there are a mere 2” such matrices, the 
problem is now finite. Since 2” isa very healthy number when n is only modestly large, an 
exhaustive search-by-computer for the maximum is already out of the question for n > 6, or 
thereabouts. 
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Hadamard observed that since the absolute value of the determinant measures the volume of 
the parallelepiped spanned by the row vectors, and since, if the lengths of the row vectors are 
fixed (Vn in this case) that volume is largest when the row vectors are pairwise orthogonal, we 
have 


THE HADAMARD DETERMINANT THEOREM. The absolute value of the determinant of a matrix is 
at most equal to the product of the lengths of its row vectors. It is exactly that large if and only if the 
matrix is orthogonal. 


In the present case, it follows first that the value of the determinant of an n X n matrix of 
+1’s is at most n”/, and second, that it can be that large if and only if there is such a matrix 
whose rows are pairwise orthogonal. An n X n orthogonal matrix of +1’s is called an Hadamard 
matrix. It is easy to see that n must be a multiple of 4 if an Hadamard matrix of size n exists, but 
it is not known which multiples of 4 will work, though a good deal of research has been done on 
the question. 

Since the maximum determinant is so hard to corner, what about some averages instead? The 
average determinant of all +1 matrices of size n is obviously 0, so that won’t be of much help. 
The mean square looks more promising, and it is. 

The following amazing averaging calculation was shown to me by Mark Kac, many years ago. 
It is amazing to me because the answer turns out to be so simple, just one line after the whole 
thing starts to look hopelessly complicated. 

We let D2 denote the average of the squares of the determinants of all n X n matrices (all 2”” 
of them) whose entries are +1’s. Then directly from the definition of the determinant we have 


* Y(det 4)" 
Q” A 


2 
2" Pf T sign(n) a4, mo Gy, , 


on 
A 


D2 


n 


in which the outer sum is over all matrices of this kind, and the inner sum is over the n! 
permutations (i,,...,i,) of n letters. 

Just after equation (11) I said that when the square of a sum is encountered, the thing to do is 
to write out the sum twice, using two different names for the summation variable. If we (swallow 
hard and) follow that advice here, we find that 


(15) D; = a-"y » d sign(1)sign(p){ 4),,,41, , 42,1,42, ,, oc An. An, }s 
A @ Oo 


where 7 and p vary independently over the permutations of n letters. After interchanging the 
order of the summations in (15) we get 


(16) D? = 2-"> E sign(m)sign(o){ Fay, y 4,242, vee anim.) 
A 


7 


Things are now as dark as they will ever get, and dawn is rapidly approaching: Consider the 
sum in the inner curly brace. In that sum we have a fixed pair 7,p of permutations, and we are 
summing over all of the +1 matrices. That means that ‘D ,’ really stands for n*? summation signs, 
each one over a single summation variable, in which that variable takes just two values: +1 and 
—1. If n = 2, for instance, the inner sum looks like this 


(17) yD a, 9 92,1582, 0: 
aq ,=+la,,=+1l a,,=4+1 a.,=4+1 


This unpleasant looking sum has one saving grace: it loves to be 0. Suppose, for instance, that 
i, # j,; say , = 1, 7, = 2. Then the sum (17) is 0 because it has as a factor 
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(18) ~~ 4, =09. 


a ,= +1 


The only way that (17) can fail to vanish is if i, = j, and i, = jy. 
Likewise, in (16), the inner sum vanishes unless 7 = p, and suddenly everything is easy. We 
have 


(19) Dy = 2-"'5" { sign( m)}°2” 


=n!, 


The mean square determinant of all of these matrices is n!. Their root-mean-square is D, = yn!, 
and when # is large this is 


/2 
1/4n" 
(20) ~ (2mn) 7% 
by Stirling’s formula. We observe that the maximum possible determinant, by Hadamard’s 
theorem, is n”/?, and the root-mean-square isn’t much lower than that. 
Since the maximum cannot be less than the root-mean-square, we have proved the following 


THEOREM. There exists ann X n matrix of +1°s whose determinant is greater than yn! . 


It is characteristic of averaging methods that while we learn that such a matrix exists, we 
haven’t a clue as to how to construct one efficiently. 
Those who enjoy problems about permutations ( permutophiles?) might like to try 


EXERCISE 4. Show that the root-mean-square value of the determinants of all n X n matrices 
of 0’s and 1’s is (n + 1)! /4”. Show that the average of these determinants is 0, without doing any 
sums! 


4. The complexity of computations. The averaging methods that we have described can be used 
to determine the average complexities of certain algorithms. We will illustrate this with the 
problem of finding the maximum independent set of a graph. In this problem, a graph G is given, 
and we are looking for a set S of vertices of G, as large a set as possible, with the property that no 
two vertices of S are joined by an edge of G. 

If this problem seems simple, it isn’t. It is a member of the notorious clan of ‘NP-complete’ 
problems, which for the present purposes means that the question is seemingly, but not at present 
provably, intractable from a computational point of view. That is, all known methods for solving 
it computationally require an amount of time that is exponentially large as a function of the 
length of the string of input data. 

Anyway, let’s analyze the average time complexity of the most obvious method of doing this 
problem: exhaustive search. In that method we simply look at each and every independent set of 
vertices in the graph, and find the largest one. It is easy to march from one independent set to its 
successor (how?) quickly, so the amount of labor involved in this procedure is fairly measured by 
the number of independent sets in the graph. 

Now, in the worst case, a graph of n vertices can have quite a few independent sets, namely as 
many as 2” of them (which graph has that many?). Our question, though, is not about the worst 
case, but the average case: on the average, how many independent sets does a graph of n vertices 
have? 

Let I(G) be the number of independent sets of vertices in a graph G, and let J, be the average 
of I(G) over all graphs G of n vertices. We use the independent edge appearance model to find 


I. If T is a fixed set of exactly ¢ vertices, then T will be an independent set if and only if all ( ‘] 


of the possible edges that might join pairs in T in fact fail to materialize when the ‘coins are 
tossed.’ If the edge probability is p, then the probability that T turns out to be an independent set 
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t 
is evidently (1 — » :). Hence the average number of independent sets of vertices in graphs of n 
vertices 1S 


(21) I, = y (7a _ pyle), 


since exactly ("} subsets T of [1,...,] have cardinality ¢. 

How bad is the news in (21)? That is, is (21) a large amount of work for an algorithm to do, on 
the average, or not so large? 

First let’s find out which of the n terms in the sum (21) is the biggest one. Then n times that 
one term will be an upper estimate for the whole sum, and therefore for the average labor involved 
in the algorithm that we are analyzing. To keep things simple we choose p = 7. 

We observe that the terms in (21) increase in size for a while (1.e., for small ¢) and then they 
decrease. To find the largest term we therefore take the ratio of the rth term to the (¢ — 1)st, set 
the ratio equal to 1 and solve for ¢. After a bit of manipulation, this ratio of consecutive terms 
works out to 


(n—t+1) 
12°7} ; 


The ratio is therefore equal to 1 when 42'~' = n — t + 1. Clearly this value of ¢ is not very far 
from log,n, and if t,, denotes the value of ¢ in question, then it is easy to check that 


After another brief estimation, this time an exercise in Stirling’s formula, the size of the largest 
term is seen to be 


(22) < exp ( 5 + e)(logan)°} 


Hence the average amount of computation, being at most times the quantity shown in (22), also 
satisfies (22). 

This amount of labor, in a computation, grows more rapidly than any polynomial in n, but less 
rapidly than an exponential function c” (c > 1). It is therefore referred to as being of moderately 
exponential growth. To summarize, then, the worst case for this algorithm clearly involves an 
exponentially-in-n growing amount of computational labor, but the average is a bit tamer. 


EXERCISE 5. Use the independent edge appearance model with edge probability p to find a 
formula, corresponding to (21), for the average number of maximal independent sets in graphs of 
n vertices (an independent set is maximal if it is not a proper subset of an independent set). 


RESEARCH PROBLEM. Find the asymptotic behavior, for n — oo, of the number of independent 
sets and of the number of maximal independent sets of vertices in graphs of vertices. Do this for 
edge probabilities throughout the range from p ~ C/n to p ~ C. 


5. Two surprises. The example that we worked out in the previous section, of the average 
number of independent sets in graphs, illustrates a technique and an approach to complexity 
analysis of algorithms, but there are other applications of the technique that give more surprising 
results. We describe two such recent results in this section, without proofs. 

Consider the question of deciding if the vertices of a given graph G can be properly colored in 
3 colors (i.e., so that adjacent vertices always have different colors). This is the graph 3-coloring 
problem, and it is NP-complete. 

Suppose we try to find a 3-coloring by backtracking: 


(a) Assign a color to vertex 1. 
(b) In general, if vertices 1,2,...,4 have been properly colored, then assign a color to vertex 
k + 1 that doesn’t conflict with the colors that have already been assigned to lower-num- 
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bered vertices (if such a color exists), increase k by 1, and go to (b); if no such color exists 
then go to (c). 

(c) Recolor vertex k in the next free color for that vertex (if one exists), and go to (b); else, if 
there are no remaining free colors for vertex k, then reduce k by 1 and go to (c). 

(d) Halt if a proper coloring of G is found (success) or if the algorithm ever attempts to 
re-color vertex 1 (failure). 


Backtracking might take a very long time. For instance, if the four highest numbered vertices 
of G are all connected to each other by edges, and no other edges exist in G, then G cannot be 
3-colored, but the algorithm will take nearly 3” steps to find that out, if G has n vertices. 

How many steps on the average does backtracking take? If we average the backtracking time 
over all graphs on n vertices we find that the average is approximately (guess which one!): 


(a) 3” 

(b) eM" 

(c) cn? 

(d) 197 

(e) none of the above. 


The correct answer is 197 (see [9] for the proof and [10] for generalizations and refinements) 
and that fact, hopefully, is the first of the two surprises in this section. 

The ‘reason’ for this result is that for most graphs the answer is ‘no’ (i.e., G cannot be 
3-colored), and indeed there is a complete graph of 4 vertices in almost all graphs that usually 
stops the algorithm quickly, with a ‘no’ answer. 

The ‘reason’, however, isn’t in itself sufficient. In another important computational problem, 
the graph isomorphism problem, most pairs of random graphs are not isomorphic, and almost 
always we can find that out quickly. However, the few pairs that are isomorphic generate huge 
computations, and so the averages still turn out to be exponentially large, at least for some of the 
algorithms that have been proposed, despite the heavy preponderance of ‘no’ answers. 

The second surprise comes from the bin-packing problem. In that problem we are given n real 
numbers €,,...,§,, where0 < € <1 (i=1,n). We are asked if we can ‘pack’ these numbers 
into a given number of ‘bins’ of unit length (a set of numbers is packed into a unit bin if the sum 
of those numbers is at most 1). The problem is NP-complete. 

One well-known algorithm for bin-packing is called ‘First-Fit-Decreasing’ (FFD). In FFD we 
first sort the given €,,...,§, into nonincreasing order of size. Then, for each € in turn, taken in 
the sorted order, we do ‘put € into the first bin that it fits into.’ 

This algorithm, of course, doesn’t always find the optimum packing, 1.e., the packing into the 
smallest number of bins. In the aptly titled [13], the average behavior of FFD was studied, as 
follows. Fix a number u, 0 < u < 4. Suppose that £,,...,&, are chosen independently uniformly 
at random from [0, uv], and then suppose that FFD is used to pack them into bins. A result of [13] 
is that there is a constant C > 0 such that the average excess of bins used by FFD over the 
minimum possible number of bins is < C, averaged over all inputs €,,...,&, and for all n. There 
is experimental evidence that suggests that FFD gives optimal packing about 75 percent of the 
time, but this has not been proved. This result is more striking because there is no feeling that 
‘cheating’ is going on because most answers are ‘no’, or whatever. 


6. The hierarchy of average powers. We have already referred to the fact that the maximum of 
a finite set of positive real numbers can never be less than the root-mean-square value of the same 
set of numbers. This is a special case of a much more general pecking order that holds between 
various kinds of means of the same set of numbers. In this section we will state and prove a rather 
general form of this result, and in the next section there will be a combinatorial application of the 
idea, in a somewhat surprising way. The ‘pecking order’ theory is quite elementary, but somehow 
seems to be not as widely known as it deserves to be. 
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Let a,,...,a, be a fixed set of positive reals. If r is a real number, then by the rth power mean 
of the a’s we mean the number 


(23) u(r) = { 


The power means have a number of remarkable and useful properties. First, consider a few 
special values of r. The case r = 1 is perhaps the most obvious: (1) is the well-known arithmetic 
mean of the given numbers. Another easy special value is r = 2. Inspection of (23) shows that 
(2) is exactly the root-mean-square (rms) of the given set of a’s. 

Slightly less obvious is the value ‘r = oo’, or, more precisely, the value of lim,_, u(r). We 
leave it as an exercise for the reader to verify that 


ajta,t--: tay" 
nl a 


(co) = max a,. 
l<i¢cn 


That w(— oo) = mina is then unsurprising, and the special value 


(G 1 | 
— +— + 66-5 + — 
ay ay a, 


is the familiar harmonic mean of the given set of a’s. The best one has been saved for last, though. 
What is (0) (the limit, that is)? The answer is in 


EXERCISE 6. Prove that (0) is the geometric mean of the a’s, Le., 


p(0) = (aa, °° a, yo". 

The exercise above requires only a small application of lHospital’s rule, but it is highly 
recommended to those who haven’t tried it. 

Let’s use special names for the six special kinds of means of a given set of positive numbers 
that we have just discussed. We will write, respectively, Y(a), #(a), G(a), W(a), B(a) and 
Wa) for the minimum, the harmonic mean, the geometric mean, the arithmetic mean, the root 
mean-square, and the maximum, respectively, of a set a of positive numbers. 

The fact that the power mean function p(r) takes all of these interesting values, at r= 
— oo, —1,0,1,2, +00, lends a special impact to the main theorem of the subject, which is 


THE POWER-MEAN THEOREM. If a,,...,a, is a set of positive real numbers, then p(r) is a 
strictly increasing function of the real variable r unless a, = a, = :+- =a,, in which case wp is 
constant for all real r. 


n? 


This theorem shows that for every nonconstant set of a we have, in particular, 
L (a) <#(a) < (a) <H(a) < Ba) < Ua). 


The guaranteed lineup of the various kinds of averages shows that, for instance, if we want to 
estimate the maximum, and that’s too hard, we might try the root-mean-square next, because it is 
‘closest’ to the maximum, then the arithmetic mean, etc. 

The other feature of the theorem that merits comment is the strong grip that we have on the 
case of equality: if there exist r, and r,, such that r, # r, and p(r,) = (1), then we can be sure 
that all n of the given set a,,...,a,, of numbers are equal to each other. This strong conclusion that 
follows from the equality of any two of the power means has a number of applications, one of 
which we will see in Section 6 below. 

Now, as to the proof of the power-mean theorem, we begin with 


LEMMA 1. Let $(t) be twice differentiable, with $’(t) > 0, and let w,,...,w, be nonnegative 
numbers whose sum is 1. Then 
(24) b(t, + Wl tones +W,b, S wio(t,) + w2( ty) torte +wo(t,) 


and the sign of equality holds if and only if the t, are all equal. 
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Proof. If we put A = w,t, + wt, + --: +w,t, then Vi = 1,n, there exists a €, such that 


6(1,) = 0(4) +(, - A) 6(4) + U4) ore) 


If we multiply by w, and sum over i the result follows immediately. 


If we set $(t) = —logt and replace the ¢, by 1/f, in (24), we get 


(25) w,logt, + ++: +w,logt, > — log} fo... + 
1 n 


in which the strict inequality holds if and only if the ¢, are not all equal. 
To prove the theorem we differentiate the power mean function p(r), directly from its 
definition (23), and then write 


a’ 
26 = —_+—_—_ 1=1,n). 
(26) Meg (inn) 
The result is that 
w’(Cr . . aj+--- +a’ 
(27) pe = w,logaj + --- +w,logay — lop{ 1) 


Since )'w, = 1, the inequality in (25) yields at once the fact that the right member of (27) is strictly 
positive unless all of the a are equal, completing the proof. & 


For a fixed set of a,,...,a, a typical graph of the function p(r) vs. r is as shown in Fig. 1. 


— a ne i oe ee ee —=—_ a — Oe ee eee eee ee ee 


Fic. 1 


7. Cyclic designs. In this section we will give a small application of the power-sum theorem 
that was just proved. The theorem will be applied in the following way (which may seem 
suspicious because we will get ” pieces of information in exchange for just two): 


(1) We will want to show that a certain collection of » numbers are all the same. 

(2) We will compute their geometric mean Y. 

(3) We will compute their root-mean-square &. 

(4) We will note that Y= &, and therefore we will conclude that all n of the numbers are 
equal. 


The combinatorial question is this: for which n does there exist an n X n cyclic orthogonal 
matrix of +1’s? We are therefore looking for an Hadamard matrix, as described in Section 3 
above, with the additional restriction that the matrix must be cyclic. 
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We have first the following well-known evaluation of a general cyclic determinant: 


ao ay ay} 
ay,-] ao ee a,—2 

(28) det} = fe) 
Ga + 


in which f is the polynomial whose coefficients appear in the first row of the given matrix, L.e., 
f(t) =a, + att+ajt?+---+a,_ 0771. 

The proof of (28) is easy. Let A(a), @,,..., a, ,) be the matrix that appears on the left, and 
write C = A(0,1,0,...,0). The eigenvalues of C are the nth roots of unity w, and since 
A(ao,4,,---,4,-1) = f(C), the numbers f(w) are the eigenvalues of A, and their product is the 
determinant. 


What we are going to prove next is the following 


THEOREM. There exists ann X n cyclic Hadamard matrix if and only if there is a polynomial f, of 
degree n — 1, whose coefficients are all +1’s, and whose values at the nth roots of unity are all 
equal, in absolute value. 


As advertised above, let us first compute the geometric mean of the n numbers 


(29) If(w)|  (@" = 1). 


Since the matrix is orthogonal, its determinant is, in absolute value, equal to the product of the 
lengths of its row vectors, i.e., to n”/?, since each row has length vn. From the determinant 
formula (28), if we take the absolute value, and then the nth root of both sides, we find that the 
geometric mean of the n numbers shown in (29) is vn. 

We claim that the root-mean-square value of these same n numbers is also Vn. Indeed, the reader 
will have no difficulty verifying that this is true for every polynomial f whose coefficients are 
+1’s. 

Since the reverse implication is clear, this concludes the proof of the theorem. We remark that 
the proof showed that the ‘constant’ value of the numbers in (29) is none other than Vn. Since the 
integer | f(1)| is among the values in (29), it too must be Yn and we have the 


COROLLARY. [If there is a cyclic n X n Hadamard matrix, then n is a perfect square. 


8. Suggestions for further reading. A number of properties of ‘average trees’ are in the lovely 
book [1] of Moon, such as the average number of leaves, the average and second moment of the 
number of spanning trees of a graph, the average valence of the vertex of largest valence, etc. 

The book of Comtet [2] is highly recommended as a general reference in combinatorics, and his 
Chapter 7 contains a number of averaging problems and solutions. 

I have borrowed the treatment of power sums from the classic work of Polya-Szego [3], in 
which Chapter 2 of Section 2 (‘inequalities’) is a rich source of information about different kinds 
of averages. 

There is a proof of the Hadamard determinant theorem in Hardy, Littlewood, and Polya [4]. 
An introduction to the combinatorial problems surrounding that theorem is in Ryser [5], whereas 
an advanced and complete-as-of-its-date account of the theory of Hadamard matrices can be 
found in [6]. 

Paul Erdés has made many important contributions to probabilistic combinatorics and graph 
theory. A fine survey of these is in Erdés and Spencer [7]. The three volumes of Knuth [8] are an 
excellent source of probabilistic combinatorics and algorithmic applications. 

Another averaging result.of unusual interest is that of Grimmett and McDiarmid [11] on the 
chromatic number of a graph. 
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This paper is a sequel to a paper, “Machiavelli and the Gale-Shapley Algorithm,” written by 
Dubins and Freedman [1] in 1981. That paper was, in turn, a sequel to “College Admissions and 
the Stability of Marriage,’ by Gale and Shapley [2], 1962. For the benefit of readers who may 
have missed the previous installments, a brief recapitulation is given in the following section. 


1. The Story So Far. Paper [2] above was concerned with a situation in which there are two 
sets of “agents’’, such as students and universities or workers and employees, or women and men. 
For the sake of concreteness, we will describe the problem in terms of this last group. It is 
assumed that each man and woman has prepared a list (possibly empty) containing the names of 
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those people of the opposite sex that he or she would accept as a marriage partner and that the 
names appear on the list in order of preference. The lists are then submitted to a matchmaker who 
has the job of finding a suitable pairing of the participants. If we think of an unmatched person as 
being matched with him or herself, then the only requirement for the matching is that it should be 
free of instability; that is, there should not be any man m and woman w who are not matched 
with each other but prefer each other to their mates. The matchings are thus, “divorce-proof”. The 
main result of [2] is an existence theorem which shows that for any sets M and W of men and 
women and any preference pattern, there is always at least one stable matching. The proof is 
constructive, giving an algorithm for finding the desired matching. 

Since we know that stable matchings exist, it is an instructive and rather easy exercise to show 
that the set of stable matchings forms a lattice in the following sense: if w and p’ are matchings, 
we write up = p’ if every man likes his p-mate at least as well as his p’-mate. It turns out that this 


M 
partial ordering of stable matchings is a lattice, as we will show shortly. Further, it is not hard to 
show that if w = p’, then pw’ = p, meaning that if all men prefer p to p’, then all women prefer p’ to 


uw (these observations are due to J. H. Conway and are contained in [4]). As a corollary, since 
every finite lattice has a largest and a smallest element, it follows that among all stable matchings 
there is one which is preferred to all others by the men and another by the women. These will be 
called the M(W)-optimal matchings. The algorithms of [2] always arrive at one of these extreme 
matchings. 

Some twenty years later, Dubins and Freedman considered the following question: suppose it 
is known that the matchmaker will impose the M-optimal stable matching and suppose some 
participants have full knowledge of the preference of all of the others. Will it ever be possible for 
an individual or group of individuals to obtain a preferred mate by falsifying preference lists? It 
was shown in [1] that there are cases in which a woman can do better by falsifying. On the other 
hand, the main result showed that no man or coalition of men can ever be made better off by 
falsifying their true preferences. The conclusion is that Machiavellian behavior on the part of the 
men is not profitable and they can do no better than to list their true preferences, regardless of 
what the women do. The purpose of the present paper is to give a detailed analysis of the strategic 
possibilities for the women. It will turn out that it will almost always pay for some of the women 
to be “Machiavellian”, and under certain plausible assumptions we will describe their best 
competitive behavior. 


2. Some History. The scenario described above involving marriages is, of course, rather 
fanciful, but there are genuine real world situations in which the matching problem comes up 
regularly. The problem of assigning students to colleges (which was the original motivation for 
these investigations) is an example. The difficulty here is that, as is so often the case, there are 
complicating factors such as the possibility of financial aid of various kinds. However, there is at 
least one situation where the methods of [2] are not only applicable but are actually being applied 
and have been for more than thirty years. The “National Residents Matching Program’’, 
headquarters in Evanston, Illinois, has the task each year of assigning graduates of medical 
schools throughout the country to hospital programs in which they are to fulfill a residency 
requirement: It turns out that the procedure used by NRMP is precisely the one described in [2], 
except that their procedure leads to the hospital-optimal rather than the student-optimal matching 
of [2]. The Dubins-Freedman Theorem shows that when the student optimal procedure is used the 
students can do no better than to list the hospitals they are interested in in their true order of 
preference. Under the NRMP system, on the other hand, the results presented here show that in 
general there will be some students who can get into a preferred hospital by suitably falsifying 
their preferences. 

It should be pointed out that the student-college problem is more general than the marriage 
problem in that the colleges are “ polygamous” and may admit any number of students up to some 
fixed quota g. Many of the properties of the marriage problem carry over to the polygamous case, 
including those to be presented here. However, not all results carry over to the more general case. 
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In particular, Roth has recently shown [6] by an example that even when the college optimal 
matching is used, as in the case of NRMP, it may still be possible for a college to get a better class 
by appropriate misrepresentation of preferences. 

We now return to our main subject. 


3. Preliminaries. In this section we present two propositions which will be needed later and 
which are of some interest in themselves. These are special cases of some results of [3]. The first is 
rather surprising. Recall that in general there may be many stable matchings for a given set of 
men and women. Our result asserts, however, that the people who are unmatched or, as we shall 
say, self-matched, are the same for all of these matchings. This is particularly striking in the 
context of the student-hospital situation. Suppose NRMP were to change its policy and impose 
the student rather than hospital-optimal matching. Of course this would make all the students at 
least as well off, but those students who were not accepted by any hospital would still not be 
accepted and, on the other hand, each hospital would end up admitting the same number of 
students, though in general not the same set, under the student-optimal as under the hospital-opti- 
mal scheme. This rather non-obvious fact turns out to be quite easy to prove. Our second 
proposition, by contrast, seems extremely plausible. It asserts that if additional women are added 
to the “pool’’, this can never make any of the men worse off. The proof of this, however, is not so 
straightforward. 

We will need a little terminology. A preference pattern will consist of a triple (M,W; P) where 
M and W are the men and women and P represents their preferences, that is, the ordered lists of 
the members of M U W. Because of our convention that an unmatched person is considered to be 
self-matched, we may define a matching p as a byection of M U W onto itself which is of order 2, 
that is, w- w = identity, and such that if m or w is not self-matched, then uw(m) € W and 
u(w) € M. We call w(m) (u(w)) the mate of m (w) under p. We say that a pair (m,w) blocks 
the matching p if m prefers w to p(m) and w prefers m to p(w). The matching p is then stable 
if it is not blocked by any pair. If ~ and p’ are matchings, we say that m (w) prefers p’ to p if he 
(she) prefers ’(m) to (m) (p(w) to p(w)). We write w = pw’ if every man is at least as well off 
under pw as under p’. A key result is now the following. 


LEMMA 1. Suppose W C W’ and yp is a stable matching for (M,W; P) and wp’ for (M,W’; P’) 
where P’ agrees with P on W. Let M, be all men who prefer w to w’ and let W,, be all women who 
prefer p’ to p. Then pw and w’ are bijections between M, and W,,. 


This is a sort of decomposition lemma represented schematically by the diagram below. 


g 
M, o MW, 
UL 
UL 
- / 
M-~ Mo W — W, 
UL 


It will suffice to show that 1(M,) Cc W, and p’(W,,) c M,, for since M, and W,, are finite 
and p and p’ are injective, they must be surjective as well. For m in M, we know p(m) # m 
since m prefers p(m) to p’(m); so let w = p(m). Then w © W and w ¥ w’(m). Further, w 
prefers p’(w) to m for if not (m,w) would block p’, so w © W,. A symmetric argument shows 
that p’(W,) C M,.0 


Our first result is an immediate consequence of the lemma. 


PROPOSITION 1. If ys and pw’ are stable matchings for (M,W), then the people who are 
self-matched are the same for both. 
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1 Suppose p’(m) = m but p(m) = w. Then m € M, but then, by the Lemma for the case 
W= W’,m & p’(W,), contradicting p’(m) = m.0 


LEMMA 2. Let and w’ be as in Lemma 1 and define p” to agree with » on M, U W,, and with 
pw’ on (M — M,) U(W’ — W,). Then p” is stable for (M,W’). 


O Clearly pw” restricted to M, U W, or (M— M,)U(W’ — W,) is stable, so if there is a 
blocking pair, we must have either me M,, w € W, or m€ M, and w & W,. Suppose w 
prefers m to p’’(w). 


Casel. m © M,,w © W/ — W,. Then m prefers (m) to p’(m) whom he likes at least as well 
as w, so (m,w) does not block p”. 


Case 2.m € M,, w © W,. Then m likes y’(m) at least as well as 4(m) whom he prefers to w, 
since p is Stable so, again, (m,w) does not block p’”’. 0 


The lattice property is a consequence of Lemma 2, for note that w” assigns each man his 
favorite of 4(m) and p’(m), so in usual lattice notation we write nw” = pw V p’. As stated in the 
introduction, it now follows that for any (M,W; P) there exist matchings p,, and p,, which are 
M-optimal and W-optimal, respectively. We now have 


PROPOSITION 2. With the hypothesis of Lemma 1 let py, be the M-optimal matching for 
(M,W; P) and let jw, be the M-optimal matching for (M,W’; P’). Then every man is at least as 
well off under pw, as under ty). 


O From Lemma 2, py, V wy, 1S Stable for (M, W’; P’) and 


M2e wee .O 
M WwW 


4. The Matching Game. Since the Dubins-Freedman Theorem shows that it is never advanta- 
geous for the men to falsify their preferences, we will assume from here on that the men always 
submit their true preference list. For the special case where there is only one stable matching, the 
Dubins-Freedman Theorem implies that honesty is also the best policy for the women. In all other 
cases, however, this is not so. 


THEOREM 1. If there is more than one stable matching, then there is at least one woman who will 
be better off by falsifying, assuming the others tell the truth. 


O By hypothesis py, # fy, so let w be any woman such that p(w) # wa,(w). Now let w 
falsify by removing from her preference list all men who rank below p,,,(w). Clearly the matching 
wy will still be stable under these preferences (there are now fewer possible blocking pairs). 
Letting wi, be the M-optimal matching for these new preferences, it follows from Proposition 1 
that w is not self-matched by 1, and hence she is matched with someone she likes at least as well 
aS fly (Ww), since all other men have been removed from her list, and she prefers p(w) to py, (wv) 
so she prefers p14, to py,. 0 


Theorenr 1 shows that the policy of honest revelation of preference is “unstable” for the 
women in a different sense of instability, namely, if each woman expects the others to be honest, it 
will in general be possible for at least one woman to improve her position by lying. This leads one 
to ask the following question: if honesty is not the best policy, is there any set of policies or 
“strategies” which have the property that once they are adopted by the women there will be no 
advantage to any one woman in changing her strategy? It will be useful here to introduce some 
standard game theoretic-terminology and we will henceforth refer to our model as the matching 
game. We are dealing here with perhaps the most important concept in game theory, that of an 
equilibrium point. An abstract game consists of a set of players each of whom has a certain set of 
strategies. In a play of the game each player selects one of her strategies and the “rules of the 
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game” then assign her some payoff which may be a number (“‘score’’) or, as in our case, a mate. A 
set of strategies, one for each player, forms an equilibrium point if no player can, by changing her 
strategy, achieve a better payoff, assuming the other players do not change theirs. The strategies 
form a strong equilibrium point if no subset of players by changing its strategies can achieve a 
better payoff for all of its members. Are there any equilibrium point strategies for the women in 
the matching game? The next two theorems answer this question. 


THEOREM 2. Let p be any stable matching for (M,W, P) and suppose each woman in 2(M) 
chooses the strategy of listing only p(m) on her preference list. This is an equilibrium point. 


O It is clear that p is stable under these falsified preferences which we will denote by P’. 
Further, is the only stable matching for (M, W; P’), for any other matching would leave some w 
in #(M) unmatched, which is not possible by Proposition 1. Hence p is the M-optimal matching 
for (M,W; P’). 

To see that P’ is an equilibrium point, suppose some w now changes her preference list 
leading to a new M-optimal matching p’ which gives her a mate m’ = p’(w) whom she prefers to 
uw(w) under true preference. Then m’ must have been matched by p to some w’, for if not (m’, w) 
would have blocked p in (M,W;.P). But then w’ is self-matched under p’ since m’ was the only 
man on her P’-list. This means m’ prefers w to w’, but if this were so, again (m’, w) would have 
blocked p, a contradiction. 0 


Theorem 2 says that the women can force any matching » which is stable under the true 
preferences by equilibrium point strategies. We now present a sort of converse. 


THEOREM 3. Suppose the women choose any set of strategies Py, ( preference lists) that form an 
equilibrium point for the matching game. Then the corresponding M-optimal matching for (M,W; P’) 
is one of the stable matchings of (M,W; P). 


This theorem has been proved by Roth [5] making use of the properties of the matching 
algorithm of [2]. We present here a direct proof. 


O Suppose p’ is the M-optimal matching for (M,W; P’) but (m,w) blocks p’ under w’s true 
preference. We will show that P’ is not an equilibrium point. Namely, let w refalsify by listing 
only m on her preference list. Then she will get him, for let p’’ be the M-optimal matching for the 
new preference P’’. If w does not get m, then she is self-matched by p” so by stability of pw’, m 
prefers w’’(m) to w and by assumption m prefers w to p’(m), so m prefers p’’(m) to p’(m). But 
clearly the matching p”’ would be stable for (M,W — w; P’), where P’ is restricted to W — w. 
Thus m’ is worse off under the M-optimal matching for (M,W; P’) than he is under pw” for 
(M,W — w, P’), which directly contradicts Proposition 2 of the previous section. 0 


To summarize this section, we see that by falsifying appropriately the women can achieve by 
equilibrium point strategies any stable matching, thus, in particular, the W-optimal matching. On 
the other hand, the women cannot get too greedy for if any set of strategies gives some woman w a 
mate whom she likes better than p,,,(w), this will not be an equilibrium point, by Theorem 3, so 
some other woman can change the matching to her advantage by choosing a different strategy. 


5. Strong Equilibrium Points. By Theorem 2, the women can achieve any stable matching p by 
equilibrium point strategies. However, unless p = p,, the equilibrium point will not be strong. To 
see this note that if » # p,, then w(w) # p,,(w) for at least two women, for say, p(w,) # 
Ley (w,) = m,. By Proposition 1, m, is not self-matched by w and hence pw(m,) = w, and 
(Ww) = m, # fy(w,). To show p is not the matching of a strong equilibrium point we let W be 
all w such that p(w) # py,(w). Let all w’s in W refalsify by pretending p(w) is the only man 
on their list. Then p,, is stable for this new preference and hence since all W are matched by p,, 
they are matched by the M-optimal matching for the new preference. 

Do there exist strong equilibrium points? Yes. 
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THEOREM 4. Let each woman w submit a preference list in true order of preference but removing 
all men who are ranked below py, (w). These preferences P’ are a strong equilibrium point. 


O We claim that p,, is the only stable matching for (M,W; P’), for clearly wy, 1s stable and 
any other stable matching p’ must have p’(w) # p,,(w) for some w; hence if w is not 
self-matched, then p’(w) is preferred by w to p,,(w). Since p,, is the W-optimal stable matching, 
this means p’ is unstable under true preferences, hence is blocked by some pair (m,w). But by 
construction of P’, this means (m,w) blocks p’ under P’ preferences, contradicting P’-stability 
of p’. 

Now since py, 1s the only stable matching for (M, W; P’) it is the W-optimal matching for P’. 
If some subset W could get a better payoff by falsifying, then it would get a better payoff than 
from the W-optimal matching of (M,W; P’), but by the Dubins-Freedman Theorem this is 
impossible. 0 


It seems reasonable to consider the falsification of P’ of Theorem 4 as the best method of play 
for the women. It is (a) a strong equilibrium point so no woman or set of women would be 
tempted to deviate from P’, and, (b) among all equilibrium point strategies it gives the women the 
highest possible payoff. It would have been nice to be able to assert that the matching p,,, is the 
only matching obtainable from a strong equilibrium point. Unfortunately this is not the case, as 
shown by the following example. The true preferences are given by Table 1. 


TABLE 1 
W) 


3 1,3 3 
1,3 3,2 2,3 4x 


Mm, 

m3 X,X X,X 1.2 2,X 

M4 3,1 41 2,1 1,1 
TABLE 2 


The first entry in box i,j is the ranking of w, on the list of m,. If it is x, this means that w, is 
not in the list of m,. Thus, m, ranks w, in third place and w, is not in the list of m,. The second 
entry is the ranking of m, by w.. If it is x, this signifies that m, is not in the list of w.. So, w, 
ranks m, in second place and m, is not in the list of w,. For these preferences the matching p,, 
is given by (m,,w,), (m,,W>), (3,3), (m4, W,). Now suppose all women use the system of 
preferences, P’, given by Table 2 above. The entry in box i,j is the ranking of m, by w,. The 
M-optimal matching for these new preferences is given by (m,, w,), (1, W)), (3, W3), (M4, W4)- 
We assert that P’ is a strong equilibrium point. In fact, no subset W’ C W which contains w,, by 
falsifying, can improve the situation of all its members, since the mate of w, is the best possible; if 
W’ contains w,, it cannot improve the situation of w,, for if so (m,,w,) would block the new 
matching. If W’ contains w,, it cannot get a better mate for w, by falsifying, for if so (m,, w,) or 
(m,,w,) would block the new matching. If W’ contains w,, it is not possible to improve the 
payoff of w,, for if so (m,,w,) or (m,, w,) would block the new matching. 


6. Dominated Strategies. There is one more important game-theoretic concept which is il- 
lustrated by the Matching Game. Let o and o’ be two strategies for some player w in a general 
n-person game. We say that o dominates o’ if the payoff to w when she plays o is at least as high 
as when she plays o’ no matter what strategies the other players play, and o strictly dominates o’ 
if the payoff to w when she plays o is higher than when she plays o’ for at least one set of 
strategies for the other players. A strategy is called dominant if it dominates all other strategies. 
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As an illustration of these concepts, the Dubins-Freedman Theorem shows that for each man 
revealing true preferences is a dominant strategy. For the women there are no dominant strategies 
except for the special case where |W| = 1. If there is only one woman w, then revealing true 
preferences strictly dominates any other strategy, for if she submits her true preferences, she will 
get the highest man on her list who has listed her. If she falsifies by, for example, listing m’ > m 
instead of m > m’, she will be worse off in the case where m and m’ are the only men who are 
willing to marry her. From now on it will be assumed that there are at least two women in W. We 
will show that the strategy of listing only one man, as in Theorem 2, is dominated unless that man 
happens to be the woman’s true first choice. In fact, we can essentially characterize the dominated 
strategies of the women by the following theorems, where a > b means that x prefers a to b 
under the true preference P,. 


THEOREM 5. Any strategy P’ in which w does not list her true first choice at the head of her list is 
strictly dominated. 


O This has been proved by Roth [5] using the matching algorithm. We present here a direct 
proef. 

Let P’ be a strategy described above. We will show that P’ is strictly dominated by P’’ which 
lists m, (w’s favorite man) in first place and leaves the rest of the list unchanged. Let p’ and p” 
be the corresponding M-optimal matchings (the strategies of all other players are assumed 


unchanged). If p’’(w) = m,, there is nothing to prove so we suppose that p’’(w) # m,. Then 
(1) u’’(m,) is preferred by m, to w or w is not on m,’s list, 


for if not (m,,w) would block p’’. Hence pw” is stable under preference P’’. So, 
(2) wi (m,) > w’'(m,), 


from the M-optimality of yw’. Furthermore, from (1) and (2) it follows that p’(m,) > Ww, and since 


all the other elements different from m, are ranked in the same ordering in both lists P’ and P.”’, 
it follows that p’ is stable under P’’, and hence p’’(w) = p’(w); so w is no worse off using P”’ 
than using P’. 

To show that P” strictly dominates P’, let m’ be the first element in P’. Consider the 
following preference pattern: m’: {w}, m, = {w} and no other man lists w. Then we can see that 


m, = p'(w) > ww) =m’, which concludes the proof. O 


Our final result states that Theorem 5 describes essentially all the dominated strategies. 


THEOREM 6. Let P’ be any strategy for w in which (a) m, (w’s favorite man) is listed first, (b) 
P’ contains only men m who are on w’s true preference list P.,,. Then P’ is not dominated. 


Ww 


O We will show that for any other strategy P” there exists strategies P for the other players 
such that p’(w) > w”’(w), where p’ and p” are the M-optimal matchings for (M,W; P’ U P ) and 


(M,W, P” U P), respectively. There are three cases. We first suppose P”” also satisfies (a) above. 


yy’ 


é — 
Case 1. P’ contains some m not on P’’. Then for P we suppose that m lists w as first choice 
and no other man lists w. Then p’(w) = m while w is self-matched under p” (here we used that 
me P.). 


Case 2. P’’ contains some m not on P’. Then for P suppose m has preference list {w>w’} 
for some w’, m, has preference list {w’ > w}, and w’ has preference list {m > m,} and no other 
man lists w or w’. Then one verifies that p’(w) = m, > ww) = mM. 


Case 3. Let lists P’ and P,” be the same but w prefers m to m’ in P’ and m’ to m in P”. 


Then suppose preference list for m, is {w’ > w}, for mis {w > w’}, for m’ is {w > w’}, and for 


/ 


w’ is {m’ > m, > m}, and no other men list either w or w’. It is an instructive exercise to verify 
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that p’(w) = m, while p(w) = m’. 


We have seen that if P’ satisfies (a) and (b) and P’ satisfies (a), then for some P , 
w’(w) > p(w). If P’’ does not satisfy (a), then by Theorem 5 there is some P,’” which dominates 


P” and P/’” satisfies (a) so we construct P so that p’(w) > w’’(w) but p’” (w) = w’(w) by 
dominance. The proof is now complete. 0 


REMARK 1. Condition (b) above is needed to avoid cases in which P’, = {m, > m} where m is 
notin P.,. It is clear that this is strictly dominated by P’”’ = {m, }. 


REMARK 2. Note that the counterexample of Section 4 uses only undominated strategies for the 
women. Thus we do not have a unique strong equilibrium point for W even when restricting to the 
use of undominated strategies. 
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144. MISCELLANEA 


Inflation 


The growth of mathematics is producing discontent: 

The inflation rate per annum is pushing 10 per cent. 

Faced with so much information, it’s not easy to succeed 

In locating any theorem that you appear to need. 

Our plethora of indices can leave you in the lurch, 

For it takes less time to prove it than to go and make a search. 
I can offer a solution, but it’s totally upsetting: 

We need to introduce a way of constantly forgetting 

The results that won’t be needed for another 20 years, 

At the end of which they'll surface with appropriate loud cheers, 
While the ones that won’t be needed till forever and a day, 
Once their authors get their tenure will be firmly thrown away. 


—R. P. Boas 
Department of Mathematics 
Northwestern University 
Evanston, IL 60201 
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Math. Sci., V. 55. Springer-Verlag, 1984, x + 170 pp, $22 (BP). [ISBN: 0-387-96047-3] Revised and 
enlarged translation of Japanese edition published by University of Tokyo Press (1982). Develops 
theory of operational calculus using hyperfunctions and applies it to solve one-dimensional wave 
equation, telegraph equation, and heat equation. Additions to Japanese version include a simple 
proof of the Titchmarsh convolution theorem using Liouville’s Theorem and an algebraic solution of 
Laplace’s differential equation: (a,t+b,)y" + (a,t+b) dy” + (agt+by)y = 0. Rather dry presentation. 
BH 


ee 


and Degenerate Elliptic Partial Differential Equations. Edward W. Stredulinsky. Springer-Verlag, 
1984, 143 pp, $7.50 (P). [ISBN: 0-387-13370-4] The first two chapters derive preliminary inequali- 
ties, involving mainly measure theory. The third chapter establishes the main result: the Holder 
continuity of weak solutions of quasilinear degenerate elliptic systems. The author has sprinkled 
the text with welcome explanations that lead the reader through the technicalities; at times, how- 
ever, the mathematics appears simpler than the syntax. YN 


Partial Differential Equations, S(17-18), P. Bilinear Transformation Method. Yoshimasa Matsuno. 
Math. in Sci. & Engin., V. 174. Academic Pr, 1984, viii + 223 pp, $50. [ISBN: 0-12-480480-2] The 
bilinear transformation method is a tool for the solution of a wide class of nonlinear evolution 
equations. The first part of this monograph is an introduction to the method. The second part is a 
study of the mathematical structure of the Benjamin-Ono and related equations. AO 


Partial Differential Equations, P. Lecture Notes in Mathematics-10/5: Asymptotic Analysis for 
Integrable Connections with Irregular Singular Points. Hideyuki Majima. Springer-Verlag, 1984, ix 
+ 159 ppg $9.50 (P). [ISBN: 0-387-13375-5] Uses "strongly asymptotic expansions of functions of 
several variables to prove existence theorems of asymptotic solutions to integrable systems of par- 
tial differential equations. ... Formulates and solves the Riemann-Hilbert~Birkhoff problem, and 
provides analogues of Poincaré’ s lemma and deRham cohomology theorem for integrable connections with 


irregular singular points." BH 


Numerical Analysis, P. Large Scale Scientific Computation. Ed: Seymour V. Parter. Academic Pr, 
1984, ix + 326 pp, $26. [ISBN: 0-12-546080-5] The invited lectures from a conference held at the 
University of Wisconsin, Madison, on June 17-19, 1983. These papers review recent developments in 
the solution of large-scale problems in fluid dynamics. AO 


Numerical Analysis, S(17-18), P, L. Methods and Applications of Error-Free Computation. R.T. Gre- 
gory, E.V. Krishnamurthy. Texts & Mono. in Comp. Sci. Springer-Verlag, 1984, xii + 194 pp, $29.80. 
[ISBN: 0-387-90967-2] Theory of residue and p-adic arithmetic, leading to a system of "Hensel codes" 
that permits exact calculation of standard problems of linear algebra--suitable for ill-conditioned 
problems or as alternatives to numerically unstable algorithms. LAS 


Functional Analysis, P. Ten Lectures on Operator Algebras. William Arveson. CBMS Reg. Conf. Ser. 
in Math., No. 55. AMS, 1984, viii + 93 pp, $14 (P). [ISBN: 0-8218-0705-6] Lectures on nonselfad- 
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joint algebras of operators on Hilbert spaces. Topics include triangular and quasitriangular opera~ 
tor algebras, the Feynman-Kac formula, commutative subspace lattices, reflexive and hyperreflexive 
operator algebras, and the absorption principle. Very readable and well-motivated. BH 


Functional Analysis, S(18), P. Methods for Solving Incorrectly Posed Problems. V.A. Morozov. 
Transl: Z. Nashed, A.B. Aries. Springer-Verlag, 1984, xviii + 257 pp, $36 (P). [ISBN: 0-387-96059- 
7] A fine translation of the 1974 Russian edition, emphasizing "regular methods" for solving a 
variety of problems about operators on Hilbert space, problems which do not quite satisfy the defin- 
ition of well-posed problems. Includes a section on "pseudosolutions." MU 


Analysis on Manifolds, P. Semimartingales and their Stochastic Calculus on Manifolds. Laurent 


Schwartz. Pr U Montreal, 1984, 187 pp, $17 (P). [ISBN: 2-7606-0660-0] An exposition based on lec- 
tures given by Laurent Schwartz at the Université de Montréal in 1982. LCL 


Analysis, P. An Infinitesimal Approach to Stochastic Analysis. H. Jerome Keisler. Memoirs of AMS, 
No. 297. AMS, 1984, x + 184 pp, $16 (P). Uses nonstandard analysis to study stochastic integral 
equations with respect to a Brownian motion. Does not presuppose prior knowledge of nonstandard 
analysis or the theory of stochastic integral equations. AO 


Analysis, P. A Computer~Assisted Proof of Universality for Area~Preserving Maps. J.-P. Eckmann, H. 
Koch, P. Wittwer. AMS Memoirs, No. 289. AMS, 1984, vi + 122 pp, $12 (P). An existence theorem 
concerging the Feigenbaum constant 4.66920... associated with one-parameter families such as T(x) = 
1 - ax’ is proved using a computer program to verify the norm of a high-degree polynomial. The pro- 
gram is carefully crafted to take into account round~off error, and the authors provide extensive 
analysis of other possible sources of error (hardware, software, operating system) enroute to a 
claim that this really is a mathematics proof. LAS 


Geometry, S(15-17), L. Problems in Geometry. Marcel Berger, et al. Transl: Silvio Levy. Problem 
Books in Math. Springer-Verlag, 1984, viii + 266 pp, $29.80. [ISBN: 0-387-90971-0] Written in 
response to request of readers of Marcel Berger’s wide-ranging and concise Geometry for hints and 
solutions to problems therein. In 1985, look for the Springer-promised English edition of the 
five-volumed, well-received Geometry. JK 


Topological Groups, T(18: 1), S, L. Matrix Groups, Second Edition. Morton L. Curtis. Universitext. 
Springer-Verlag, 1984, xiii + 209 pp, $19.80 (P). [ISBN: 0-387-96074-0] Apart from minor editorial 
changes, the only difference from the First Edition (TR, May 1980) is the inclusion of a short 
chapter on roots of a compact connected Lie group. JS 


Topological Groups, P. Lecture Notes in Mathematics-1077: Lie Group Representations III. Ed: R. 
Herb, et al. Springer-Verlag, 1984, xi + 454 pp, $22.50 (P). [ISBN: 0-387-13385-2] Last of three 
volumes of articles by the main speakers at the University of Maryland’s 1982-83 Special Year on Lie 


Group Representations. These papers come from Periods III, IV and V of the special year. LAS 


Operations Research, P. Minimal Cost Flow in Processing Networks, A Primal Approach. J. Koene. CWI 
Tract, No. 4. Math Centrum, 1983, iv + 157 pp, Dfl. 22.70 (P). [ISBN: 90-6196-270-6] Processing 
networks can be used to model certain refining and blending processes. This monograph presents spe- 
cial techniques for solving minimum cost flow problems in these special network models. AO 


Operations Research, P. Algorithms and Approximations for Queveing Systems. M.H. van Hoorn. CWI 
Tract, No. 8. Math Centrum, 1984, 122 pp, Dfl. 17.90 (P). [ISBN: 90-6196-274-9] A research tract 
that tries to bridge the gap between the theory of queveing and efforts to use that theory in numer- 
ical applications. AWR 


Game Theory, T(15), S, L. Games, Theory and Applications. L.C. Thomas. Ser. in Math. & Its Applic. 
Halsted Pr, 1984, 279 pp, $65.95. [ISBN: 0-470-27507-3] An attempt to make the ideas of game theory 
accessible to a wide audience--undergraduate numerates--to use the author’s designation. More 
mathematical than popularized descriptions (readers need some calculus, some visual acquaintance 
with matrices, basic probability), less demanding than a book including proofs or assuming back- 
ground in linear programming or econometrics. Very nice balance, attractive text, with problems. 
AWR 


Optimization, T(17). Optimization Theory and Applications. Jochen Werner. Adv. Lect. in Math. 
Heyden & Son, 1984, vii + 233 pp, $16 (P). [ISBN: 3-528-08594-0] An augmented version of lecture 
notes for an optimization course at Gottengen. Necessary tools from functional analysis are 
developed so as to enable emphasis on duality theory in convex programming and necessary optimality 
conditions for nonlinear optimization problems. No exercises. AWR 


Probability, T(16-18), S, P, L. An Introduction to Probability Theory. P.A.P. Moran. Oxford U Pr, 
1984, 542 pp, $19.95 (P). [ISBN: 0-19-853242-3] A moderately advanced introduction using measure- 
theoretic language and random processes, written to provide a background for more abstract and gen- 
eralized treatises. Applications include discussions of Markov processes, Brownian motion, and ran- 
dom walks. LCL 


Probability, P. Theory of Probability, Third Edition. Harold Jeffreys. Oxford U Pr, 1983, xi + 459 
pp, $24.95 (P). [ISBN: 0-19-853193-1] Paperback reprint of the 1967 impression, which contained some 
corrections and amplifications, of the author’s 1961 Third Edition. A classic treatise, originally 
published in 1939. RSK 
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Probability, T(16-17: 2). Mathematical Techniques of Applied Probability. Jeffrey J. Hunter. 
Operations Res. & Industrial Eng. Academic Pr, 1983. Volume 1: Discrete Time Models: Basic Theory, 
xiii + 239 pp, $32 [ISBN: 0-12-361801-0]; Volume 2: Discrete Time Models: Techniques and Applica- 
tions, xiii + 286 pp, $35. [ISBN: 0-12-361802-9] Rigorous and thorough introduction, presuming a 
background in introductory probability, real analysis and linear algebra. Volume 1 presents the 
recurrent event model and introduces Markov chains, together with prerequisite mathematics. Volume 
2 develops key properties of Markov chains and relates them to two main application areas, branching 
chains and discrete time queueing models. RSK 


Probability, T(16-17: 2). Introduction aux Probabilités. P. Brémaud. Springer-Verlag, 1984, xv + 
334 pp, $16 (P). [ISBN: 0-387-13612-6] 


Probability, P. Stochastic Analysis and Applications. Ed: Mark A. Pinsky. Advances in Prob., V. 7. 
Dekker, 1984, x + 460 pp, $59.95. [ISBN: 0-8247-1906-9] Sixteen papers from leading authorities 
exhibit current research in, for example, stochastic integration, stochastic differential equations, 
stochastic optimization, stochastic problems in physics and biology. LCL 


Probability, T(13). Introduction to Probability, Revised Edition. Nelson G. Markley. Ginn Custom, 
1984, 300 pp, $13.95 (PB). [ISBN: 0-536-04704-9] A nonthreatening introduction to elementary proba- 
bility (including permutations, combinations, conditional probability, binomial distribution, normal 
distribution). No theorems or proofs, but lots of examples and related exercises. LCL 


with Ranked Servers in Heavy Traffic. G.F. Newell. Springer-Verlag, 1984, xi + 126 pp, $8.50 (P). 
[ISBN: 0-387-13377-1] Analysis of the distribution of the number of busy servers among the first s 
at time t for systems of infinitely many (preferentially ordered) servers in parallel. Closed form 
solutions, approximations and asymptotic analysis based on deterministic flow and diffusion models. 
RM 


Statistics, $(14-18), L. Lecture Notes in Statistics-23: Estimation of Victimization Prevalence 


Sa 


Using Data from the National Crime Survey. Diane Griffin Saphire. Springer-Verlag, 1984, v + 165 
pp, $10.50 (P). [ISBN: 0-387-96020-1] Thorough analysis of different estimation models (both ad hoc 
and distribution-based) based on monthly household data from the national crime survey. A good case 


study in alternative means of statistical inference. LAS 


Statistics, S. New Cambridge Elementary Statistical Tables. D.V. Lindley, W.F. Scott. Cambridge U 
Pr, 1984, 80 pp, $7.95 (P). [ISBN: 0-521-26922-9] Updated version of Cambridge Elementary Statisti- 
cal Tables by Lindley and Miller, 1953. Contains tables for the most common parametric and non- 
parametric tests. RSK 


Statistics, P*. Multivariate Descriptive Statistical Analysis: Correspondence Analysis and Related 
Techniques for Large Matrices. Ludovic Lebart, Alain Morineau, Kenneth M. Warwick. Transl: Elisa- 
beth Moraillon Berry. Wiley, 1984, xvi + 231 pp, $34.95. [ISBN: 0-471-86743-8] In the Wiley Series 
in Probability and Mathematical Statistics. Translation, with new examples and updated references, 
of the 1977 French edition entitled Techniques de la Description Statistique. Presents recently 
developed statistical techniques that are particularly suited to the description and analysis of 
large data sets. Presumes linear algebra and classical statistics. Good set of references. RSK 


linear Regression with Application to Kinetic Data. Sgren Johansen. Springer-Verlag, 1984, viii + 
126 pp, $12 (BP). [ISBN: 0-387-90968-0] Theoretical investigation of linear and non-linear models 
motivated by data from a classic experiment involving the metabolism of ethanol in the liver. LAS 


Statistics, P. Lectures in Computational Statistics. Ed: J.M. Chambers, et al. Compstat Lectures, 
No. 2. Physica-Verlag, 1984, 154 pp, $21 (P). [ISBN: 3-7051-0004-1] Algorithms for "parsimonious 
data fitting," the selection of mathematical models of exploratory data analysis with as few free 
parameters as possible. Designed for those who earn their living fitting data under constraints of 
resources and time. LAS 


Statistics, P. An Introduction to the Theory of Large Deviations. D.W. Stroock. Universitext. 
Springer-Verlag, 1984, vii + 196 pp, $18 (P). [ISBN: 0-387-96021-X] Technical monograph based on a 
course given by the author. Main parts include extensions of Cramér’s fundamental theorem and 
explanations of some recent results in the theory of large deviations from ergodic phenomena. RSK 


Statistics, T*(18: 2), P*. Multivariate Observations. G.A.F. Seber. Wiley, 1984, xx + 686 pp, $49. 
[ISBN: 0-471-88104-X] In the Wiley Series in Probability and Mathematical Statistics. Presents both 
modern data-oriented techniques in multivariate analysis and classical methods supported by comments 
about robustness and general practical applicability. Presumes matrix algebra and familiarity with 
the multivariate normal distribution, multiple linear regression, and simple analysis of variance 
and covariance. Extensive references. RSK 


Statistics, P*. Statistics: An Appraisal. Ed: H.A. and H.T. David. Iowa St U Pr, 1984, x + 664 pp, 
$31.25. [ISBN: 0-8138-1721-8] Proceedings of an international conference held in Ames, Iowa, in June 
1983, to mark the 50th anniversary of the Statistical Laboratory at Iowa State University. Esta- 
blished by George Snedecor in 1933, it was the first such unit in the U.S. Papers deal with past 
achievements and future trends in major areas of statistical research. RSK 
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Series Models. T. Subba Rao, M.M. Gabr. Springer-Verlag, 1984, viii + 280 pp, $15.50 (P). [ISBN: 
0-387-96039-2] 


Statistics, P. The Analysis of Categorical Data. Ed: A.K. Gupta. Comm. in Stat.: Theory & Methods, 
V. 12, No. 11. Dekker, 1983, 117 pp, (P). Communications in Statistics currently consists of five 
journals, Theory and Methods, Simulation and Computation, Sequential Analysis, Econometric Reviews 
and Statistical Reviews. Theory and Methods, issued 25 times a year, "focuses primarily on papers 
describing new applications of known statistical methods...and on articles dealing with a strong 
mathematical orientation that are significant to statistical studies." This special issue contains 
six research articles in the area of categorical data analysis. RSK 


Statistics, T(17: 1), P*. Analysis of Ordinal Categorical Data. Alan Agresti. Wiley, 1984, ix + 
287 pp, $35.95. [ISBN: 0-471-89055-3] In the Wiley Series in Probability and Mathematical Statis- 
tics. Contemporary treatment of methods of analyzing cross-classification tables having ordered, 
rather than just nominal, categories for at least one of the classifications. Includes an introduc— 
tion to basic descriptive and inferential methods for categorical data. Good set of references. 
RSK 


ed 


Frederick Mosteller, David L. Wallace. Ser. in Statistics. Springer-Verlag, 1984, xxxviii + 303 
pp, $29.80. [ISBN: 0-387-90991-5] Reprint of the original edition (published in 1964 by Addison- 
Wesley) supplemented by a 25-page Analytical Table of Contents containing descriptions of the con- 
tents of each section and a new concluding chapter reviewing authorship studies in the past 20 
years. LAS 


Computer Literacy, S, L. Overcoming Computer Illiteracy: A Friendly Introduction to Computers. 
Susan Curran, Ray Curnow. Penguin Books, 1983, 458 pp, $12.95 (P). [ISBN: 014-007159-8] An invit- 
ing, well-written, substantial introduction to computers, beginning with abstract data and real 
electronics: the emphasis is on hardware (not keyboards and printers but gates and busses), and 
machine-level code. Concludes with introductions to high-level languages, artificial intelligence, 
and expert systems. A significant, readable treatment intended for the curious layman. LAS 


Computer Literacy, T(13-18: 1), S. An Introduction to Computers and Information Processing. Robert 
A. and Nancy Stern. Wiley, 1982, xxii + 637 pp, $9.80 (P). [ISBN: 0-471-09896-5] A business- 
oriented text. Touches on many basic topics in the area of computer literacy and computer "aware- 
ness." Also contains some good information about various languages, program logic, and system 
design. Includes an interesting chapter on the historical perspective of computers. TR 


Computer Literacy, T(13). Computer Fundamentals for an Information Age. Gary B. Shelly, Thomas J. 
Cashman. Anaheim, 1984, xviii + 474 pp, $23.95 (P) [ISBN: 0-88236-125-2]; Workbook and Study Guide 
with Computer Software Projects, xiii + 321 pp, $10.95 [ISBN: 0-88236-126-0]; Transparency Masters, 
206 pp. Glossy full-color presentation of computer vocabulary, covering hardware and business 
applications. BASIC is introduced in an appendix as an advanced feature; the word "algorithm" is 
not listed in the 17-page index. Extensive teaching and review aids: vocabulary lists, chapter sum- 
maries, workbook, review of questions, and transparency masters. Much ado about very little. LAS 


Computer Programming, S, P. Introduction to PEARL: Process and Experiment Automation, Realtime 
Language: Description with Examples, Second Edition. Wulf Werum, Hans Windauver. Heyden & Son, 
1983, xi + 183 pp, $20 (P). [ISBN: 3-528-13590-5] PEARL is a special-purpose programming language 
designed to meet the needs of real-time applications in the areas of process control and laboratory 
experimentation. PEARL was developed in Germany in the early 1970°s and is now running on dozens of 


different computers. This text contains a description of the complete PEARL language syntax. MS 


Computer Science, P. Computer Aided Analysis and Optimization of Mechanical System Dynamics. Ed: 
Edward J. Haug. NATO ASI Ser. F, V. 9. Springer-Verlag, 1984, xxii + 700 pp, $49.50. [ISBN: 0- 
387-12887-5] Proceedings of the NATO-NSF-ARO study institute held in Iowa City, Llowa, August 1-12, 
1983. JAS 
é 

Computer Science, T(13-14), S. Common LISP, The Language. Guy L. Steele, Jr. Digital Pr, 1984, xii 
+ 465 pp, $22 (P). [ISBN: 0-932376-41-X] LISP is a very popular programming language quite useful in 
the areas of artificial intelligence and natural language processing. One of the problems in using 
LISP, however, is that there are a number of incompatible dialects of the language. This text 
describes another dialect, called "Common LISP," which is intended to serve as a common dialect of 
the language across a wide range of different computer systems. This book is a syntactic and seman- 
tic reference manual for the language. MS 


Computer Science, P. Models of the Lambda Calculus. C.P.J. Koymans. CWI Tract, No. 9. Math Cen- 
trum, 1984, iii + 181 pp, Df1l. 26.20 (P). [ISBN: 90-6196-275-7] Author’s Ph.D. thesis on model 
theory of the pure untyped lambda calculus (the function space theory underlying algorithms). Fol- 


lows categorical approach of Scott; discussion of hypergraph models. RM 


Computer Science, P. Lecture Notes in Computer Science-178: Readings on Cognitive Ergonomics~~Mind 
and Computers. Ed: G.C. van der Veer, et al. Springer-Verlag, 1984, 269 pp, $13 (P). [ISBN: 0- 
387-13394-1] Proceedings of the second European conference held in Gmunden, Austria, September 10- 
14, 1984. JAS 
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Computer Science, P. Lecture Notes in Medical Informatics-24: Medical Informatics Europe 84. Ed: 


F.H. Roger, et al. Springer-Verlag, 1984, xxvii + 778 pp, $34.50 (P). [ISBN: 0-387-13374-7] 
Proceedings of the congress held in Brussels, Belgium, September 10-13, 1984. JAS 


Computer Science, P. Ninth Colloquium on Trees in Algebra and Programming. Ed: B. Courcelle. Cam- 
bridge U Pr, 1984, 326 pp, $39.50. [ISBN: 0-521-26750-1] Proceedings of a conference held in Bor- 
deaux, France in March, 1983. The text contains reprints of 22 research papers on a wide range of 
theoretical computer science including data structures, regular expressions, lambda calculus, finite 
automata, and formal languages. The papers are all quite advanced and require a deep understanding 
of theoretical computer science and discrete mathematics. MS 


Computer Science, P. Lecture Notes in Computer Science-1/9: How to Multiply Matrices Faster. Victor 
Pan. Springer-Verlag, 1984, xi + 212 pp, $11 (P). [ISBN: 0-387-13866-8] Surveys the progress made 
since 1978 in the development of fast algorithms for matrix multiplication and related problems from 
combinatorial analysis and linear algebra. AO 


Computer Science, S, P. Lecture Notes in Computer Science-180: Ada Software Tools Interfaces. Ed: 
Peter J.L. Wallis. Springer-Verlag, 1984, 163 pp, $10 (P). [ISBN: 0-387-13878-1] Ada is a new pro- 
gramming language used primarily in the area of system’s software. This text describes research 
work being conducted in building the tools (i.e., support programs) that will enhance the Ada pro- 
gramming environment and make the Ada programmer more productive. It contains 10 research papers 
which were presented at a Conference of Ada software tools in Bath, England in 1983. The reader 


should already be familiar with the syntax and semantics of the Ada language. MS 


Computer Science, P*, L. The Carnegie-Mellon Curriculum for Undergraduate Computer Science. Ed: 
Mary Shaw. Springer-Verlag, 1985, 198 pp, $18.50 (P). [ISBN: 0-387-96099-6] Philosophy, objectives, 
requirements, and course descriptions for CMU’s recently redesigned undergraduate curriculum in com- 
puter science, Relying on integration of topics to streamline the major, it requires only 10 com- 
puter science and 4 mathematics courses, reserving nearly 504 of the curriculum for liberal arts and 
elective non-CS courses. The volume contains reprints of several papers dealing with the curricular 
relationship between mathematics and computer science. LAS 


Control Theory, P. Control and Dynamic Systems: Advances in Theory and Applications, V. 21: Non- 
linear and Kalman Filtering Techniques, Part 3 of 3. Ed: C.T. Leondes. Academic Pr, 1984, xx + 422 


pp, $49. [ISBN: 0-12-012721-0] A selection of new and substantive examples of the techniques and 
technology in the applications of nonlinear filters and Kalman filters. LCL 


Control Theory, T(18), S, P. Differential Inclusions: Set-Valued Maps and Viability Theory. Jean- 
Pierre Aubin, Arrigo Cellina. Grund. der math. Wissenschaften, B. 264. Springer-Verlag, 1984, xiii 
+ 342 pp, $44. [ISBN: 0-387-13105-1] A technical report on the motivations, ideas, and tools used in 
the study of dynamical systems in which a differential equation x” = f(x) is replaced by a differen- 
tial inclusion x€F(x), where F(x) is an associated set-valued map ("feasible velocities"). LCL 


Control Theory, P. Impulse Control and Quasi-Variational Inequalities. Alain Bensoussan, Jacques- 
Louis Lions. Gauthier-Villars, 1984, xiv + 684 pp, $76. [ISBN: 2-04-015577-5] The general goal of 
the book is to explore analogies between variational inequalities (see the author’s Applications of 
Variational Inequalities in Stochastic Control, TR, August-September 1979), and quasi-variational 
inequalities. LCL 


Systems Theory, P. Simulation and Model-Based Methodologies: An Integrative View. Ed: Tuncer I. 
Oren, Bernard P. Zeigler, Maurice S. Elzas. NATO ASI Ser. F, V. 10. Springer-Verlag, 1984, xiii + 
651 pp, $57.50. [ISBN: 0-387-12884-0] Invited addresses and contributed papers at a 1982 NATO 
Advanced Study Institute held at Ottawa, the goal of which was to foster simulation as a method 
applicable in many disciplines. AWR 


Applications (Biology), P, L. Mathematical Models of Renewable Resources. Ed: Roland H. Lamberson. 


Humboldt State U (Math. Modeling Group), 1983, 245 pp, (P). Proceedings of a conference held at the 
University of Victoria in May 1983, primarily devoted to models of fisheries. LAS 


Applications (Biology), &(16). Modeling Dynamic Phenomena in Molecular and Cellular Biology. Lee A. 
Segel. Cambridge U Pr, 1984, xx + 300 pp, $49.50; $12.95 (P). [ISBN: 0-521-25465-5; 0-521~-27477-X] 
Intended for biology students who have studied calculus for one year; aims to show that analysis of 


mathematical models can give insight into biological problems. AWR 


Applications (Economics), P. Coping with Complexity: Perspectives for Economics, Management and 
Social Sciences. Hans W. Gottinger. Theory & Decision Lib., V. 33. D Reidel, 1983, xv + 224 pp, 
$44. [ISBN: 90-277-1510-6] Most socio-economic systems are too complex to be modelled by a few equa- 
tions but are not large enough to assume continuity. Gottinger proposes modelling these systems 
with sequential machines. Motivated by concept of complexity in automata theory, introduces design, 
control and evolution complexity to study stability and controllability of systems. KS 


a AN TANS hte 


Maintenance Policies, Life Testing, and Structures. Ed: Mohamed S. Abdel-Hameed, Erhan Ginlar, 
Joseph Quinn. Notes & Reports in Comp. Sci. & Appl. Math., V. 10. Academic Pr, 1984, xiii + 303 
pp, $37.50. [ISBN: 0-12-041420-1] Proceedings of a conference held at Charlotte, North Carolina, 
June 24-26, 1983. Twenty papers on stochastic failure models, optimal maintenance and replacement 
policies, life testing, reliability structures, computability, and approximations. AO 
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Applications (Engineering), P. Flexible Shells: Theory and Applications. Ed: E.L. Axelrad, F.A. 
Emmerling. Springer-Verlag, 1984, ix + 282 pp, $23. [ISBN: 0-387-13526-X] Proceedings of the Euro- 
pean Mechanics Colloquium 165. Contains seventeen papers on the theory of shells capable of large 
elastic displacements. AO 


Applications (Image Reconstruction), $(17), P. Inverse Problems. Ed: D.W. McLaughlin. SIAM AMS 
Proc., V. 14. AMS, 1984, vii + 189 pp, $33. [ISBN: 0-8218-1334-X] This symposium (April 1983, New 
York) concerns the mathematics pertinent to: integral equations in geophysics; Radon transforms in 
radiation oncology; spectral techniques and boundary measurements in physics; probabilistic methods 
of maximal entropy to enhance blurred images. Interesting survey of the field and its open prob- 
lems. YN 


Applications (Physics), S(18), P. Differential Geometric Methods in Mathematical Physics. Ed: S. 
Sternberg. Math. Physics Stud. D Reidel, 1984, vii + 295 pp, $49. [ISBN: 90-277-1781-8] Research- 
expository papers delivered at a conference in Tel Aviv in August 1982, focusing more on differen- 
tial geometry than on physics. A few concern pure symplectic geometry (e.g., Guillemin and 
Sternberg’s), some have a more electromagnetic Yang-Mills flavor (Carmeli and Huleihil’s), and a 
couple provide a colorful flow of explanations on quarks and plasmas for the non-physicists 
(Bleuler’s and M.E. Mayer’s). This variety of styles and topics will captivate both mathematicians 


and physicists. YN 


Applications (Physics), $(16-18), P*, L*. Rational Thermodynamics, Second Edition. C. Truesdell. 
Springer-Verlag, 1984, xvii + 578 pp, $69. [ISBN: 0-387-90874-9] A significant enlargement of the 
1969 McGraw-Hill First Edition featuring an extensive new historical introduction plus 22 appendices 
by individuals whose work has advanced the subject during the last fifteen years. "Rational thermo- 
dynamics is almost as much a restoration as it is a new science... . Taken as a whole [it] is nei- 
ther more nor less axiomatic than any other mathematical science as treated by mathematicians." LAS 


Applications (Physics), P. Chaos and Statistical Methods. Ed: Y. Kuramoto. Ser. in Synergetics, V. 
24. Springer-Verlag, 1984, xi + 273 pp, $32. [ISBN: 0-387-13156-6] Proceedings of the Sixth Kyoto 
Summer Institute, devoted to chaos and statistical mechanics, held in Kyoto, Japan, in September 
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that p’(w) = m, while p”’(w) = m’. 


We have seen that if P’ satisfies (a) and (b) and P” satisfies (a), then for some P , 
ww) > p(w). If P’ does not satisfy (a), then by Theorem 5 there is some P’”” which dominates 


P” and P/” satisfies (a) so we construct P so that p’(w) > hw’ (w) but wp’ (w) = p(w) by 
dominance. The proof is now complete. O 


REMARK 1. Condition (b) above is needed to avoid cases in which P’ = {m, > m} where m is 
not in P.,. It is clear that this is strictly dominated by P’” = {m,}. 


REMARK 2. Note that the counterexample of Section 4 uses only undominated strategies for the 
women. Thus we do not have a unique strong equilibrium point for W even when restricting to the 
use of undominated strategies. 
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144. MISCELLANEA 


Inflation 


The growth of mathematics is producing discontent: 

The inflation rate per annum 1s pushing 10 per cent. 

Faced with so much information, it’s not easy to succeed 

In locating any theorem that you appear to need. 

Our plethora of indices can leave you in the lurch, 

For it takes less time to prove it than to go and make a search. 
I can offer a solution, but it’s totally upsetting: 

We need to introduce a way of constantly forgetting 

The results that won’t be needed for another 20 years, 

At the end of which they'll surface with appropriate loud cheers, 
While the ones that won’t be needed till forever and a day, 
Once their authors get their tenure will be firmly thrown away. 


—R. P. Boas 
Department of Mathematics 
Northwestern University 
Evanston, IL 60201 


How likely is it that somebody who solved one of Hiibert’s problems is more famous for his 
work in another part of mathematics? (See p. 304.) 
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A, is simple, independent of the five forms in which it commonly arises: as a group of 
permutations, as the rotation group of a dodecahedron or an icosahedron, as a group of 2 X 2 
matrices over a field of order 4 or 5 modulo the scalar matrices. 


Joseph A. Gallian 

Department of Mathematical Sciences 
University of Minnesota 

Duluth, MN 55812 


145. MISCELLANEA 
Ode to a 6? 


A thing as lovely as a } 
I think that I shall never spy} [Choose one. ] 


If you want to make @ rhyme with “see”, 
But you're really determined to be 
Consistent, admit 
Though it doesn’t quite fit, 
That a disk measures r? times “pea”’. 


Each of €, 7, ¢, x, W will claim 

An identical rhyme for its name. 
Before you get hot, 
Check Liddell and Scott!; 

Give them, not the author, the blame. 


Barry W. Brunson 
Department of Mathematics 
Western Kentucky University 
Bowling Green, KY 42101 


'A Greek-English Lexicon, compiled by H. G. Liddell & R. Scott, Oxford at the Clarendon Press, 1968 (Ist 
edition 1843; widely recognized as authoritative). 
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WHEN ONE EQUATION SOLVES THEM ALL 


WILLIAM C. WATERHOUSE 
Department of Mathematics, The Pennsylvania State University, University Park, PA 16802 


There is a real polynomial, X? + 1, with no real root. But as soon as we adjoin a root of this 
one polynomial, we can solve all other polynomial equations. This fact, the “fundamental theorem 
of algebra,” is a remarkable property of the real numbers, and one would not expect many other 
number systems to share it. This expectation is actually supported by a theorem, a precise and 
pretty result due to Artin and Schreier [1]. Unfortunately, their theorem is not very widely known. 
For one thing, it is often buried in more general discussions of ordered fields. But in addition, all 
the proofs I have seen [1], [2, p. 47], [3, p. 316], [5, p. 66] require some special results on extensions 
of degree p*. In this note, I want to show how the theorem can be proved using only results on 
extensions of prime degree, results that would be included in any case by most people teaching 
field theory. I have written out these results as a lemma, so that this note itself can be read by 
anyone knowing basic Galois theory. Someone interested in the theorem only for fields of 
characteristic zero can skip parts (2), (4), and (6) of the Lemma and the first and third paragraphs 
of the main proof. 

We call a finite-dimensional field extension Galois if it is normal and separable. Our 
preliminary lemma deals with a Galois field extension E/F of prime degree p. In this case the 
Galois group of automorphisms is cyclic, say with elements o,0°,...,0”~', o? = id. For any x in 
E we define its norm and its trace by the formulas 


N(x) = xo(x)07(x)o7%(x)--- 6? '(x), 
Tr(x) = x + o(x) +0°(x) +--+ +0?7"(x). 
These elements are obviously sent to themselves by o and thus lie in F. Clearly 
N( xy) = N(x) N(y) and Tr(x + y) = Tr(x) + Tr(y), 
while 


N( by) = b?N(y) and Tr( by) = b- Tr( y) 
for b in F. 


LEMMA. Let E'/F be a Galois extension with cyclic Galois group (0) of prime order p. Then: 

(1) For any values c,,...,¢, in E not all zero there is some x in E with 0 # Xc,0'(x). 

(2) Tr(E) is all of F. 

(3) Each z in E with N(z) = 1 has the form o(y)/y for some y in E. 

(4) Each z in E with Tr(z) = 0 has the form o(y) — y for some y in E. 

(5) If char(F’) # p, and F contains an element § #1 with §? =1, then E = F(y) for some y 
satisfying o(y) = Sy and y? & F. 

(6) If char(F) = p, then E = F(y) for some y satisfying o(y) =y + 1 andy’? —yeé F. 


Proof. Suppose (1) fails, and let Uic,o’ be a counterexample. Clearly there cannot be just one 
nonzero c,, So say c, and c, are nonzero. As o” # o°, we can choose b in E with o’(b) # o°(b). 
We have then 


0=Lco'(bx) =) c,0'(b)o'(x), 
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and also 
0 =0'(b) > co'(x) = ico'(b)o'(x). 
Subtracting, we get 
0=Yc|o'(b) — 0'(b)] o'(x). 


In this relation the coefficient of o' is zero whenever c, was zero and is also zero for i = r, but it 
is nonzero for i = s. Thus we have a nontrivial counterexample with fewer terms in it. Such 
reduction cannot continue forever, and hence (1) must be true. Applied to the case c, = 1, this 
shows that some Tr(x) is nonzero; and Tr( bx) = b Tr(x) for every b in F, so (2) follows. 

For (3), we consider expressions of the form 


w= x + zo(x) + zo(z)o?(x) +--+ +(z0(z) +++ 6? ~7(z)) 0? 1x). 
By (1), we can choose x to make this nonzero. Applying o and recalling N(z) = 1, we find that 
zo(w) = w. Thus we can take y to be w '. For (4) similarly we first use (2) to choose x with 
Tr(x) = —1. Then we write down 
y = 20(x) +(z + 0(z))o?(x) +--+ +(z + 0(z) + +++ +0?>-7(z)) 0?“ 1(x). 
As Tr(z) = 0, we find that y — o(y) equals 
zo(x) + z07(x) +--+ +z0?~ M(x) —(o(z) +--+ +0? -M(z))x 


= z(x + o(x) +--+ +0?-1(x)) =2Tr(x) = —z. 


For (5), we observe that N({) = §? = 1, so § = o(y)/y for some y. As o(y) # y, we have 
y € F. Hence E = F(y), since the prime degree extension E/F cannot have any intermediate 
fields. We have 


o(y?) = o(y)" = (Sy)? = Sy? = y?, 
so y? isin F. For (6) similarly we observe that 
Tr(1) =1+---+1=0, 


so 1 = o(y) —y for some y by (4). Thus o(y) = y + 1. As before, y must generate E. Since 
(u + v)? =u? + v? in characteristic p, we have 


o(y? —y) =o(y)* —o(y) =(y +1)’ -(vt+ Day? +1-y-l=y’-y, 
so y? —y isin F. 


DEFINITION. A field K is called real closed if 

(1) every element of it is either a square or the negative of a square, 

(2) every sum of squares in K is a square, 

(3) —1 is not a square, and 

(4) K (V-1 ) is algebraically closed (that is, every polynomial over it has a root in it). 


These conditions, which will all come out automatically when we prove the theorem, obviously 
say that K is very much like the real numbers. More detailed study ((4, §5.6], [6, pp. 60-—65]) 
shows in fact that any “elementary” property of the reals is shared by K. We may note at least 
that (2) and (3) imply char(K) = 0, since 1 is a square and —1 has the form 1+ --- +1 in 
characteristic p. Also, (4) implies that every polynomial over K of odd degree has a root in K. 


THEOREM (Artin-Schreier). Suppose K is a field whose algebraic closure K is a nontrivial finite 
extension of K. Then K is real closed. 


Proof. We show first that if char(K) = q # 0, then K must be perfect. That is, we show that 
K is closed under taking qth roots. Indeed, f(x) = x? is a function K — K preserving sums as 
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well as products; its kernel is trivial; and it is surjective, because for any b the equation 
X7 — b=0 has a root in K. Thus f is an isomorphism of K onto itself. Clearly f(K) ¢ K. If 
f(K) is a proper subfield of K, then we can apply f-' and conclude that K is a proper subfield of 
f-'(K). Applying f—' again, we see that f—'(K) is a proper subfield of f~?(K), and so on. But 
such an infinite tower 


Kcf\(K)cf?(K)c--- cK 


cannot exist, since K is a finite extension of K. Thus f(K) must equal K, and K is perfect. 

Obviously K is a splitting field over K, and we have just seen that K is perfect even if 
char(K) # 0, so K/K is a Galois field extension. We focus now on some possible intermediate 
field F for which the degree |K: F| is equal to a prime, p. Our goal is to restrict the possibilities 
for this F so strongly that the general result will follow. As K/F is Galois of prime degree p, we 
know at least that the results of the lemma can be applied to it. 

Suppose first that char( F’) is the same as p. Write K = F(y) with y”? — y =a in F. Choose b 
in K with Tr(b) = a. As K is algebraically closed, it contains some z satisfying z? — z = b. Let 
c = Tr(z). Then 


Te(z?) = Yo'(z?) = Y(o'(z))” = (Lo'(z))" = (TH(z))’, 
so c? —¢ = Tr(z? — z) = Tr(b) = a. Thus c? — c = y? — y, so 


coy=eP—yP=(e-y)", 
But the roots of 7? = T are T= 0,1,..., p — 1, which all lie in F. Thus c — y isin F. But c is 
also in F, and hence y is in F. This contradicts K = F(y). Hence p ¥ char(F). 
Suppose next that p is an odd prime different from char( F’). Then the equation 


0O=(7T? -1)fT-1)=T? '+.---4+T4+1 


does not have 1 as a root. Let § be a root of it (in K). Since K has prime degree p over F, there 
are no fields intermediate between F and K. Since § satisfies an equation of degree less than p, 
we have F({) # K, and hence F({) = F. That is, £ is in F. The roots of T’? = 1 then are 
1,6,5°,...,§?-', and they are all in F. Now by the lemma we can write K = F(y) with 
o(y) = Sy and y? =a in F. As K is algebraically closed, it contains some z satisfying z? = y. 
Let c = N(z). Then 


c? = (Nz)" = N(z?) = N(y) = y(Sy)(87y) «(8 'Y) 
= prgp(p "DP = y?. 
Thus (y/c)” = 1, so (as we saw) y/c isin F. But c is in F, and so y is in F. This contradicts 
K = F(y). Hence p cannot be odd. 

Thus far we have seen how hard it is for K/F to be of prime degree. We continue by finding 
what restrictions there are in the one case that can occur, p = 2 (and char( F’) # 2). Here by the 
lemma we can write K = F(y) with o(y) = —y and y? =a in F. In particular, not all elements 
in F are squares in F. If 5 is any nonsquare in F, and z is a fourth root of it in K, let ¢ = N(z); 
then 

c? = N(z?) = N(yb) = —b, 


so —b is a square in F when b is not. Also, vb =c/-1 , so —1 is not a square in F (and 
K = F(V-1)). Finally, if r and s are any elements of F, then K contains some element u + vy —1 
with 


(ut+v/-1) =r4+sV-1. 
This gives 


r? +s? = N(r+s/-1) = (N(u + v/—1))°, 
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and thus a sum of two squares in F is again a square. In short, F is real closed. 

We have shown now that any field F whose algebraic closure has prime degree must be real 
closed. Consider finally an arbitrary K with K/K finite Galois. Let L = K(/—1). Then K/L is 
finite Galois. If L # K, choose some nontrivial automorphism + of K over L. Let n be its order, 
and choose some prime p dividing n. Then the field F fixed by r””” has |K: F| = p. Hence F 
should be real closed; but this is impossible, since V—-1 isin LC F. This contradiction shows 
that L = K. But then K/K has degree 2, and K must be a real closed field. m 
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THE D’ALEMBERT FUNCTIONAL EQUATION 


F. J. PAPP 
Department of Mathematics, University of Michigan, Dearborn, MI 48128 


I. Introduction. D’Alembert’s functional equation 


(1) 2f(x)f(y) =f(x + y) + flx—y) 
has a long history going back to d’Alembert [1], Poisson [2] and [3], and Picard [4] and [5, Chapter 
I, Section III]. The equation plays an important role in determining the sum of two vectors in 
various Euclidean and non-Euclidean geometries; in addition to the references above see [6] and 
(7, Section 2.4.2]. 

In equation (1) f is taken to be a continuous real valued function defined on the entire real 
line. With these assumptions, the only solutions of (1) are 


f(x) =0, f(x)=1, f(x) =coskx, and f(x) =coshkx, 


where k is a real constant. The Cauchy method of showing that these are the only solutions to (1) 
first determines f on a dense set and then uses continuity to pass to the entire real line; see [7, 
Section 2.4.1] for the details. This paper provides an alternative proof which makes use of a few 
elementary ideas from calculus and differential equations. This method given below is quite 
straightforward and will, perhaps, be somewhat more accessible than Cauchy’s method. 


II. Preliminary Properties. We will first obtain several properties which any solution of (1) 
must satisfy. All of these are derived directly from the functional equation itself. 
If x = y = 0 in (1), then 


(2) 2[ £(0)]* = 2f(0) 
so that, either 

(3) f(0) = 0 

(4) f(0) =1. 


If (3) holds, then for any real x, 
O = 2f(x) (0) = f(x + 0) + f(x — 0) = 2f(x). 
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In other words, f is the identically zero function. So we will assume from now on that (3) does not 
hold and that f satisfies (4). 

Next, any solution of (1) must be an even function. To see this, let x = 0 and y be any real 
number. It then follows from (1) and (4) that 


2f(y) = 2f(0) f(y) =f(0 + vy) + f(0 — vy) = f(y) + f(-»); 


(5) fly) =f(-y). 


III. Construction of the Continuous Solutions of (1). Since f is assumed to be continuous on 
the entire line, it will be integrable on any finite subinterval. Consequently, for t > 0 we have 


(6) J 2f0) £00) ay = fi fle + ») dy + fl fey) oy. 


With changes of the variables in the two integrals on the right hand side of this last equation, 
we obtain 


t xt+t 
(7) 2f(x) [ f(y) dy = 2," f(y) &. 
—T x—t 
Since (4) holds, and since we have assumed that f is continuous, there exists t > 0 such that 
t 
(8) f flya>o. 
—1t 


Note that the right hand side of (7) is differentiable with respect to x and thus so is the left hand 
side. Knowing now that f is differentiable allows us to conclude that f’ exists as well. In fact, 
proceeding step-by-step, we see that any continuous solution of (1) is in fact infinitely differentia- 
ble. 

Differentiate both sides of (7) with respect to x, evaluate the resulting expression at x = 0, and 
choose ¢ > 0 sufficiently small so that (8) holds. It then follows that 


(9) 24(0) f° f(y) 4 = Af) -—f(-1)] = 0. 
since f is an even function. Consequently 
(10) f’'(0) = 0. 

Now, differentiate (1) twice with respect to y and obtain 
(11) 2f(x)F(y) = f(x + yy) + f(x y). 
Letting y = 0, we get 
(12) 2f(x)f'(0) = 2f"(x). 


Let K = f’(0). So (12) becomes (13) below, and we have now shown that if f is a non-trivial 
solution of (1) it must also be a solution to the initial value problem (13)-(4)-(10): 


(13) f(x) = Kf(x), 
(4) f(0) = 1, 
(10) f'(0) = 0. 


Depending on K there are three possibilities. For K = 0, 
f(x) =Cx+ 6; 
for K > 0, 
f(x) = Csinh kx + C,cosh kx 
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where k = K!/?; and for K < 0, 
f(x) = Cysin kx + C,cos kx 


with k =(—K)!”*, 
In all three cases, (4) and (10) lead to C, = 0 and C, = 1. Thus the continuous solutions of (1) 
are 


(14) f(x) = 0, 

(15) f(x) =1, 

(16) f(x) = cosh kx, 

and 

(17) f(x) = cos kx. 
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THE DISCRIMINATOR 
(A SIMPLE APPLICATION OF BERTRAND’S POSTULATE) 


L. K. ARNOLD, S. J. BENKOSKI, AND B. J. MCCABE 
Daniel H. Wagner, Associates, Station Square One, Paoli, PA 19301 


1. Introduction. For a positive integer n, what is the smallest value of k such that 1°,27,...,n? 


are all incongruent modulo k? The answer to this question requires only elementary number 
theory techniques and also provides a rare application of Bertrand’s Postulate in an elementary 
setting. 

Bertrand [1] postulated in 1845 that for every natural number > 3, there is a prime p 
satisfying n < p < 2n — 2. He verified this for n < 3,000,000. Tchebychef [3] proved this in 1850. 
Usually, the Postulate is now stated with n < p < 2n. Most modern texts use a proof due to 
Erdos [2]. 


2. Motivation. This particular question arose from consideration of a problem in computer 
simulation. The problem may be described as developing a method to determine quickly the 
square roots of a long sequence of integers. Each of these integers is a perfect square and the range 
of the integers is known. However, the integers themselves are generated by another process and, 
in fact, it can be assumed that they are generated randomly. The goal is to determine the square 
roots quickly. The usual square root algorithm is much too slow. An alternative procedure would 
be to calculate all the required square roots once and then “look up” the appropriate value in an 
array. More precisely, let S = {1,2?,...,°} be the set of integers whose square roots must be 
determined. Let A be al X n? array with A(x) = x’/*. For any value s € S, A(s) is the square 
root of s. This is obviously a very quick procedure; however, it requires an array of size 1 X n°. 
This is a large array and most of the values in the array are not used. Is there an alternative 
method which would still allow quick calculation of the square roots but require a much smaller 
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array? A potential candidate is the modulo function. If 1,27,...,n* are distinct modulo k, then 
letting A(r) = x'/? where r = x modk and 1 <r< k, allows a similar look up procedure to be 
performed. This is obviously slower than the previous method, since the index for that array is 
determined by reducing x modulo k. However, the modulo procedure is, in general, much faster 
than the square root procedure. The only remaining question is the size of the array required to 
store the square roots. This is simply the smallest value of k such that 1,27,...,n?* are 
incongruent modulo k. This is an interesting number theory problem and the solution is given 
below. 


3. The discriminator function. Let D(n) = min{k{l’,2’,...,”° are incongruent modulo k}. 
We call D the discriminator (or discriminator function). Table 1 gives the values of D(n) through 
n= 18. 


TABLE 1 
4 5 6 7 8 9 10 11 12 13 #14 15 %16 #17 «18 
9 10 13 14 17 19 22 22 26 26 29 31 34 34 37 


We shall show that D(n) (for n > 4) is equal to the smallest integer which is greater than or 
equal to 2n and which is either a prime or twice a prime. This is done by first defining B(n). Let 


1, ifnm=1, 
2, ifn= 2, 
B(n) =( 6, ifn =3 
9, ifn = 4, 


min{k|k >2n and k=pork=2p forpaprime}, ifn> 4. 


We show below that D(n) = B(n) by first showing that D(n) < B(n) and then that 
Bn) < D(n). (Throughout it is assumed that n > 4.) 
The proof that D(n) < B(n) is done by two lemmas. 


LEMMA 1. If p > 2n, then D(n) < p. 


Proof. Suppose not. D(n) > p implies that there exist s and r with 1 < r<.s <n such that 
s’? = r* mod p. This implies that p(s + r)(s — r). This is impossible since p > 2n implies that 
p>(s try). 


LEMMA 2. If 2p > 2n, then D(n) < 2p. 


Proof. Suppose not. D(n) > 2p implies there exist r and s with 1 <r<s <n such that 
s* = r* mod2>p. Thus, 2p|(s + r)(s — r). Since s — r < p, then p\(s + r). Also, 2p > 2n >s+4r 
implies that p = r + s. Thus, 2|(s — r). But this means that s + r is odd and s — r is even. 


These two lemmas show that D(n) < B(n). Next we show that B(n) < D(n). This is also 
done with two lemmas. 


LEMMA 3. D(n) > 2n. 


Proof. If not, then there exist integers r and s such that 1 <r<s<n and D(n)=s+r. 
This is impossible since s? = r? modr + s. 


A check of Table 1 indicates that B(n) = D(n) for n < 18. The following lemma completes 
the proof. 


LEMMA 4. If 2n < m< B(n) andn > 18, then there exist integers s andr such thatl <r<s 
< nands* = r* mod m. 


Proof. Since m < B(n), this means that m is less than the smallest prime greater than or equal 
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to 2n. Bertrand’s Postulate states that there is at least one prime between x and 2x. This means 
that m < 4n. Further, m < B(n) implies that m # p and m # 2p for any prime p. Thus, writing 
m as a product of primes 


m= 2.41 3%2 5% trey 


we see that it can be factored in at least one of four ways: 


(1) a, =1 and m=2??. 

(2) a, = 1 and m = 2ab with a > 3, b > 3, and a and b both odd. 

(3) a, # land m=?’. 

(4) a, # 1 and m= ab with a > b > 2, and a and b of the same parity. 


In each case we will exhibit s and r such that 1 <r< s,s? =r? modm,and 5s <n. (We will 
use the fact that for a,b,s positive integers with a,b >s, then (a + b) < ((ab/s) +5). This 
follows immediately from s(a — s) < b(a — s).) 


(1) Let s = 3t and r = ¢, then s* = r* mod m. Since 2t* < 4n, this implies that t < (2n)'/? 
or s < (18m)'/*. Now, (187)'”* is less than n for n > 18. Thus, s < n. 

(2) Lett s=a+b and r=a-—b, then s?=r*modm. For n> 9, it follows that 
((4n/6) + 3) < n. Since 2ab < 4n, then ((ab/3) + 3) < ((4n/6) + 3) < n. Combining this with 
the remark above yields s = a+b < ((ab/3) + 3) <n. 

(3) Let s = 2t and r = ¢. Clearly, s* = r? mod m. For n > 16, s = 2t < 4n'/* <n. 

(4) Let s =(a+ b)/2 and r = (a — b)/2. Again, s* = r* mod m. From the remark above, 
(a + b) < ((ab/2) + 2). This implies that 


(a+ b)/2=s <(ab/44+1)=(m/4 +1). 
Since m < 4n, we have s < (4n/4 + 1) =n +1. Thus, s <n. 


4. Summary. If D(n) denotes the smallest positive integer k such that 1°,27,...,n° are 


incongruent modulo k, then 


l, ifnv=1, 
2, ifn=2, 
D(n) =( 6, ifn = 3, 
9, ifn= 4, 


min{ k|k > 2n andk=pork=2p forpaprime}, ifn> 4. 
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NOTE ON GRAPHS AND MATRIX INEQUALITIES 


SEYMOUR LIPSCHUTZ 
Department of Mathematics, Temple University, Philadelphia, PA 19122 


1. Introduction. The purpose of this note is to show how a very simple counting argument in 
graph theory can be used to obtain results about matrices which are related to a classical problem 
in analysis. (We use the notation S(M) to denote the sum of the entries of a matrix M.) 


2. Counting Argument. Let G be a finite graph, let P,(k) denote the number of paths of 
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Fic. | 


length k in G, and let d, denote the degree of the vertex v, of G. We illustrate our idea by means 
of an example. Suppose G is the graph in Fig. 1. The adjacency matrix A of G, and its square A? 
follow. 


and A’ = 


Pe ON © 
rere CON 
PON s&s 


2 
4 
1 
6 


NO Or © 
ONrR Fr 
NO RR NSN 
RN DW 


Then it is well known that the number P,.(2) of paths of length two in G is obtained by summing 
the elements of A’: 


P.(2) = S(A?) = 12 + 13 + 12 + 13 = 50. 


(We compute S(A’) by first adding the elements of each row.) 
Our idea is to count P,(2) in another way. First observe that the degrees of the vertices of G 
are as follows: 


d, = 3, d, = 4, d, = 3, d,= 4. 
Now each path of length two in G has a unique midpoint. The number of paths of length two with 


midpoint v, of G is equal to dj? = 3° = 9, since there are exactly d, = 3 ways to enter v, and 
d, = 3 ways to leave v,. A similar result holds for the other vertices of G. Accordingly, 


P;(2) = dj + di +d? +dj=9+4+16+9+416 = 50. 
This result clearly holds for any finite graph. 
PROPOSITION 1. Let G be a finite graph with adjacency matrix A. Then 
P.(2) = S(A*)=djp+d5+-+--+@ 


where d, denotes the degree of the vertex v, of G. 
3. Classical Problem. Suppose 4 is a nonnegative irreducible n-square matrix. Then it is well 


known (see, for example, [1], p. 27) that 4 has a positive eigenvalue r(A), called the Perron root 
of A, such that 


JA] < r(A) 


for any other eigenvalue A of A. A great deal of research has been done on finding bounds for 
r(A). (See, for example, [2] and [3].) When 4 is a nonnegative symmetric matrix, the inequality 


S(A)/S(1) < S(4°)/S(A*) 


is needed to obtain certain bounds for the Perron root of A. (Here J denotes the identity matrix.) 
This inequality, however, does not hold for all nonnegative symmetric matrices 4 of order greater 
than three [4]. Using Proposition 1, we easily prove Theorem 1. 


THEOREM 1. Let A be a nonnegative symmetric matrix. Then 
S(A)/S(1) < S(A*)/S(A). 


1985] NOTES 279 


Proof. We need only prove the theorem when 4 is an integral matrix; it follows that the 
inequality will hold for rational entries and, by passing to the limit, the inequality will also hold 
for real entries. 

Let G be the multigraph whose adjacency matrix is A. Since d, denotes the degree of the 
vertex v,, it is clear that d, is also equal to the sum of the elements of row i of A. Hence 


S(A) =d,+d,+--- +d,. 
The Cauchy-Schwartz inequality with (1,1,...,1) and (d,,d,,...,d,,) gives 
(d,+d,+---+d,) <n(d? +d? +--- +d). 
But S(/) = n. Therefore, by Proposition 1, 
(S(A))° < S(1) S(4’) 
which gives our result. 


This theorem has already been proved in [5]. Our proof, however, is geometrical using 
Proposition 1. Later we prove Theorem 1 without the restriction that A be nonnegative. 


4. Generalization of Counting Argument. Again let G be a multigraph with n vertices and let 
A = (a,,) be its adjacency matrix. Let 


d.(k) = number of paths of length k beginning at vertex v,. 
(Note d,(1) equals d,, the degree of v,.) Then it is well known that 
d,(k) = sum of the elements of row i of the matrix A“. 
Hence 
S(A*‘) = di(k) +. d,(k) +--+ +d,(k). 


Consider now the number P,(2k) of paths of length 24 in G. It is well known that P,(2k) is 
equal to the sum of the elements of the matrix A?*. On the other hand, as with paths of length 
two, each path of length 2k has a midpoint and the number of paths of length 2k with midpoint 
v, is precisely d,(k)?. Thus we have the following generalization of Proposition 1. 


PROPOSITION 2. Let G be a finite graph with adjacency matrix A. Then 
P(2k) = S(A**) = i(k)’ + d,(k)° + ++ +, (k)’, 
where d,(k) is the number of paths of length k beginning at vertex v,. 


5. Recursion Formula and Theorem 2. Consider now d,(k + 1), the number of paths of length 
k + 1 beginning at vertex v,. Then a,,d,(k) is equal to the number of paths of length k + 1 
whose first vertex is v, and whose second vertex is v,. Hence the sum of these numbers for all the 
vertices v, gives us d,(k + 1). 


PROPOSITION 3. Let G be a graph with adjacency matrix A = (a,,). Then 
d(k+1)=Qoa,,-d,(k). 
J 


We are now able to prove the following theorem. 
THEOREM 2. Let A be any nonnegative symmetric matrix. Then for any positive integer k, 
S(A2*+2) < e(A)S( A), 
where c(A) is the sum of the squares of the elements of A. 


Proof. As with Theorem 1, we need only prove the theorem when 4 is an integral matrix. By 
Proposition 2 and Proposition 3, 
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2 


s(4**?) = Pd (k +1 =E(La,d(4)] 
l l J 
By the Cauchy-Schwartz inequality and Proposition 2, for each i, 
2 
[Da,4(e)} <(Laz)(La(4") = (Laz) sa), 
J J J J 
Therefore 


s(A**?) <E{ Yai} s(4%) = (Laz) S(a4) = ef 4) SA), 


I J i, J 


Thus the theorem is proved. (Observe that c(A) does not depend on k.) 


6. Generalization of Theorem 1. The restriction that A be nonnegative in Theorem 1 is not 
necessary. 


THEOREM 3. Let A be a symmetric matrix. Then 
S(A)/S(1) < S(A’)/S(A). 


Proof. Let d, denote the sum of the elements of row i of A. Then 
S(A) =d,+d,+---+d,. 
Using A = (a,,), we have 


S(A*) _ » ¥ (Lana), 
ty k 
Using a,, = a,,, and interchanging the summands, we have 
S(4) =¥(E Lae, )=L(Lau|( Lae}. 
k a | k l J 
But 
\a,, = d, and a, =d,. 


J 


Hence 


S( A?) =)od,d, = d? + dj +--+ +d?. 
k 


The theorem now follows from the Cauchy-Schwartz inequality as in Theorem 1. 


REMARK. Although the proof is actually only a few lines, it lacks the geometrical interpretation 
of the term dj. 
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WIENER CHAINS 


KURT KREITH 
Department of Mathematics, University of California, Davis, CA 95616 


1. Introduction. The study of Markov chains constitutes a useful and important enterprise 
which deserves to be pursued for its own sake. However, for many students it also represents 
essential preparation for a profound and challenging generalization—the study of stochastic 
differential equations. 

This note deals with an exercise which, if presented in the context of stationary Markov chains, 
should also prepare the student to understand some of the basic ideas underlying continuous 
processes whose randomness is the result of “white noise’’. It is based on a rather obvious discrete 
analogue of the Wiener process, one whose properties will not surprise anyone familiar with the 
continuous version. Indeed, the only remarkable thing about the ideas which follow is that they 
are not embodied among the standard exercises of introductory texts dealing with Markov chains 


[1], [2], [3]. 


2. Some Motivational Experiments. Motivated by the classical “gambler’s ruin” situation, we 
ask the student to consider two individuals who agree to toss a fair coin until the difference 
between the total number of heads and tails exceeds some large number, say d. Instead of 
calculating the distribution corresponding to this difference directly, we proceed empirically by 
representing the resulting stationary Markov chain in terms of a (2d + 1) X (2d + 1) transition 
matrix P with transition probabilities 


(2.1) P(sx+1)=P(mx-1) =2 


for all x © Y= {-d, —d+1,...,0,1,...,d}, except that P(—d, —d) = P(d,d) = 1. Adopt- 
ing the notation [2] 
m,(x)=Pr(X,= x); %, = (4,(—d),...,,(0),.-.,%,(4)), 
we readily verify by several applications of 
(2.2) 7.,=7,P 


that an initial state X) with 7)(0)=1 leads to X, whose distributions involve binomial 
coefficients. That is, 


1 1 
iy = (0, :9,0,5 0, 5 0,9, 0}, 
_ 1 1 1 
rp) (0, 0, 470 5 79; 4° 0), 
- 1 3 3... I 
fy = (05.225 05 3 005 3 50s g 92-250) ete, 


The reason for this pattern can of course be determined by showing that as long as n < d, 
X, = 2Y,—n, where Y, is binomial with mean n/2 and variance n/4. This provides the 
theoretical explanation (which might best be suppressed for the moment) underlying the empirical 
observation that the X, generated by this process have distributions with “binomial patterns” and 
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that they satisfy 
E(X,)=0; — Var(X,) =n 
for n = 0,1,...,d. 


To eliminate the zeros which separate the binomially patterned states in the above 7, (i.e., to 
deal with the periodicity of this particular chain) one can modify (2.1) so that 


1 1 
(2.3) P(x,x+1)=P(x,x—-1)=7, P(x, x)=, 
except for the special cases of P(—d,—d) = P(d,d) = 1. Starting with 7)(0) = 1, one again 


obtains X, whose probability functions give rise to binomial patterns symmetric about x = 0. 
Specifically, we now find that 


= 1 

7 = [0,.. 10,0, 353541000, 0), 

— 1 46 4 21 

ty = (0,...40, 61616? 16° 160°): 


G4 GH" 64° 64 64 64? AP 7) 
for which E( X,,) = 0 and the variance grows with n according to the rule Var( X,,) = n/2 (as long 
as n < da). As before, there is a simple theoretical explanation—1.e., that X, + n is binomial with 
parameters 2n and 5. 

The point of such experimentation is to get a feel for the special role which binomially-related 
distributions play in certain Markov processes. While they do not constitute stationary distribu- 
tions, they do “reproduce’’, in the sense that if 7, is the probability vector associated with an 
appropriately translated binomial X,, then 7,,, = 7,P will contain the same binomial pattern, 
but with a larger variance. 

The next problem is to gain some insight into the kinds of Markov processes which serve to 
reproduce binomially-related distributions in this way. Such insight follows readily from the 
relation 


(2.4) Tin = Ty, P" 


m+n 


which shows that powers of the matrices determined by (2.1) or (2.3) also have reproducing 
properties similar to P. To find the form of such P” it is not necessary to compute matrix 


products. Rather, choosing 7 with m(y) = 6,,, the relation 


(2.5) i, = % P™ 


m 


shows that the xth row of P” is just 7,. Thus it is transition matrices whose xth rows are 
symmetrically distributed about y = x in a binomial pattern which have the previously observed 
reproducing property vis-a-vis binomial distributions. All this is based, of course, on the 
assumption fhat d is sufficiently large so that the end conditions P(—d, —d) = P(d,d) = 1 do 
not enter into the matrix computations. 


3. The Formalism. At this point the student should be prepared to let d — oo and to accept 
the following definitions. 


DEFINITION 1. A random variable X is no*/2-centered binomial (with mean 0 and variance 
no’ /2) if (X, + n)/o is binomial with parameters 2n and 4. 


Thus the random variables X,, X,,... generated by (2.3) and X) = 0 have now been dubbed 
“n/2-centered binomial”. Motivated by the process observed under assumptions (2.3) we go on to 
define a Wiener chain as a stationary Markov chain W,, W,, W,,... whose state space consists of 
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all integers and for which 


(ij) M=0, 
(3.1) (ii) W, is no* /2-centered binomial, 
(iii) Ifn, <n, <n;<n,< +--+, then W,,, — WW, — WW, 
W,,, — W,,>--- are mutually independent. 
This definition has an important consequence. 
THEOREM 3.1. If W,W,,... is a Wiener chain, then W,,,,, — W,, is no* /2-centered binomial. 
Proof. Because W,,W,,... 18 assumed to be a stationary Markov chain, we have 


Pr( Wan — W,, =x) =) Pr(W,,,,, = x + y and W,, = y) 
y 


=)) Pr( W,,,, =x + y|W,, = y)Pr(W,, = y) 
y 

=) Pr( W,,,,, = x|W,, = 0)Pr( W,, = y) 
y 


= Pr( Wratn = x|W,, — 0) 
= Pr(W, = x|W = 0). 


However, since W, = 0, the last sum reduces to Pr(W,, = x), as was to be proved. 


It is also easy to verify that the transition probabilities giving rise to a n/2-centered binomial 
Wiener chain are just those of (2.3): 


1 
4’ 
More generally, if A is a positive integer we can find the transition probabilities P(x, y) which, 
together with W = 0, give rise to a nk* /2-Wiener chain. Specifically P(x, y) is the probability 
function of the random variable Y,, where (Y, + x + n)/k is binomial with parameters 2n 
and 3. 

This structure provides the basis for a variety of exercises which illustrate, in a discrete context, 
the phenomena central to “white noise” processes—e.g., diffusion, semigroups, and reproducing 
distributions. 


(3.2) P(x,x +1) =P(x,x-1)= P(x,x) = 5. 


4. An Application. As is the case with most aleatory processes, the central ideas related to 
Wiener chains are well illustrated in a gambling context. Consider a conservative investor who, 
while willing to take substantial risks, also seeks a complete mathematical description of the 
uncertainties he faces. With initial capital X, and an income of b, at the end of the nth month, he 
plans to invest in a venture whose monthly gains and losses are described in terms of a random 
variable Y,. He is able to vary his monthly commitment to the investment fund by means of a 
multiplicative constant a, > 0. Thus his financial situation can be described by means of a 
stochastic difference equation 


(4.1) X, = Xn-4 + b, + a,(¥, - Y,-1) 


for which X) is given and Y) = 0. 
Such difference equations are readily summed to yield 


(4.2) X, = Xo + 5+ ~ a(Y¥, - ¥_4). 
j=l j=l 
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Assuming that the investment venture is “fair” in the sense that E(Y,) = 0 for 7 = 1,2,..., we 
can take expectations of (4.2) to establish the intuitively obvious fact that 


nN 
(4.3) E(X,)=X%)+ Lb. 
J=l1 

However, the questions which really interest our investor are not answered by (4.3). Concerns 
such as, ‘““What are the chances that I will go broke?’’, or hopes such as, “Can I become a 
millionaire in two years?”, require more information about the distribution of X,. For most 
investment schemes such information is very hard to come by, but there are some special forms of 
stochastic difference equations for which we can obtain an explicit representation of X,,. 

For example, it is readily shown from (4.2) and (4.3) that if E(Y,) = 0 for 7 = 1,2,..., then 


H 2 
(4.4) Var( X,) = El} L a,(¥, — ¥-1) 
J=1 
However, in most situations the computation of the right side of (4.4) poses a major challenge. 
The point which was made by our experiments of Section 2 is that the situation is very different 
if one invests in a nk* /2-Wienerspiel—i.e., an investment program whose payoffs are given by W, 
instead of Y,. For now the equation 


(4.5) X,=X%+ b+ Yal(w-w_,), 
j=l j=l 


combined with the fact that the sums of independent binomial random variables are again 
binomial, shows that X, will itself be a shifted binomial random variable whose distribution is 
fully determined by the mean and variance. Since the mean of X, is given by (4.3), it is 
straightforward to use (4.4) and the independence of the various AW, = W — W,_, to determine 
that 


n 


Var( X,) = ¥ aE(W— Ww.) 


thereby providing a complete mathematical description of the process. 


5. Epilogue. It only remains to rewrite (4.1) in the form 
(5.1) AX, — b, + a, AY, 


and to ponder a passage to the limit as investments are compounded at increasingly shorter time 
intervals. One is tempted to represent the limiting process in the form 


(5.2) dX = b(t) + a(t) aY, X(0) = Xo 


and to pose analogous questions in this continuous case. For a very special class of dY—.e., for 
dY = dW, when dW is a form of “white noise” —equation (5.2) also allows a complete solution 
in terms of normal, rather than binomial, distributions. 

But this is another story, one which this note was intended to motivate rather than tell. A very 
readable introduction to continuous Wiener processes is given in [2]. 
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THE BISECTION METHOD: A BEST CASE ANALYSIS 


A. FINBOW 
Department of Mathematics, Saint Mary’s University, Halifax, Canada B3H 3C3 


Let p be a polynomial of degree n with integer coefficients and no rational roots in the 
interval [a,b], where sign ( p(a)) # sign (p(6)) and a and b are rational. Then we are in a 
position to use the bisection method to approximate a root of p lying in this interval. In this note 
we compute a uniform (in p) lower bound on the number of iterations required of this method to 
obtain an e-root for p (i.e., a point x such that | p(x)| < e.) 

The bisection method for approximating roots is usually introduced in a first course in 
Computing Science and works for any continuous real function f of a real variable. The algorithm 
is recursive and the (k + 1)st step (k > 0) requires of its predecessor an interval [a,, b,] with 
sign (f(a,)) # sign (f(b, )), and an approximate root 7, lying in this interval. It then proceeds as 
follows: if r, is a root, the algorithm terminates; otherwise a new interval and approximate root 
are computed by 


[7,5 5,], if sign ( f(7,)) = sign( f(a,)), 


[a,5r, |, otherwise, 


[ans Ona] = | 


and 


ea. = (ay 44 + by 41) /2. 


The user must seed the algorithm with an initial interval [a,b] with sign ( f(a)) ¥ sign (f(b)) 
at which point (the first step) the assignments a, = a, b, = b and r, = (a + b)/2 are made. 

The intermediate value theorem implies that the approximation r, is within (b — a)/2* of a 
root regardless of f. If, however, one requires that 7, be an e-root, then an upper bound on the 
number of iterations depends upon the function. (If, for example, f is differentiable and a bound 
M on {|f’(x)|: x € [a,b]} is known, then |f(r,)| < (b — a)M/2*.) The same is true in general 
of a lower bound, but we can do better when an irrational root of a polynomial with integer 
coefficients is being approximated on an interval with rational endpoints: 


THEOREM. Let p(x) =c,x" +.¢,_,x"7! +++: +¢9 be a polynomial with integer coefficients 
and no rational roots in |a, b], where sign ( p(a)) ¥ sign (p(b)) and where a = d/e and b = g/h 
for some integers d,e,g,h withe > 0, h > 0. Then, if {r,} is the sequence of approximate roots 
obtained by applying the bisection method to p in the interval [a, b], a necessary condition for r, to be 
an &-root iS 


1 _ 
k> ~ log, (ee"h") 7 


Proof. Note that for each k, r, = s,/t, for some integer s, and t, = 2*eh. Now since r, is 
rational, we can conclude that 


cst +e. jst le tes tent? 
0+ |p(m)| = a n—-1"%k k Qrkt 


n 


Ly 


The numerator of this fraction is thus a positive integer and thence at least 1. Therefore, we 
conclude that 


Ip(n)| > te" = (eh2*) ", 
from which our result follows. 


We remark that the hypothesis that p have no rational roots in [a, b] can be weakened to say 
that p have no roots in [a,b] of the form r = a + (b— a)i/2/ for integers i and /, since this 
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weakened hypothesis is equivalent to nontermination of the algorithm (cf. “Linear Convergence 
and the Bisection Algorithm” by Edwin H. Kaufman, Jr., and Terry D. Lenker, submitted to this 
Journal). 

Finally, we remark that there is no such polynomial-independent lower bound on the number 
of iterations k required in order that 7, be within distance e of a root. Indeed, pick an integer W@ 
such that YM > (b — a)/2 and YM is irrational. Then the family of quadratic polynomials 


{ c?x? — 2cdx + d* — c?M: c and d are integers with c # 0} 


has the property that (a) the set of roots of its elements is a dense subset of the irrationals, while 
(b) each of its elements has at most one root in [a, b]. 
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E 3083. Proposed by Gunnar Blom, University of Lund and Lund Institute of Technology, 
Sweden. 


If the relation 
xve+y4+r=(x+y)(xt+z)(y +z) 
is satisfied by xy, yy, Zo, where 
(x6 + y5 — 25) (x6 + 25 — yo) ye + 25 — x5) ¥ 0, 
then it is also satisfied by 
1 1 1 


Xj = 3 yi —= Z] = . 
2 2 2 2 2 2 2 
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E 3084. Proposed by Paris Pamfilos, University of Crete, Greece. 


Given a family of circles in the plane all of which pass through a common point and no two of 
which are equal, show that there is another circle enveloping all the circles of the family if and 
only if there is a straight line containing all the intersection points of the common tangents of any 
two circles of the family. 


RK 3085. Proposed by T. C. Lim, George Mason University, Fairfax, VA. 
Let g(t) be the unique nonnegative solution of 
{w+ g(u)}" + |u— g(w)l? = 2p, 
where 1 < p < 2 and0 <p < 5. Prove that 
{1-—pt g(u)}" +l —w- g(w)/? < 201 — 4). 
E 3086. Proposed by Dennis Spellman, Sacred Heart University, Bridgeport, CT. 


If c and m are positive integers each greater than 1, find the number n(c,m) of ordered 
c-tuples (n,,”>,...,n,) with entries from the initial segment {1,2,..., 7m} of the positive integers 
such that n, <n, andn,<n,< °°: <n 


Cc’ 


E 3087. Proposed by Weixuan Li, University of Waterloo, and Edward T. H. Wang, Wilfrid 
Laurier University, Waterloo, Ontario, Canada. 


Let a,, i= 1,2,...,n, be real numbers such that 0 < a, < 1, where n > 2. Find a best upper 
bound for 


2 2 2 
Si = (a, — ay) +( ay — a3) toe +(a, _ a,) ; 


and determine all cases when this bound is attained. 
SOLUTIONS OF ELEMENTARY PROBLEMS 


Summing Square Roots 


FE 2941 [1982, 273]. Proposed by Richard Johnsonbaugh, Chicago State University. 
In L. W. Gates, Summing square roots, Math. Gaz., 64, No. 428, June 1980, pp. 86-89, 


1 1/2 
S = 54 |(8n + 5)(4n +1)? — 5] 
is proposed as an approximation to 
G,= 0”. 
t=1 


Prove that lim, _,,,(S,, — G,) exists. What is the value of the limit? 


Solution by Chico Problem Group, California State University, Chico. With the proposer’s 
notation (one suspects he intended to label S, and G, in the opposite way) we show 


; 5 1 
fim (8, = G) = ~ ag {3} 


where §(s) is the Riemann zeta function. 
First, rewriting S with the aid of Taylor’s formula 


1 1 
n+— =y¥n+—=—+ O(n 3”), 
yrtg an 7 Oln) 
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we have 
s,=2(n+3)(n+4)?-3 
n 34" q a4 24 
2 5 5 
=n n[in + =| + TV" — 54 + (1) 
_ 2 1 5 
+ ne’ = yl/2 _ 
(1) =" + 50 74 + o(1). 
Hence, 
: — 2 3/2 1 1/2 _ 3 
(2) hm (S, G,,) ~~ hm | 3 n + 4" G,, 4 


provided the latter limit exists. 
From the Euler-Maclaurin summation formula we have 


P(x) | 
G,=5(1 + vn) + [Vx dx +5" ia a. 
where P,(x) = x — [x] — 5. Thus 
—23y,1 21 1 n P,(x) 
(3) G, =3n +n 6tal my dx 


The periodicity and alternating sign properties of P,(x) insure that {7°P,(x)$(x) dx is finite for 
any positive, continuous function (x) which decreases monotonically to zero as x — +00. Thus 
we can write (3) as 


—23yn,1 iz _li_l 0 P,( x) 
G, = 3" tan sta af dx + o(1), 
and comparing this with (2) we have 
; _ 1 1; P,(x) 
(4) Jim (5, - G,) = ~ag- gf saa & 


with the limit clearly existing. It remains to evaluate this last integral. 
By a standard method for continuing the zeta function analytically Qn ~° is written as the 
Stieltjes integral /7°x~°d(x — [x]) which is then integrated by parts) we have the formula 
ox —[ x] 


(5) (3) ay sf ASE x, 


Xx 


where the right-hand side is defined and analytic for Re(s) > 0. 
Replacing the numerator in the integral of (5) by P,(x) = x — [x] — 4 gives the formula 


(6) s(s) = $ 5 f° dx. 


where now the integral converges and represents an analytic function for Re(s) > —1. Substitut- 
1 


ing s = — 5 in (6) yields 


[72D ae 4429-3), 


xi/2 


which along with (3) gives 


im (5, - 6) = 4-4). 


Hi—- 0 


No closed form expressions for {(— 5) are known. A decimal approximation [1] is 


[=] = —(0.20788 62249 77354 56602, 
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so that 
lim (S, — G,) = —.00044 71083 55978 76732. 


neo 


Reference 
1. J. W. Wrench, Jr., private communication, cf. Math Comp., V. 22 (1968) 687-688. 
Also solved by P. S. Bruckman, G. H. Gonnet (Canada), T. K. Louton and C. C. Rousseau, O. P. Lossers (The 
Netherlands), W. A. Newcomb, A. Stenger, S. K. Venkatesan (India), P. Y. Wu (Taiwan), and the proposer. 


A Triangle Inequality 


E 2958 [1982, 498]. Proposed by M. S. Klamkin, University of Alberta. 


Let x,y,z be positive, and let A,B,C be angles of a triangle. Prove that x* + y* +z? > 
2yzsin(A — 7/6) + 2zx sin(B — 7/6) + 2xysin(C — 7/6). 


Solution by C. S. Karuppan Chetty, Regional Engineering College, Tiruchirapalli, India. 
Solution I. Let 
f(x, y,z) =x* +y*+ 27 — 2yzsina — 2zx sin B — 2xysiny, 
where a = A — 7/6, 8B = B—a/6 and y = C — 7/6. Since 
sina = cosBcosy — sinf siny, 
we can express {(x, y,Z) as 
(x — ysiny — zsinB)° +(ycosy — zcosB)’ 


and the result follows. 
In fact, the given inequality holds for any real x, y, z and for all A, B,C with A+ B+ C=1n7. 


Solution IJ. Defining f(x, y, z) as above, the matrix of the quadratic form / is 


1 —siny -—sinB 
U =] —siny 1 — sina 
—sinB -—sina 1 


It is easily seen that det(U) = 0, and hence the leading principal minors of U are 1, cos*y and 0. 
Therefore, f is positive semidefinite and the result follows. 


Also solved by K. L. Bernstein, K. V. Bhagwat (India), R. Breusch, Z. A. L. Gedcze (Brazil), R. Heller, W. 
Janous (Austria), H. Kestelman (England), L. Kuipers (Switzerland), M. J. Lempel, O. P. Lossers (The Netherlands), 
A. Marguina (Spain), V. D. Mascioni (Switzerland), W. A. Newcomb, T. Riessinger (West Germany), K. Rogers, 
O. G. Ruehr, I. A. Sakmar (Canada), K. L. Stellmacher, G. A. Tsintsifas (Greece), M. Vowe (Switzerland), and the 
proposer. Partial solutions by M. Bencze (Romania) and E. Braune (West Germany). 


Inradii-circumradii Products for Tetrahedra 


E 2962 [1982, 593]. Proposed by M. S. Klamkin, University of Alberta, Canada. 


It is known that if the circumradii R of the four faces of a tetrahedron are congruent, then the 
four faces of the tetrahedron are mutually congruent (i.e., the tetrahedron is isosceles) [1]. It is also 
known that if the inradii r of the four faces of a tetrahedron are congruent, then the tetrahedron 
need not be isosceles [2]. Show that if Rr is the same for each face of a tetrahedron, the 
tetrahedron is isosceles. 


References 


1. Crux Mathematicorum, 6 (1980) 219. 2. Crux Mathematicorum, 4 (1978) 263. 
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Solution by O. P. Lossers, Eindhoven University of Technology, The Netherlands. Let ABCD be 
a tetrahedron. We put BC = a, CA = b, AB = cc, DA = a,, DB = b,, DC = c,. The fact that the 


product of the circumradius and the inradius is the same for each face is then expressed by 


abc ab,c, a,be, a,b,c 
atbte atb,t+ce, a +b+e a+b+e 7" 
The first equality is equivalent to 
bb,(c — ¢,) + cc,(b — b,) = aC b,c, — bc), 
or 
(1) c(a+c)(b— b,) + b(b, + a)(ce — ¢,) = 9. 
We have two more equations obtained from (1) by cyclic permutation, namely 
(2) c(c, + b)(a-—a,) + a,(b+ a)(c-—¢,) = 9, 
(3) b,(b+ c)(a—a,) + a(a, + c)(b—- b,) =0. 


Considering (1), (2) and (3) as a system of linear equations for the unknowns a — a,,b — b,,c — 


we observe that the determinant of the system has the form 


O p q| 
r QO s| = pst + gru 
t u O 


Cy 


and does not vanish because all its elements are obviously positive. Hence a = a,, b= b,, c= G, 


which proves the statement. 


Also solved by L. Bankoff, J. Heuver (Canada), M. R. Railkar (India), G. Tsintsifas (Greece), and the proposer. 


ADVANCED PROBLEMS 


Solutions of these Advanced Problems should be mailed in duplicate to Professor D, H. Mugler, Department of 
Mathematics, University of Santa Clara, Santa Clara, CA 95053, by August 31, 1985. The solver’s full post-office 


address should be on each sheet. 
6493. Proposed by M. L. Glasser, Clarkson College. 


Show that for a > 0 


00 00 sind, X, sina, x, Sindy(x, + --: +x,) 
/ dx, ++ / dx,-— 
— & — & 


ee AER A = of min{ ay,..., a; } 


xy Xx Xp torts TX, 


(see 5314 [1965, 759]). 


6494. Proposed by A. Wilansky, Lehigh University. 


Let X be a commutative Banach algebra with 1 and M, N maximal ideals. Let S be the linear 
span of {mn: m © M, n € N}. Show that the codimension of S must be finite if M # N. (This is 


not necessarily true if M = N.) 


6495. Proposed by Robert E. Shafer, Berkeley, California. 


For x > 1, define 


Yx) = im, Tox +N) ~ y |. 


nao x tn 
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P(x) = im toe ot» + N) —  Grawero} 


Prove that P(x) > logw(x) for x sufficiently large. 


SOLUTIONS OF ADVANCED PROBLEMS 
A Right-Angle Preserving Mapping 


6436 [1983, 485]. Proposed by the late H. Kestelman, University College, London. 


Let V be a real vector space normed by a scalar product. Let f: V — V be such that whenever 
a,b,c are vertices of a triangle right-angled at a, then f(a), f(b), f(c) are vertices of a triangle 
right-angled at f(a). Show that f is a continuous collineation. 


Solution by Alain Tissier, Montfermeil, France. We assume that the dimension of V is at 
least 2. 


LEMMA 1. Suppose 
(1) A, 4,43, 44 are the vertices of a square with centre b. 
Then f(a,), f(a), f(a3), f(a@4) are the vertices of a square with centre f(b). 
Proof. Clearly (1) implies 
(2) (a,— b) -(a,,, — 6) = Oand (a, — 4,41) (4,41 — 442) = Oforl <i< 4, 
where a,,, = a,. Conversely it follows from (2), by the Pythagorean theorem, that, for 1 <i < 4, 
lla, — Bll? + Ylang — BU? = Na, = all? 
and 
a, — @,4 all? + lia — @aall? = lla, — @,42ll?, 
whence, by an easy computation, 
I]a, — 43] = lla. — agll =: 9, 
and then 
4,41 — 4,[| = 8/V2 s\la, — 5I| = lla — Bll =:4, 
and 
\|]@2 — || = |laq — bl =: B. 


But, by the triangle inequality, 5 < 2a and 6 < 28, so that a = B = 6/2. Thus (2) is equivalent 
to (1). Next, if follows from (2) and the right-angle preserving property of f that (2) holds with 
f(a,), f(P) in place of a,, b respectively. This completes the proof of Lemma 1. 


Let A be the set of numbers in [0,1] of the form m/2”, where m,n are nonnegative integers. 
LEMMA 2. Ifa,b&V,X EA, then f(1 — A)a+Ab)=(1—A)f(a)+Af(d). 


Proof. Given a,b © V, there are points c,d © V such that a,c,b,d are the vertices of a 
square. Hence, by Lemma 1, 


2) = 5) + £(0)). 


The proof can now be easily completed by induction. 
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LEMMA 3. For each a © V there is a mapping h,: [0,00) — [0,00) such that ||x — al| = p 
implies \|f(x) — f(a)|l = 4,(p). 
Proof. Given a € V, p > 0, there are points b,c € V such that 
|b — al|=|lc—all=p and a=(b+c)/2. 
Then ||x — a|| = p is equivalent to (x — b)- (x — c) = 0 which implies 
(f(x) —f(b)) (f(x) — fCe)) = 0. 
Since f(a) = (f(b) + f(c))/2, it follows that ||x — a|| = p implies 
f(x) — f(a) Il = IIfC@) — FCP) II 
which depends on a and p but not on x. 
LEMMA 4. Ifa © Vand 0 <r< op, thenh,(r)<h,(p). 
Proof. There are points b,c © V such that 
lle —all=r<||b-—all=p and (c—a)-(c— 5b) =0. 
Hence 
(f(c) —f(a)) (fc) — f(b)) = 0 
and so, by the Pythagorean theorem, 
h,(p) ={1f(a) — f(b)IP = Fe) — FCayIP + ILFCe) — FOB)IP 
>lIf(c) —f(a)iP =ACy. 


To prove f is continuous, it suffices, in view of Lemmas 3 and 4, to show that 
lim, .,.4,(2 "p) = 0 for every a € V, p > 0. This is a consequence of Lemma 2 which yields, for 
lla — b|| = p and c=(1—2°")a +27"), that 

h,(2-"p) = ||fCc) — fa) Il = 2-"1f(8) — f(a) I] = 2°", (). 
Since A = [0,1], it follows from Lemma 2 that 
f((1—A)a+Ab)=(1—A) f(a) +Af(d) 
for every A € [0,1] and hence, by a simple argument, that the identity holds for every real A. This 
shows that f is a collineation. 


Finally, it is easy to deduce from the above that || f(a) — f(x)|| = &,|la — x|| where k, is 
independent of x, 1.e., f is a similarity. 


Also solved by Aage Bondesen (Denmark) and the proposer, and partially solved by L. E. Mattics. 


An Exponential Sum 


6442 [1983, 648]. Proposed by Vladimir Naroditsky, San Jose State University. 


Show that if 


N 
lim a,exp (iA, t) 
=1 


[> 0c k= 
exists, then either all A’s are 0 or all a’s are 0. 
Solution by Piotr Biler, Institute of Mathematics, University of Wroclaw, Poland. The proposi- 


tion as stated is evidently incorrect. For example, the sum is constant if either a,A, = 0 for 
k = 1,2,...,N or all the A’s are equal and a, + a, + -:- +a, = 0. Suppose that the A’s are all 
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T 
different nonzero real numbers. Denote the sum by S(t) and its limit by S. Since T~* i S(t)dt > 0 
0 


as T > oo, it follows that S = 0. Hence, for k = 1,2,...,N, S(t)exp(—iA,t) > 0 as t> c 
and so 


0 = jim ean S(t) exp(—id,t) dt = a,. 


Also solved by Preben Alsholm, Bennett Eisenberg, J. A. Goldstein, Michael B. Gregory, J. P. Henniger 
(Canada), N. J. Lord (United Kingdom), Jacques Moisan (France), Jean-Marie Monier (France), William A. 
Newcomb, M. Pachter (South Africa), Abraham Smuckler (Israel), Allen Stenger, Alain Tissier (France), Wolfgang 
Walter (West Germany), and the proposer. 


A Probability Inequality 


6443 [1983, 648]. Proposed by C. Thorn and B. Tomaszewski, University of Wisconsin. 


Let a, >a, > +: 2 4),4, > O be a decreasing sequence of positive real numbers and let «,, 
E>,.--,€),4, be a Bernoulli sequence of independent random variables, ie. P(e, = 1) = 
P(e, = —1) = 5. Prove that 

2n+1 2n+1 


< a, 


1 
Y 4, a) (rt 7). 
- Jentl n 
i=1 
Solution by Lajos Takacs, Case Western Reserve University, Cleveland, Ohio. Let {b,} be a 
nondecreasing sequence of positive real numbers, let sy = 0 and, for n = 1,2,..., let 


S, = be, + bbe + +++ +bie t, =& +& +--+ +6,. 


nn? 


» E,a, 
1=1 


Let 


Q,,(x) = P(s, > x) + 5 P(s, =x). 


Then the required inequality can be expressed as 


20,,(b,) <1 - x([n/2)| 


when n is an odd integer. We shall in fact show the inequality to hold for every positive integer n. 
Evidently 


2Q,,(x) — O,-1(x a b, ) + Q,1(x + b,) for n = 1,2,... , 
Since Q,(x) > on? when x < y, we have 
2Q,,( jb, ) Q,- (Cj - 1) b,_ 1) + Q,- (Cj + 1) b,- 1) for n J= 1, 2.... 
Hence Q,,(/b,) efi) for n, 7 = 0,1,..., where 


2f,(j) = tin hj -1) +h. i(j +1), 


and f,(0) = 1/2 for n=0,1,...,f)(/) = 90 for 7 =1,2,.... Since f,(/) = Q,,(/) in the case 
where b, = b, = -:: = 5b, =1, we obtain that 


2f,(1) =2P(t, > 1) + P(t, = 1) =1- P(t, = 1) 


=1- (575) | for n = 1,2 
7 x [n/21] Orn = 1,2,..., 


and the required inequality follows. 


Also solved by the proposers. 


294 THEODORE SHIFRIN [April 


On Landen and Pfaff Transformations 


6445 [1983, 709]. Proposed by Richard Askey, University of Wisconsin. 


Show that Pfaff’s transformation of the hypergeometric function 
a _ XxX 
»F,( 4; x) =(1- x) Ae, D. x | 
implies Landen’s transformation of. the dilogarithm 


n 


. = Xx 1 2 . 
Liix:= » i ~ 5 og — x)|° — Li,(x/(x - 1)). 
n=1 
Solution by Mourad E. H. Ismail, Arizona State University, Tempe, Arizona. It is easy to see 
that, for —1 < x <1, 


__ 1,1, ftp fp (11. 
log(1 — x) = <2 Fi , | and Li, x ret a 5 st) at 


The Pfaff transformation and the above integral representation give, for —1 < x < 1/2, 
_ _ = 11. of dt __ ft (yh | du 
Li,(x/(x 1)) i mal 9) Ars fom 9) »U l—-w 


= [a0 4, =f, + T=} os —u)du 


o u(l — uw) l-—u 


1 
—Li,x - 5 [log (1 — xy]. 


Also solved by Paul S. Bruckman, Henry E. Fettis, C. Georghiou (Greece), Ira Gessel, M. L. Glasser, L. Kuipers 
(Switzerland), William A. Newcomb, Robert E. Shafer, and the proposer. 


REVIEWS 


EDITED BY ALLAN L. EDMONDS AND JOHN H. EWING 
COLLABORATING EDITOR FOR FILMS: SEYMOUR SCHUSTER 


Differential Topology: An Introduction. By David B. Gauld. Marcel Dekker, New York, 1982. 
v + 241 pp. $29.75. 


THEODORE SHIFRIN 
Department of Mathematics, University of Georgia, Athens, GA 30602 


The basic objects of study in differential topology are the differentiable (or smooth) manifolds 
and the differentiable functions on them. The prototype is the hypersurface X = { f(x, y,Z,) = 0} 
in R°, where f(x, y,z) is differentiable and Vf # 0 on X,i.., X contains no critical point of f. 
Surfaces, whether topological, polyhedral, or smooth, have interested many mathematicians, 
among them Descartes, Euler, Riemann and Poincaré. Smooth manifolds appear in all walks of 
life—as state and phase spaces in physics and chemistry, as indifference sets in economics, as 
orbit spaces in dynamical systems as space-time in general relativity, and so forth (cf. [1)). 
Problems in the calculus of variations, numerical analysis and applied mathematics are often 
phrased in terms of functions on manifolds (both finite- and infinite-dimensional). 

Roughly speaking, we may define a smooth n-dimensional manifold as follows. Topologically, 
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it is a locally Euclidean metric space: each point has a neighborhood which is homeomorphic to 
R”. Analytically, it admits calculus: one knows which functions ought to be called differentiable, 
and one has a well-defined tangent space at each point, as for the surfaces in multivariable 
calculus. It turns out that every manifold X may be viewed as sitting in some (large) Euclidean 
space R™; then it is not hard to prove that locally X is the zero locus of N-n independent 
functions (i.e., their gradient vectors are linearly independent along X). 


Fic. 1 


Consider the family of surfaces { f(x, y,z) = x* + y? — z* =c}. (See Fig 1.) At c = 0 the 
topological type of the surfaces changes dramatically with a small change in c, but for c # 0 the 
surfaces exhibit stability. What is so special about the case c= 0? Let f: R” > R* be 
differentiable; we say y € R* is a regular value of f if rank (df.) =k for every x € f ‘(y); 
otherwise, y is a critical value of f. The salient facts are the following: (1) (Implicit function 
theorem) If y is a regular value of f, then f-'(y) is a smooth submanifold of R”. (2) (Sard’s 
theorem) The set of regular values of f is open and dense in R*“. Here we see already the 
importance of two fundamental concepts in differential topology, as in mathematics generally: 
stability (openness) and genericity (denseness). We can now propose two routes to follow. The 
first is to study geometric properties arising from the generic or stable situation. The second is to 
infer information from the singularities themselves. 

In the first direction, one of the most elementary concepts is that of the degree of a mapping 
between manifolds of the same dimension—roughly, the number of times (counting orientations) 
that the domain covers the target under the mapping. The notion was first introduced early in the 
century by Brouwer from a combinatorial viewpoint, but it is a very natural setting for the 
differentiable viewpoint. Given a smooth map f: X — Y, with X compact and Y connected, 
choose a regular value y of f and count the (finite) number of points in f~‘(y) with signs. One 
concludes from the inverse function theorem and the topological assumptions we’ve made that 
this number is independent of the choice of y and is, indeed, invariant under perturbations of f 
(after all, how can an integer-valued function vary continuously?). As an elementary application 
of this, we mention the famous hairy ball theorem: any (tangent) vector field on the two-sphere S? 
must vanish somewhere. For if not, we could deform the identity map to the antipodal map by 
pushing in the direction of the vector field, and so the two mappings would have to have the same 
degree. As it happens, the identity map has degree +1 and the antipodal map, —1. 

In a similar vein, there is another famous elementary topological result. Call a map f: S* > S’ 
even if f(—x) = f(x), and odd if f(—x) = —f(x). It is not hard to see that an even map has 
even degree; what is not so obvious is that an odd map must have odd degree. From this one may 
deduce the fact that if g: S? — R®* is continuous, there is an x € S* with g(x) = g(—x); more 
colloquially, at any given time, there is a pair of antipodal points on the earth where the 
temperature and relative humidity are precisely equal. 

Once one has the idea of counting points, it is natural to broach the subject of intersection 
theory which plays a central role in topology, geometry, and algebraic geometry (and, indeed, as 
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index theory, in analysis). Given a manifold X and compact manifolds Y, Z Cc X, then it is a 
consequence of Sard’s theorem above that, after an arbitrarily small deformation of Z (say) in X, 
we may insure that Y and Z meet transversely, or in general position (i.e., so that the sum of their 
tangent spaces is the tangent space of the ambient space); it follows once again from the implicit 
function theorem that then Y M Z is a smooth submanifold of X. (See Fig. 2.) As a simple case of 
this, most mathematicians have at one time used the general position lemma that the slice of a 
surface in R? by the general plane is a smooth curve (in algebraic geometry, this comes under the 
guise of Bertini’s theorem and has far-reaching consequences). 


Z 
non-transverse 
transverse 


Fic. 2 


One particular case of intersection theory is that of complementary dimension, and when 
X,Y, Z, are all oriented, we may speak of the intersection number of Y and Z (in X). Such 
numbers turn out to be indicative of the global nature of X. For example, the classical Euler 
characteristic y(X) (= V — E+ F for a polyhedral surface) appears in just such a fashion; the 
link here is the beautiful result (due to Poincaré and Hopf) that this number is equal to 
the number of zeroes of a vector field on the manifold, counted appropriately. And one of the 
prettiest and most inspiring theorems of this century is Chern’s generalization of the classical 
Gauss-Bonnet theorem, which equates the Euler characteristic of an even-dimensional compact 
oriented Riemannian manifold to a curvature integral (cf. [4], [8], [12]). 

The goal of much mathematical work is, after all, the classification problem. The thrust of 
topology is to develop a list of properties (e.g., compactness, connectedness, separability) which 
distinguish among topological spaces. Algebraic topology translates this general problem into 
algebra—homology, homotopy, cohomology, bordism—and intersection theory is one geometric 
version of this (indeed, it was the classical approach of Poincaré and Lefschetz). In light of Hopf’s 
result, we see that the Euler characteristic of a connected manifold X gives the obstruction to 
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finding a vector field on X which is nowhere zero; it is a beautiful fact that the converse holds as 
well: if x( X) = 0, then there exists a nowhere-zero vector field on X (cf. [6] or [7]). 

Let us turn now to our second point of view; we wish to glean information about a manifold X 
from the singularities of a mice function defined on it. This is the domain of Morse theory. Let 
X C R™ be a compact n-manifold, and consider the height function f(x) = x, on X (see Fig. 3). 
By rotating the coordinate system shghtly if necessary, we may insure that f is a function with 
only nondegenerate critical points. That is, in any local coordinate system (z,,...,z,) on X 
centered at a critical point p, df/dz,(0) = 0 and the hessian matrix 


a’ f 
Ht = | dz,0z, | 
is nonsingular. Such a function f is called a Morse function on X. Note, by the way, that the 
critical points of f are precisely the points where the hyperplanes x, = constant fail to meet X 
transversely. Now, the number of negative eigenvalues of H is well-defined and 1s called the index 
of the critical point p. In Fig. 3 we have labelled the indices of the Morse function on the right. 


Let us begin by observing that as we pass through critical values, the topology of the preimage 
sets f~'(y) changes dramatically; this is the same phenomenon with which we began in Fig. 1. 


i 
oF 


(See Fig. 4.) Given our Morse function on X, we would like a prescription for concocting X out of 
“building blocks.” Here we adopt the simplest viewpoint. Let D* = {x € R*: ||x|| < 1}; we call 
this a k-cell; its boundary is the standard (k — 1)-sphere, 9D* = S*~!. In the purely topological 
category, ignoring differentiability, we can speak of a cell complex, a topological space built 
inductively by attaching k-cells D* to X,_,, the collection of all cells of dimension k — 1, by a 
continuous map ¢: dD* —> X,_,. For example, we may obtain the n-sphere S” by attaching an 
n-cell to a O-cell, as pictured in Fig. 5: 


The fundamental theorem here, linking algebraic and differential topology, states that any 
compact manifold X may be built out of cells, attaching one k-cell for each critical point of index 
k (for a given Morse function on X). Note that this gives many different cell structures for the 
same underlying space. For example, in Fig. 6 on p. 298, 
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FIG. 6 


for the sphere. Similarly, the standard torus with the Morse function in Fig. 3 is built from the 
standard picture shown in Fig. 7: 


FIG. 7 


The theory is very pretty and very geometric; nevertheless, to prove such a theorem, one must use 
the integral curves of a differential equation derived from the Morse function. 

Surgery theory is based on the following innocent observation. Suppose we have a copy of 
S* x D"~-* sitting in an n-manifold X; then, keeping its boundary the same, we can replace its 
interior by D**! x §"~*~!, since, after all, 


0( D**? x sr- kot) _ Sk x guvk-l _ a(S‘ x D"-*), 


This process, for example, turns the 2-sphere into the standard torus, as shown in Fig. 8: 
S° x D? 


© Ss D'xs! 


Fic. 8 


This seems an obvious, yet puzzling construction; for normally in mathematics, one performs 
modifications which preserve the invariants one is trying to study. But surgery blatantly changes 
the topology of the space. That ultimately is the point! We referred earlier to obstructions— surgery 
may be used to remove them, or at least to make them more manageable. It was to this end that 
Milnor, Browder, and Novikov, among others, invented the theory. For example, the high-dimen- 
sional Poincaré conjecture states that if X is a compact n-manifold without boundary, n > 5, 
with the homotopy type of a sphere, then X is homeomorphic to S$”. Indeed, in dimension five 
(and six), X is diffeomorphic to S° (resp., S°). The main point here is to prove that X is the 
boundary of a 6-manifold W that is contractible (i.e., homotopically a point); first one finds any 
old W (more or less), and then, using surgery techniques, one “kills” all its homotopy groups. So 
we have altered W, but produced a new manifold with boundary X which satisfies the 
prerequisites. Then the result follows from Smale’s h-cobordism theorem (cf. [10], [13]). 

We have mentioned only a few of the problems with which differential topology deals, largely 
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because these are the notions which the undergraduate texts currently available treat. We’ve 
omitted Smale’s handlebody theory, Thom’s work on cobordism (cf. [13]), all the ramifications of 
singularity theory, which has become a huge industry recently, intertwining algebraic geometry 
and differential topology (cf. [5], [2], [3]); we’ve also omitted immersion theory, characteristic class 
theory (cf. [12]), and of course the celebrated work on 3- and 4-manifolds recently by Kirby, 
Thurston and their schools. 

David Gauld’s book allegedly presumes no knowledge of point-set topology or advanced 
calculus, starting with a cursory treatment of both and then proceeding to the definitions of 
manifolds and Morse functions. I find it hard to believe that the undergraduate initiate can 
successfully handle the technicalities involved in the latter half of the book unless he is already 
well-grounded in both prerequisites. Gauld’s treatment of elementary surgery theory is, however, 
readable and almost too full of detail; perhaps it will make a good reference in that regard for a 
graduate student learning the theory. The book finishes with a treatment of the classification of 
compact, oriented surfaces. 

I am puzzled by Gauld’s bibliographical notes. He says that Wallace’s book Differential 
Topology: First Steps ({14]) “although different in aim and scope from this book, [is] at a similar 
level and covers more or less the same topics.” Indeed, having read both books with reasonable 
care, I can’t imagine how the aim and scope are different. Wallace covers more material in a terser 
fashion, and gives the reader a greater feeling for the impact of surgery theory. It seems to me that 
Gauld is more willing to get bogged down in the technical details. 

Surgery theory is unquestionably a beautiful and important part of modern mathematics; 
nevertheless, I find the subject and this book less appropriate for an undergraduate course than 
transversality theory and its applications. Admittedly, I am biased, having taught an under- 
graduate course based on [6] and [11] three times. These days, more of our students in pure 
mathematics courses come from computer science, physics, mathematical economics, and so forth. 
I believe this makes the course I prefer to teach even more appropriate, because it integrates linear 
algebra, advanced calculus, and point-set topology very strongly, and builds foundational strength 
by applying the transversality theory over and over again, obtaining a number of deep topological 
results meanwhile. It is true that the structure theory of manifolds, i.e., the geometry of Morse 
theory, is all but missing; but on the other hand, to actually do surgery theory takes plenty of 
transversality arguments (which Wallace mentions briefly) and much algebraic topology. But, 
indeed, I would like to campaign for such an undergraduate differential topology course in our 
curriculum. Unfortunately, mathematics courses at the undergraduate level tend to be far too 
independent from one another; this particular subject affords us the opportunity to demonstrate 
to our students the beauty in the unity of mathematics. 
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Examples of Commutative Rings. By Harry C. Hutchins. Polygonal, Passaic, New Jersey, 1981. 
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Commutative algebra, as a separate field of mathematics, is comparatively new. Yet already it 
has become one of the basic structures of mathematics. It arose to complement a trinity of 
nineteenth century subjects: algebraic number theory, invariant theory, and algebraic geometry. 

Like most fields still in a state of development, insight is searched for (and hopefully found!) 
by a long and careful study of many examples. The book Examples of Commutative Rings by 
Harry C. Hutchins thus provides a useful addition to the basic literature of commutative algebra. 
It is a good reference to complement a text for a student learning the subject. However, many of 
the deeper and more revealing examples which would be of interest to researchers are not 
contained in the book. For instance, important examples which deal with the behavior of rings 
under completions or group actions are not present, and very little material is devoted to the 
important topics of resolutions and Hilbert functions. 

Much of the present shape of the subject was dictated by problems in number theory and 
invariant theory. The realization that not all rings of algebraic numbers have the coveted property 
of unique factorization into prime algebraic numbers led researchers to seek other forms of unique 
factorization. Kummer successfully completed this program with his ideal numbers. In our later 
setting, his result states that every ideal in a ring of algebraic integers is uniquely (up to order) a 
product (or intersection) of prime ideals. 

Somewhat later (1890), Hilbert proved in a monumental paper that every ideal in a polynomial 
algebra is finitely generated, thus obviating to some extent the second main problem of invariant 
theory. At the same time, Hilbert proved his syzygy theorem, which became largely forgotten, and 
arose again with the advent of homological algebra, of which more will be said. (A syzygy of a 
finite collection of elements f,,...,f, in a ring R is an n-tuple (g,,...,g,) in R” such that 
r=18.f, = 9.) 

A new challenge was seen: try to extend the factorization of ideals due to Kummer to 
polynomial rings, the natural setting of both algebraic geometry and invariant theory. This was 
accomplished in 1905 by the world chess champion, Emmanuel Lasker. He found that every ideal 
in a polynomial ring is the intersection of primary ideals. (An ideal J is primary if whenever 
a-be€T, either a © I or b" € IT for some n.) The result is the best one can do in general. For 
instance the ideal in C[.X, Y] generated by X? and Y? is neither an intersection nor product of 
prime ideals. 

A far-sighted English mathematician, F. S$. Macaulay, delved deeper into this decomposition. 
He was interested in when such a decomposition was unique and “unmixed”—which roughly 
means all the primary components should have the same “size.” Rings where the primary 
decomposition behaves as well as possible are now called Cohen-Macaulay rings. Cohen-Macaulay 
rings lie at the foundations of commutative algebra today. In 1974, M. Hochster and J. L. Roberts 
showed that the ring of invariants of a linearly reductive algebraic group acting on a polynomial 
ring linearly is Cohen-Macaulay, which with other concurrent results signaled a reinvestigation of 
invariant theory. A typical example is as follows: let U be an r X t matrix of variables and V be 
a ¢ X s matrix of variables. The ring R = C[UV], generated over C by the entries of the product 
matrix can be thought of in several ways. If X is an r X s matrix of new variables, then the 
mapping of C[X,,] > R by X,, > (UV),, has kernel J = I,,.,(X), generated by the ¢ + 1 size 
minors of J. Thus R is isomorphic to C[ X,,]/J. The weaker statement that J is the nilradical of 
I,,,(X) is equivalent to the statement that the rank of the product of an r X ¢ matrix witha ¢t X s 
matrix is at most ¢. On the other hand, R is also the ring of invariants of G = GL,(C) acting on 
S = C[U,V] via the action U > UA™~!,V > AV. The theorem of Hochster and Roberts implies R 
is Cohen-Macaulay. 
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When Emmy Noether in 1921 gave a wonderfully simple proof of Lasker’s theorem, as well as 
establishing an axiomatic treatment of Dedekind domains and rings in which every ideal is finitely 
generated (now called Noetherian rings), commutative algebra did not exist as a separate field. 
Indeed some say that it was Noether’s paper which ushered in all of modern algebra. It was not 
until the work of W. Krull (1935) that the field began its process of self-realization. With his 
principal ideal theorem, and his theory of local rings and their completions, commutative algebra 
became a distinct entity. 

A commutative ring is /ocal if it has a unique maximal ideal and is Noetherian. If R is any 
commutative algebra, and p is a prime ideal, then by forming “quotients” with denominators not 
in p, One may construct a local ring R,. This process is called /ocalization. If R is local with 
maximal ideal m, then the powers of m may be used as a neighborhood basis for a topology 
(which is Hausdorff), and we may complete R with respect to this topology. The result (denoted 
R) is again a local commutative ring. For example, if k is a field and R is k[ X,,..., X,] localized 
at (X,,...,X,), then R consists of rational functions /f(X),...,X,)/g(X%,...,X,) with 
g(0,...,0) # 0, while R is exactly the ring k[[ X,,..., X,]] of formal power series. 

What have we gained by this process? A remarkable amount, as it turns out. In 1946, I. S. 
Cohen showed that if R is a local ring containing a field, then R is isomorphic to a quotient, 
kK{[X,,...,X,]|/7, of a formal power series ring. With this result a standard technique was born: 
localize, complete, and use the structure of the power series ring to analyze whatever problem one 
has. We will give an example of this process in a moment. 

It became obvious that one type of local ring was the most basic—that of regular local rings, 
which generalize the classical notion of non-singularity. Examples of regular local rings include 
power series rings over fields, discrete valuation rings, and the localization of polynomial rings at 
any prime. In fact, if R is a local ring containing a field, then R is regular if and only if R is 
isomorphic to a power series ring over a field. 

A problem which became increasingly famous as its solution evaded researchers again involved 
the unique factorization property. The conjecture stated that any regular local ring R has the 
unique factorization property. If R contains a field, then a proof can be given by showing that R 
(a formal power series ring) has unique factorization using the Weierstrass Preparation Theorem, 
then proving by descent that R also has this property. The proof of the general conjecture, due to 
M. Auslander and D. Buchsbaum in 1959, ushered in the present era of commutative algebra 
characterized by the extensive use of homological algebra. 

Loosely speaking, homological algebra is linear algebra over an arbitrary ring, or alternatively 
the study of matrices whose entries lie in an arbitrary ring. One famous problem should convince 
the reader this is highly nontrivial. If f,,..., f, are polynomials in R = k[.X,,..., X,,,] which form 
the first row of an invertible n X n matrix with coefficients in R, then Cramer’s rule shows there 
are polynomials g,,...,g, such that 1”, 2, f, = 1. J. P. Serre asked if the converse was true, which 
was finally proved independently by Quillen and Suslin in 1976. The simplest proof now available 
uses the technique of localization. (A better known version of Serre’s question asked if every 
projective module over a polynomial ring is free.) 

Homological algebra investigates the relations (syzygies) of an arbitrary matrix. The module of 
relations’is finitely generated (Hilbert) and we may put the generators into a matrix and ask for 
their relations (second syzygies). Hilbert’s syzygy theorem states that if R is a polynomial ring, 
this process stops in finitely many steps! The starting point of the modern viewpoint was the 
realization that this property characterizes regular local rings. 

A great deal of modern commutative algebra has been directly inspired by considerations in 
algebraic geometry, as in the work of Serre and Grothendieck. However, by no means is the entire 
field a subject of this benevolent tyrant. Newly found connections with topology and combinator- 
ics as well as a deeper self-analysis characterize an expanding field. While the variety of examples 
in Harry Hutchins’ book mirrors the diversity now found in commutative algebra, there are many 
other important examples which could have been included. Indeed, one of the main pedagogical 
problems confronting us today is a lack of agreement over what is truly basic in the field. 
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Fair Representation: Meeting the Ideal of One Man, One Vote. By Michel L. Balinski and 
H. Peyton Young. Yale University Press, New Haven, CT. 1982. xi + 191 pp., $27.50. 
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The apportionment problem deals with the routine operation of rounding fractions, when they 
are constrained to sum to a given constant. For example, when rounding off numbers in a 
statistical table so that they sum to precisely 100.0%; when a dean rounds salary increments to be 
multiples of $100; or when a department chairperson uses the preregistration figures to determine 
how many sections there should be for each subject. This common procedure appears, at first 
glance, to be a trivial one in practice, and to warrant little serious concern; whereas the scheme 
used to apportion fractions may, in some instances, have major consequences. For example, the 
President of the United States was determined in 1876 by the apportionment used for the U. S. 
Congress in the 1870s, which was not even in agreement with the method they were supposed to 
be using at the time. 

The authors have chosen to discuss the apportionment problem in the context of fair political 
representation, and they relate the fascinating history it has had in the U. S. House of 
Representatives. Great American statesmen lke Hamilton, Jefferson, Madison, Washington, J. Q. 
Adams, and Webster were involved with this mathematical problem and the frequently recurring 
debates over apportionment of the U. S. House. In the twentieth century, several famous 
mathematical scientists also took part, as indicated by the long time feud between E. V. 
Huntington of Harvard and W. F. Willcox of Cornell. Balinski and Young now argue in support 
of the method of Webster and Willcox over the method of Hill and Huntington which is the 
current law. (They no longer advocate the use of their own quota method, which was described in 
this MONTHLY (vol. 82 (1975) 701~730), since they found that it can take a seat away from the 
only state which has gained in population.) It should be stressed, moreover, that the apportion- 
ment problem arises in a great number of other allocation problems. It has been studied for 
various allocations which arise in a typical school district, for manpower assignments in the U. S. 
Navy, and for the distribution of public transportation vehicles in Leningrad. 

This book is likely to become a definitive contribution to a newly emerging field which can be 
named “equity analysis.” This subject is concerned with the fair division of various types of goods 
and services, as well as political representation. This theory will naturally divide into two parts: 
finely divisible commodities (money, time, cakes), and more discrete things (people, votes, calculus 
classes, vehicle loads). Fair Representation is destined to become a classic in the latter direction. 
Although equity has been studied mostly by economists, for example, Gunnar Myrdal (1898— ) 
and Arthur Okun (1928-1980), it now appears ripe for accelerated development by mathematical 
scientists; perhaps similar to the development of subjects such as linear programming, game 
theory, and parts of operations research, in recent decades. 

The extensive use of the axiomatic method in classical applied mathematics has been seriously 
questioned. See, for example, “Beware of Axiomatics in Applied Mathematics” by L. C. Woods, 
Bulletin of the Institute of Mathematics and its Applications (vol. 9 (1973) 41-44). There is surely 
some value and efficiency in axiomatizing subjects, say in the physical sciences, after they have 
matured somewhat. One unnamed colleague referred to this as “like cleaning up the workshop 
after the inventor has gone home.” There is, nonetheless, some risk to teaching a scientific subject 
in a way as to downplay the importance of intuition, experimentation, and the modelling process. 
In contrast to these latter views, axioms do play a very crucial role in the early stages of modelling 
in many of the contemporary directions of applied mathematics, such as in the social, managerial 
and decision sciences. The book under review is a truly splendid illustration of this latter use of 
axioms, where intended goals or desirable properties are listed as axioms to be satisfied whenever 
possible. Over the past decade, Balinski and Young have provided multiple characterizations of 
the classical methods of apportionment, including axiomatic definitions. It is interesting to note 
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that great mathematicians of the likes of Huntington and von Neumann reviewed the work on 
apportionment, but did not bring the axiomatic approach to bear on this problem. 

This book also provides a beautiful example of an “impossibility theorem” which can be 
lucidly explained, and possibly fully proved, to college freshmen. Three very intuitive and highly 
desirable properties, which have been well known in apportionment considerations for more than 
100 years, are in general inconsistent. No apportionment method can always satisfy the quota 
condition, population monotonicity, and monotonicity in house size (that is, avoid the “Alabama 
paradox’’). 

In short, this is a delightful book to read, while being a major synthesis of an important area. It 
is well suited for illustrating some significant points which should be well known to mathematics 
students as well as to various social scientists. 


LETTERS TO THE EDITOR 


Material for this department should be prepared exactly the same way as submitted manuscripts (see the inside 
front cover) and sent to Professor P. R. Halmos, Department of Mathematics, University of Santa Clara, Santa 
Clara, CA 95053. 


Editor: 


The proof given by J. A. Gallian (this MONTHLY, 91, p. 134) that A, is simple is indeed simple. 
However, it can be made even simpler by proving the case |N| = 2 as follows. 

If |N| = 2, then N must be a cyclic group generated by an element of order 2 which must be of 
the form (ab)(cd). Since N is normal in A,, it must contain the element 


(abc) “(ab)( cd)( abc) 
= (acb)( ab)( cd)( abc) 
= (ad)( bc) € N, 
which is a contradiction. 


K. L. Teo 

M. D. Hendy 

Dept. of Mathematics and Statistics 
Massey University 

Palmerston North, New Zealand 


Editor: 


A number of readers of the MONTHLY have observed that my proof of the simplicity of A, 
which appeared in the February 1984 issue could be shortened by using a permutation argument 
to handle the last case considered. I thank these readers for their comments. 

Unfortunately, many readers missed the point of my note. My purpose was to provide a proof 
which utilizes the same methods to prove that A, is simple (Sylow theorems and counting 
techniques) as are typically used to prove the non-existence of simple groups of various orders. 
Beyond being a nice application of these methods, such a proof has the advantage that it proves 
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As; is simple, independent of the five forms in which it commonly arises: as a group of 
permutations, as the rotation group of a dodecahedron or an icosahedron, as a group of 2 X 2 
matrices over a field of order 4 or 5 modulo the scalar matrices. 


Joseph A. Gallian 

Department of Mathematical Sciences 
University of Minnesota 

Duluth, MN 55812 


145. MISCELLANEA 


Ode to a 6? 


A thing as lovely as a @ 
I think that I shall never spy} [Choose one. ] 


If you want to make ¢ rhyme with “see”, 
But you're really determined to be 
Consistent, admit 
Though it doesn’t quite fit, 
That a disk measures r? times “pea”. 


Each of £, 7, $, x, W will claim 

An identical rhyme for its name. 
Before you get hot, 
Check Liddell and Scott!: 

Give them, not the author, the blame. 


Barry W. Brunson 
Department of Mathematics 
Western Kentucky University 
Bowling Green, KY 42101 


'A Greek-English Lexicon, compiled by H. G. Liddell & R. Scott, Oxford at the Clarendon Press, 1968 (1st 
edition 1843; widely recognized as authoritative). 
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In this newest addition to the Carus Mathematical Monographs, the authors 
examine the relationship between elementary electric network theory and ran- 
dom walks, at a level which can be appreciated by the able college student. We 
are indebted to them for presenting this interplay between probability theory 
and physics in so readable and concise a fashion. 

Central to the book is Polya’s beautiful theorem that a random walker on an 
infinite street network in d-dimensional space is bound to return to the starting 
point when d= 2, but has a positive probability of escaping to infinity without 
returning to the starting point when d = 3. The authors interpret this theorem as 
a statement about electric networks, and then prove the theorem using tech- 
niques from classical electrical theory. The techniques referred to go back to 
Lord Rayleigh who introduced them in connection with an investigation of 
musical instruments. 

In Part I the authors restrict themselves to the study of random walks on finite 
networks, establishing the connection between the electrical concepts of cur- 
rent and voltage and corresponding descriptive quantities of random walks re- 
garded as finite state Markov chains. Part II deals with the idea of random walks 
on infinite networks. 
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The Role of Mathematics in Science 
by Leon Bowden and M.M. Schiffer 

220 pp. Paperbound 

List: $16.00 MAA Member: $12.50 


This book is based on a series of lectures given over a period 
of years to high school mathematics teachers. The lectures 
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ADVICE ON HANGING PICTURES 


F. J. BLOORE 
Department of Applied Mathematics and Theoretical Physics, The University, P.O. Box 147, Liverpool 
L69 3BX, England 


H. R. MORTON 
Department of Pure Mathematics, The University, P.O. Box 147, Liverpool L69 3BX, England 


1. Introduction. The irritating way in which pictures frequently hang askew can be explained 
by simple mechanics related to the geometry of the picture and its cord. 

Suppose that the picture is suspended by a cord attached to its two upper corners, passing over 
a frictionless hook in the wall. Assume also that no friction acts between the picture and the wall. 
The picture will hang in equilibrium with its centre vertically below the hook; the parts of the cord 
from the hook to the two corners will make the same angle with the vertical. 

We shall see shortly that if the cord is shorter than some critical length, the symmetric 
equilibrium position is unstable. The picture will, if slightly disturbed, fall askew on one side or 
the other to a stable but slanting equilibrium position. This situation is illustrated in Fig. 1 where 
the cord lengths are the same in all three pictures but the tilted pictures have a lower centre of 
gravity. 


dn 


7 
iK 


FIG. 1 


It turns out (Section 2) that in the tilted equilibrium positions, the angle between the 
cords—the “support angle”—is equal to the angle c between the diagonals of the picture. The 
hook then lies on the circle Y through the centre of gravity and the two corners A, B where 
the cord is attached to the picture (see Fig. 2). If in the symmetrical level position of the picture 
the cord is so short that the support angle is larger than c, then this position is unstable. The cord 
must be lengthened to reduce the support angle to c in order to attain stability of the level 


Fred Bloore: I took a Ph.D. in nuclear theory in 1961. My interests later drifted towards geometrical aspects of 
things, particularly of quantum mechanics but also of more concrete objects such as beach pebbles and sand dunes. I 
enjoy hill walking and am a fairly active member of Amnesty International. 


Hugh Morton: As a research student at the University of Warwick in 1964-66, I saw the very beginnings of the 
Mathematics department there, under the inspiration of Chris Zeeman; since then I have been a lecturer in Pure 
Mathematics at Liverpool. My general enjoyment of things geometrical is normally directed at low-dimensional 
topology, particularly knot theory and other areas in which I can try to visualise the objects under study. 

By way of relaxation I enjoy making geometrical models from time to time. Fred and I hope to develop a robust 
demonstration model from the ideas in this article, to take its place among other mathematically based exhibits in 
our departmental display. 
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position. The hook now lies on the perpendicular bisector ¥ of AB. 

The problem is acute for a long thin picture such as a school photograph or a panorama. These 
have a small angle c so need a long cord. As an extreme example, if the Bayeux tapestry 
(5m X 70m) were nailed down on a rigid rectangle of hardboard, one would need 1 km of cord to 
hang it so the horizontal equilibrium position were stable! 

By altering the points of attachment of the cord we can change the effective geometry of the 
picture. The important features are the position of the centre of mass of the picture, and the two 
points of attachment. Provided that both points of attachment lie at the same distance from the 
centre of mass, the picture will behave as if it were a rectangular picture with the same centre of 
mass, whose corners lie at the points of attachment. The effective characteristic angle c can thus 
be increased by moving the points of attachment closer to each other along the top edge of the 
picture. 

In the next two sections we shall show why the equilibria follow the patterns described, and 
analyse their stability. As the cord length is varied, the equilibrium position of the picture moves 
relative to the hook. It is fruitful to consider the Jocus of the hook relative to the equilibrium position 
of the picture as the cord length is varied. This locus is the union of the line ¥ and the circle Y, 
Fig, 2. As the cord is shortened, the hook moves down # to F, then along one or other arc of . 
The continuation of # below E 1s unstable. 

The main phenomenon is the splitting of one stable equilibrium position into two as the cord is 
shortened (or as the cord is kept the same length and the centre of gravity raised!). The nature of 
the transition which occurs is familiar from the viewpoint of elementary catastrophe theory. This 
naturally suggests that we consider also what happens when the centre of gravity is not equidistant 
from the points of support. 

In this case, treated in Sections 4 and 5, the locus of the hook H relative to the picture is no 
longer a circle and line but a more general cubic curve called a strophoid. The points of 
attachment of the picture also lie on this strophoid. They are conjugate with respect to a natural 
involution on the strophoid, and we explore the geometry from this point of view. 


Cc 


Fic, 2(a). Short cord. Fic. 2(b). Long cord. 


2. Circle and Ellipse. Consider a rectangular picture ABCD of length AB =2a, depth 
BC = 2b with centre of mass G at its centre, suspended by a cord of length 2/ attached at A and 
B passing over a hook H, Fig. 2. 

We assume that no friction acts between the picture and the wall. Then G must lie vertically 
below H since the three forces on the picture, namely gravity and the tensions in AH and BH, 
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must pass through the same point H. The tensions in the two parts of the cord are equal, because 
the hook is frictionless. Their horizontal components must cancel, and so the pieces of cord AH 
and BH make equal and opposite angles with the vertical HG. It follows that H must lie on either 


(1) the perpendicular bisector # of AB, or 
(2) the circle Y which passes through A, G and B. 


In the latter case H must be above the arc AGB, and the equality o1 the angles is implied by the 
equality of the chord lengths AG and GB. We now prove that # and ¥ are the only possible 
locations for H. Let B’ be the reflection of B in the line HG, Fig. 3. 


HH 


FIG. 3 


Since the angles A4HG, BHG are equal, B’ must lie on HA. 
If B’ = A, then HG is perpendicular to AB and so H lies on the perpendicular bisector of AB. 
If B’ # A, then 


HBG = HB’G by reflection 


= B’AG since the triangle B’AG 1s isosceles 
= 7 — HAG. 
Thus HAGB is a cyclic quadrilateral; H lies on the circle through A, G, B. Note that when H lies 
on the arc AGB of YS, the angles 4HG, GHB are not equal but supplementary. 
Since “ and ¥ are fixed relative to the picture, it is natural to consider the equilibrium 


positions of the hook H relative to the picture, and how they change as we vary the length of the 
cord, Fig. 4. 


Let & cross ¥ at E, above G. Then AE = ay (1 +(a/b)’) , 


For “long” cords, i.e., for / > AE the hook must lie on # above FE. There is only one position 
of equilibrium; the picture hangs level in stable equilibrium. For “short” cords a < 1 < AE, there 
are three positions of equilibrium, two asymmetrical ones H, and H, on & flank a position H, 
on £ (Fig. 4). It is well known that for fixed cord length 2/, the locus of H such that 
HA + HB = 21 is an ellipse & with foci A and B (and with eccentricity e = a//). For long 
cords &, will not cross ¥ but will cross Y at H. For short cords, & will cross Y at H, and H;, 
and will cross ¥ at H, (and also at H,), Fig. 4. 

The stability of the positions of equilibrium can be decided by examining the potential energy 
of the picture. The potential energy of the picture relative to the hook is proportional to — GH. A 
position H is an equilibrium position if GH is stationary with respect to movements of H along 
the ellipse, i.e., if GH is normal to the ellipse. The equilibrium will be stable if GH is a local 


312 F. J. BLOORE AND H. R. MORTON [May 


&, for “long” cord 


&€, for “short” cord 


maximum, unstable if a local minimum. 

The circle @ with centre G and radius GH, touches the ellipse & at H, and H,. The two 
curves cannot meet elsewhere since distinct conics can intersect at most four times, and @ and @, 
have double contact at H, and at H;. So the ellipse & must lie entirely within the circle @, 
because the point H, on & is certainly inside @. We conclude that GH, and GH, are local (in 
fact global) maxima and GH, 1s a local minimum. 

Now draw GX tangent to the circle Y. One of the classical circle theorems tells us that angle 
XGB equals angle GH, B. Thus for rectangular pictures on short cords, the symmetrical position is 
unstable, and the picture tilts until the angle AH, B or AH,B between the cords has decreased 
down to the angle BGC between the diagonals of the picture. More usefully, to hang a picture in 
symmetrical stable equilibrium, the cord must be long enough that the angle between its halves is less 
than or equal to the angle between the diagonals of the picture. If the points of attachment A and B 
of the string to the picture are not at the corners, the criterion becomes: the angle 4HB must not 
exceed 7 minus angle AGB. So for a given picture and cord length, one can obtain stability in the 


HH 
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symmetrical position by moving the points A and B sufficiently close together, but keeping 
AG = BG. 

Now return to the rectangular picture hung by the corners, and consider its equilibrium 
positions relative to the hook H on the wall, as the cord increases in length from 2a (Fig. 5). Let 
the angle BGC between the diagonals be 20; draw a line HG’ on the wall vertically downwards 
and lines HA’, HB’ at an angle 6 to HG’. Then the points A and B of the picture will lie on HA’ 
and HB’ and the point G on HG’. As the length of the cord increases, the picture will move 
downwards and become eventually horizontal, when the cord reaches its critical length. The circle 
AGBH moves with the picture and ro//s on a circle twice as large on the wall, centre H. This is 
because A, B and G, respectively, traverse the fixed radii HA’, HB’, HG’ of the big circle on the 
wall, Fig. 6. These radii are perpendicular to the lines PA, PB, PG, where P is the point of 
contact of the two circles. Hence P is the instantaneous centre of rotation of . If the cord is 
lengthened further, the picture remains horizontal and moves downwards; A and B leave the lines 
HA’, HB’. 
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It must be admitted that one could “cheat” and use two hooks. In Fig. 7, the picture behaves as 
if suspended from H whereas in reality a much shorter cord AH, HB is being used. The points 
H,, H, may be chosen anywhere on the lines HA and HB. We invite readers to develop this case 
further. (We haven’t.) 


3. Evolute. We return to the viewpoint relative to the picture, and present another approach to 
the stability question using the evolute of the ellipse & of Fig. 4. This is the locus of the centre of 
curvature of &,, which is an astroid-like curve &, Fig. 8. If G lies inside .%, 1.e., if the lower cusp 
J of &, lies below G, then the circle of centre G and radius GH, will be a tighter circle than the 
circle of curvature at H, (which has centre J and radius JH,). Hence GH, will be a minimum of 
GH as H moves on the ellipse and so H, is an unstable position of equilibrium. 

If the cord is lengthened, the ellipse & becomes larger and more nearly circular; the curve x, 
becomes smaller, shrinking down to the centre point O. Its lower cusp J will move upwards 
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Fic. 8 


through G towards O. At the critical length of cord when J is on top of G, the three equilibrium 
positions H,, H,, H, (which we observed to lie on the circle through A, B and G) must coincide, 
corresponding to the (well-known) theorem that for any ellipse the foci A and B, the point H, 
and its centre of curvature J lie on a circle whose radius is half the radius of curvature at H). 

If the string is long enough that J is above G, then the circle of curvature at H, will be tighter 
than the circle of centre G and radius GH,, so that H, is a stable position of equilibrium. 


4. Catastrophe theory. Readers acquainted with catastrophe theory will recognise that the cusp 
J which governs the stability character of the hanging picture arises from a cusp catastrophe. We 
now develop this aspect. Several clear introductions to catastrophe theory exist already (see the 
introductory chapter of reference 2) so we shall just sketch the main idea. 

Consider a potential function V(a,,...,a,,,X,,...,X,) Of a system having coordinates 
X,,...,X, and adjustable parameters a,,..., a,,. Catastrophe theory describes how the stationary 
points of V (solutions of dV/dx = 0) can coalesce and disappear as the parameters « are varied. 

The simplest example is V(a, x) = x? + 3ax. For negative a, there are two stationary points 
x = + V-—a, one stable and one unstable equilibrium, which coalesce and disappear as a 
increases through the value zero, Fig. 9. 

The graph of the stationary points as functions of a, x = + ¥—a is the parabola Fig. 10. This 
sudden loss of stationary points caused by a continuous change of parameter is called a 
catastrophe. 

The next most simple example is 


(1) V(a,B,x) = x* — 6Bx? + 8ax. 
The stationary points are given by 0V/dx = 0, Le., 
(2) 4x? —12Bx + 8a =0. 
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a<0O a= 0 a>QdO 
FIG. 9 
Ss 
+y~-a@ 
s = stable 
u = unstable 
u 
—yVy-a 
Fic. 10 
B 


Fic. 11 
Two stationary points coincide if also 0°V/dx* = 0, ie., 
(3) 12x? - 128 =0. 


Eliminating x from (2) and (3) gives a* = B°, a cubic parabola 2 in (a, B)-space, Fig. 11. 

The small wiggly graphs dotted about on Fig. 11 are little sketches of the graph of V(a, B, x) 
as a function of x, for the values (a, B) given by the location at which the small graph appears. 
The function V has three stationary points if (a, 8) is above the curve 2 (“inside the cusp”). As 
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(a, 6B) crosses 2, so two of the stationary points of V coalesce and disappear. The local situation 
is cubic there, as in Figs. 9 and 10. 

We now show that the potential energy function (squared, actually) of our hanging picture is, 
neglecting terms x° and higher, of the form (1) with (a, 8) proportional to the coordinates of the 
centre of gravity G, referred to the cusp J of Fig. 8. This is the first time in this article that we 
allow G to deviate from the perpendicular bisector of AB, but we do so from now on. When G is 
just inside the cusp J of Fig. 8 the picture has 3 equilibria; the hook positions are the feet of the 3 
possible perpendiculars drawn upwards from G to the ellipse. (A fourth may be drawn down- 
wards.) As G crosses the evolute A,, two of these feet coalesce and disappear. So the cusp J is a 
catastrophe cusp like Fig. 11. 

Choose axes at O, fixed in the picture, with the x-axis along OB, Fig. 8. Let the hook H have 
coordinates (x, y). It lies on the ellipse & with equation 

2 2 


(4) ~ 47-1. 


/? l* — a’ 


It follows from coordinate geometry that the cusp J is at (0, —a*/(I* — a*)'/*). Suppose that the 
centre of gravity G has coordinates (a, —a*/(I* — a”)'/* + B), so that (a, B) is the position of G 
relative to J. We shall compute the potential energy (squared) V(a,B,x) of the picture as a 
function of the x-coordinate x of H, where H 1s restricted to the ellipse (4). 

Apart from a factor mg the potential energy is minus the distance GH. We choose instead to 
study the function — GH®° as this has the same stationary points but fewer square roots in its 
algebraic expression. Thus we choose 


(5) V(a,B,x) = —GH? = ~(x— a)’ ~(y(x) + (P= a?) — By, 
where by (4) 
v(x) =(P = @)'P(L =P), 


Let us expand y(x) in powers of x, neglecting terms above x“, 


2 
(6) y(x) = (2? -a?)'” 1- =, - xsi" , 


The approximation is good for small x. Substitution of (6) in (5) gives, to order x“, 


V(a,B,x) = Ax* — B(? — a?) x?/P? + 2ax + constant, 


where A = (a* — B(/? — a*)!”*)/49'. This is of the form (1), if we neglect the B-dependence of 
the coefficient A. We are justified in this if B is small, which is the case when G is near to J. 


5. Catastrophoid! Finally we ask: What is the locus of the equilibrium positions of the hook H 
relative to the picture as the cord length / is varied, when the points of attachment A and B are 
not symmetrically placed relative to the centre of gravity G? In the symmetrical case this locus 
was the line ¥ together with the upper arc AEB of the circle ”, Fig. 12. In all cases the locus is 
specified by the equilibrium condition that the angle A4HG should equal the angle BHG. When G 
moves off centre, the locus of line plus part of circle deforms to part of a more general cubic 
curve, called a mght strophoid, Fig. 12. The point of intersection FE disappears. The special 
bifurcation of “trident” type at E becomes a more general bifurcation, Fig. 13 (ref. 3). The upper 
stable branch remains stable and does not bifurcate, but a pair of stable and unstable equilibria 
appear, as in Fig. 10. 

We sketch the derivation of the equation of the strophoid, which is straightforward coordinate 
geometry. Given three points A, B, G in a plane, we seek the locus of the point H(x, y) such that 
AHG = BHG. Let us choose the same axes as in §4, i.e., let A be (—a,0), B(a,0) but relabel the 
coordinates of G to be (a, B). 
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s = stable 
u = unstable 


FIG, 13 


The condition 


cos AHG = cos BHG 
becomes, using scalar products, 
AH:-GH _ BH: GH 
IAH||GH| |BH||GH]- 


If we cancel |GH]|, square both sides, and then cross multiply, we obtain a polynomial equation in 
x and y, 


[(x— a)? + y?][(x + a)(x = @) +y(y - BP 
= |(x + a)” + y?][(x - a)(x - @) + (y - B)P. 
This simplifies to the cubic equation 
(7) (X? + Y?)(BX—- aY) + aB(X*? + kXY — Y’) =0, 


where X= x — a, Y=y— B,and k =(a’ + B? — a’)/af, after cancellation of a factor y. The 
possibility y = 0 corresponds to the line AB which is obviously part of the locus, but not 
accessible physically for a picture hook. If a = 6 = 0, equation (7) reduces to xy = 0. If a = 0, 
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B # 0, equation (7) reduces to 
x{ B(x? +(y — B)’) +(a? + B?)(y — B)} =, 


the line and circle of §2. 

In the general case equation (7) defines a “right strophoid” [1], [4], which we denote 
S(A, B;G). It has an asymptote of slope B/a, a node at G (X = Y = 0), at which the curve 
intersects itself at right angles and it passes through A and B. The tangents to the strophoid at G 
are the bisectors of the angles between AG and BG. Some examples are given in Fig. 14, with 
a = 1 and (a, 8) labelling the curves. Only part of the strophoid corresponds to equality of the 
angles AHG, BHG. On the rest of the curve, drawn dotted, the angles are supplementary; this 
possibility crept in when we squared the cosines. 


(a, 8) = (—0.1, -1) (a, B) = (-0.4, —-1) 


(a, B) = (—1,-1) (a, B) = (—1.5, -1) 


Fic. 14 


The interpretation of Fig. 12 is as follows. For a picture with centre of gravity G and string 
attached at A and B, if the string is long, the equilibrium position H of the hook relative to the 
picture will lie on the part BK of the strophoid, where the ellipse & cuts the curve. The potential 
energy function V = —HG or — HG? has only one stationary point as H moves on &,. GH is 
normal to the ellipse &. 
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We make the depressing observation that no matter how long the string is, the picture will 
always hang crooked if the centre of gravity is not equidistant from the points of support! For a 
“long” cord the centre of gravity will tend to lie vertically below the midpoint of AB since the line 
through G in the asymptotic direction passes through this midpoint. As the string is shortened the 
ellipse & shrinks and the hook position H will move down the curve towards B, until the ellipse 
touches the arc AFL of the strophoid, at some point F. The potential energy function for this 
ellipse has a stationary point of inflexion at F, and a minimum on the arc KB. If the string is 
shortened further, the ellipse cuts the strophoid at three points, giving stable equilibria on AF and 
BK moving towards A and B, and an unstable one on FL moving towards L. The point of 
inflexion has split apart into a maximum and a minimum. The part of the strophoid below AB is 
not physically relevant. We see that the strophoid S(A, B;G) could be defined to be the locus of 
the feet of the normals from G to the family of ellipses with foci A and B. 

In the literature [1], [4], a strophoid is usually defined in terms of a fixed point O called its 
centre and a line # with a point G on it called its node. A point H is on the strophoid if OH 
intersects Y at H” such that HH” = GH”, Fig. 15. 


Fic. 15 


As O approaches any point O) on £&, so the strophoid degenerates to a circle of radius O)G 
and centre O,, together with the line Y. The centre of the strophoid of equation (7) is at 


a a B a’ 
3 ate) Slat) 


The question arises how to reconcile our “equal-angle” construction of a strophoid with the 
Lockwood construction. How are the points of attachment A, B related to the centre O and the 
line #? Are there any other pairs of points C, D on the strophoid S(A, B; G) such that the angles 
CHG, DHG are equal or supplementary for each point H on the curve? 

The answer to the last question is that every point C on the strophoid is paired in this sense to 
another “conjugate” point D, which may be found by drawing the diameter COC’ from C 
through the centre O to the antipodal point C’, and then drawing the chord C’D parallel to the 
nodal line Y to meet the strophoid again in D. We see that G is conjugate to itself and O is 
conjugate to the point on the strophoid at infinity, Fig. 18. 

Another construction for D is to draw the two tangents to the strophoid at the node. The lines 
GC and GD are then mirror reflections of each other in the nodal tangents. But the best 
construction follows from the direct construction of the strophoid S(A, B; G) now given, Fig. 16, 
and answers all the questions just raised. 

Draw concentric circles centred at G (not enclosing A or B). For each circle, draw the two 
tangents from A and the two tangents from B. These intersect in six points, A, B, H,, H,, H;, H4, 
called the vertices of the complete quadrilateral of the four tangents, Fig. 16. We make two claims. 
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FIG. 17 


Fic. 18 


(i) All these vertices lie on S(A, B; G). 
(11) Opposite vertices are conjugate points for S(A, B; G); that is 
S(A,B;G) = S(A,, H,; G) 
= S( H,, Hy; G). 
Claim (i) is obvious from Fig. 16. Note that 4H,G = BH,G but AH,G = 7 — BH,G. 


As a special case, if we let B go off to infinity along the x-axis, Fig. 17, the strophoid 
S(A, 00; G) is the same as the strophoid constructed by the Lockwood prescription, with centre A, 
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node G and the x-axis as the nodal axis. This is because the tangents from B to all the circles 
centred at G are parallel. The equality of the angles x in Fig. 17 implies the equalities of the 
lengths H’H,, H”’H,, HG. 

To prove claim (ii) we consider the two strophoids S(A, B; G) and S(H,, H,; G). They have 
the same six points A, B, H,, H,, H;, H, in common and they have the same node. Hence they 
must be the same strophoid, because there is only a three-parameter family of strophoids with a 
given node G. (The three parameters may be chosen to be the direction of the nodal line through 
G, and the two coordinates of the centre O.) Hence H, and H, are good conjugate points to 
generate S(A, B; G). 

In Fig. 18, if a picture with centre of gravity G is hung from the conjugate points C and D as 
points of attachment, then the hook must lie on the part DK of the strophoid. If it is hung from 
the conjugate points, C’ and D’, then the hook will lie on the same strophoid, but all the 
strophoid to the right of the line C’D’ is now available. 


We thank our colleagues Peter Giblin, Chris Gibson, and John Underhill for helpful information and computer 
pictures. 
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HOW TO SOLVE THE SYSTEM x’ = Ax 
B. VAN ROOTSELAAR 
Heidepark 13, 6705 AB Wageningen, The Netherlands 


Summary. A simple formula (7) is derived for the solutions of x’ = Ax by reduction to a 
single equation. This formula is extended to the system x’ = Ax + u in conceptually the same 
way as for a single equation of first order. The method also gives easy constructions for the 
fundamental matrix, Jordan bases and the Jordan normal form. The method is completely 
elementary. 


1. Reduction to a single equation. Consider the system x’ = Ax, where A is a constant square 
matrix of order n. If the characteristic equation det(s! — A) = 0 of A has the form 


(1) s"+a,s" '}+-+++a, =0, 
then by Cayley-Hamilton’s theorem 
A" +a,A" \ +--+ +a,I = 0. 
Hence for any x 
A"x + aA" 'x +--+ +a,x = 0. 


If x is a solution of x’ = Ax, then x) = A*x, so any Solution of x’ = Ax is a solution of the 
(vector) differential equation 


(2) x) + ax D+ +--+ +a,x = 0. 


Let s(1),...,5(p) be the different roots of (1), with multiplicities m(1),...,m(p). To the 
m(/)-fold root s(/) there correspond m(/) linearly independent solutions of a single equation of 
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(2) of the following form: 


f.a(t) = (7k iexp(s(j) +t) (k= 0,...,m(j) — 1). 


Order these functions for each 7 = 1,..., p, according to descending powers of t, so as to obtain 
a vector f(t), with components f,(t),...,f,(¢). Then the general solution of (2) is 


(3) x(t) = Rf(t), 
where R is a constant matrix. Such a solution of (2) is a solution of x’ = Ax if 
(4) Rf’ = ARf. 
2. Solution with prescribed initial value. If we wish to compute the particular solution with 
prescribed initial value x(0) = c, it is not necessary to follow the ordinary practice of computing a 


Jordan base and expand c in that base, because the decomposition will follow automatically. 
We proceed as follows. Let X(t) be the matrix with columns x(t), x’(t),..., x" (ft), Le., 


X(t) = mat(x(t),...,x~(t)). (Wronskian) 
For a solution x of x’ = Ax this matrix becomes 
G(x(t)) = mat(x(r),...,A”-?x(2)). 
Let F(t) be the Wronskian of f, 1.e., 
F(t) = mat(f(t),...,f@7)(2)). 
From (3) and x’ = Ax follow the equations 
x(t) = Rf(t),..., A” 1x(t) = RF" P24), 
which condense into 
(5) G(x(t)) = RF(t). 
For ¢t = 0 we have with x(0) =c 
G(c) = RF(0), 
and because F(0) is nonsingular, the solution R(c) for R is 
(6) R(c) = G(c) F(0) . 
The solution x(t) of x’ = Ax with initial value x(0) = c is 
(7) x(t) = R(c)f(*). 
3. The matrix F(0). The m(/) rows in F(0), corresponding to the m(/)-fold root s(/), are 
fy mg jy-1(0) 5+ ++ s.Aymcg)—1 00) 


fo (0) yee A (0). 


Since f(t) = (t/i)f,,;-1(0) for i > 0, we immediately find recursion relations for the computa- 
tion of f%?(0). These relations are 


(8a) FP (0) = (Ki) fF4-P (0) fori > 0, k > 0, 
together with 

(8b) f(0)=0 fori> 0, 

and 


(8c) £0) = s(j)". 
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The result is the following table for the composition of F(0): 


0 0 QO 0 1 5s 15s? 
0 0 0 4s 10s* 208° 
0 O 1 3s 6s* 105% 1554 
0 1 2s 35% 45° 554 6s° 
l1 s s* S$? s4 s° s® 


For an m(/)-fold root s(j) we have to use the m(/) rows from the bottom, substituting s(/) for 
s. The table is easily memorized by noticing the occurrence of Pascal’s triangle. 


4. Comments. The advantage of the procedure is apparent from the representations (6) and 
(7). Note that the solution is directly expressed 1n its initial value. For any c the matrix R(c) is 
easily computed, either directly or using the following formula. 

If e; are the natural base vectors, 1.e., (e,), = 6,,, then we have 


(9) R(o) = De RCo) 


The matrices R(e,) have some interest of their own, as we shall see in the next section. 

The matrix F(0) depends only on (1) (and the enumeration of the roots s(/)) and serves for all 
particular solutions. Since it is a numerical matrix, it is better suited for computations than, e.g., 
the fundamental matrix. Note that the computation, which requires one matrix inversion and a 
number of matrix multiplications, does not require the construction of a Jordan base. In fact, the 
representation contains that part of a Jordan base that is used in the computation of x(t) from its 
initial value c, and it comes naturally. 

The following simple examples illustrate the main steps in the procedure: 


l. Let 
3 O -1 
A = 2 2 —] 9 
2 0 0 
then s(1) = 1 and s(2) = 2 (double). 


111 | 4 ~3 
F(0)=/0 1 4|{and F(0) "=|-4 -3 4 
12 4 1 1 -1 


2 4 8 
If x(0) = | then Ax(0) = Hl and A*x(0) = 2. SO 
2 4 8 


24 8if 4 2 -3 0 0 2 
R(x(0) =| 4 || - ~3 :|=|0 2 i} 
24 8iL 1 0 


Hence the solution is 
0 2 
x(t) = R(x(0)) f(t) = 2 + Ht 
0 2 


The general solution can be computed either by directly computing R(c) or by computing R(e,) 
and using (9): 


-1 0 2 0 0 0 1 0 -I1 
R(e;) = 0 2 0 9 R(e,) = 0 0 1 9 R(e;) = 0 —]l 0 . 
—2 0 2 0 0 0 2 O —-l 
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So, 
(—c, + c,)e’ +(2¢, — ¢;) e” 
x.(t) = R(c) f(t) = (2c, — ¢3) te?’ + c,e*! 
(—2c, + 2c,)e’ +(2¢, — ¢) e*! 


2. Let 


oOooown’ 
COON © 
ON KF © 


then s(1) = 2 (triple) and s(2) = 3. 


0 0 1 6 12 6 9 —8 

O 1 4 12 -1 —16 -11 -12 12 

1 2 4 g| and FOO) 7 6 6 ~6}' 

1 3 9 27 —l —l —] 1 

Since A has many zeros, in fact is a Jordan matrix, direct computation of R(c) = G(c) F(0)~* is 
feasible. This results in 


F(0) = 


0 0 «cg O 

O ¢; ¢c, O 
RI=!9 9 & of 

0 0 0 ¢ 


from which the general solution is obtained by x.(t) = R(c)f(t). 
5. The fundamental matrix. The representation 
x(t) = R(x(0)) f(z) 


enables us to compute the fundamental matrix ®(¢) of the system x’ = Ax with very little effort. 
The solution x(t) of x’ = Ax with initial value x(0) is given by 


x(t) = ®(t) x(0). 
So, if we specify x(0) to e,, we have 
O(t)e, = R(e,) f(t), 
from which we have the representation 
(10) @(1) = mat( R(e,) f(z),..., Rl en) f(4)). 
Note in passing that (9) combined with (10) gives the general conversion formula 


(11) R(v(t)) f(t) = O(4) o(2). 


Application of (10) to the examples gives 


er =60 0 0 
0 e7 ter QO 
2 ®O(7¢) = 
(1) 0 0 er! 0 
0 0 0 e! 
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Formula (10) for the fundamental matrix seems to be highly preferable to the commonly used 
series representation, and presents a definite computational advantage in practical applications, 
even if one truncates the series by means of Cayley-Hamilton’s theorem. 


6. Solving the system x’ = Ax + u. In order to solve the inhomogeneous system, we just 
apply variation of parameters to the general solution of the homogeneous system x’ = Ax. So in 
the solution x(t) = R(c)f(t) we replace c by the vector v(t), which we determine such that the 
resulting x(t) is a solution to the inhomogeneous system. Then we have for v the equation 


R(uv')f+ R(v)f’ =AR(v) ft u. 
By linearity we have R(v)f’ = AR(v)f, so 
R(u’') f =u. 
Application of the conversion formula (11) gives 
O(t)v'(t) = u(t), 
and, because ® is nonsingular, 
v(t) = ®(t)*u(t). 


Integrated, 
v(t) = ['®(r) u(r) dr +e. 
0 


Substituting in x(t) = R(v) f(t) and once more applying the conversion formula (11), we obtain 
the well-known formula for the solution x,(t) of x’ = Ax + u with x(0) =: 


(12) x(t) = O(1) ['O(r) u(r) dr + ®(t)c. 
0 
The essential point here is to compute ®(t) according to (10). 


7. Extension. Evidently the same procedure as described in the preceding sections can be 
applied to the general n-dimensional system of order m 


xO + Ax(M D4 --- +A, x =0, 


(A,,...,A,, constant square matrices of order n) after conversion into a first order mn-dimen- 
sional system by 
z, =x*-Y) (k=1,...,m). 
This results in a system 
z’ = Pz 


of dimension mn. 
The characteristic equation of this system is 


det(sI — P) = +det(s”I + 5"-14, + --- +A4,,) = 0. 


This is an equation of degree mn in s: 


s™ tas" 14+ --- +a,,, = 0. 


mn 


A particular solution of the original system is specified by x(0), x’(0),...,x°"~ (0) and from 
these values the matrix 


G(z(0)) = mat(z(0), Pz(0),...,P’""~+z(0)) 


can be computed. 
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8. The Jordan normal form. The role of the matrices R(e;) is not limited to the use in the 
composition of the general solution and the computation of the fundamental matrix. They also 
yield a Jordan base and a Jordan normal form. Although this is of no importance for the solution 
of the system, let us see how this can be done. If we write 


p m(ys)~1 
Rf = » » bi Fick. 
j=l k=0 
then from the relation Rf’ = ARf we obtain the following known conditions for b, , 
(13a) AB: mn jy-1 = s(j) Di m(jy-1> 
and for k = 0,...,m(/J) — 2: 
(13b) Ab, = SC i) bg + Di e414: 


Equations (13a) and (13b) define b,, as a so-called Jordan base. Both systems may have more 
than one solution, because a multiple root may be looked upon as a set of coinciding roots with 
different multiplicities, e.g., in Example 2 the threefold root 2 may be considered as two 
coinciding roots 2 with multiplicities 1 and 2 respectively. So from (13a) and (13b) we know that 
the columns of R(e;) are vectors of a Jordan base. However, in one column of the set of R(e,) 
there may appear vectors satisfying different systems of (13a) and (13b), and the question is how 
to sort them out. To do this we best look how these vectors behave under application of A. The 
result is then a Jordan base, together with the corresponding Jordan normal form of A, possibly 
after some scaling of the base vectors. By way of illustration we apply this to Examples 1 and 2, 
where no scaling appears to be necessary. 


1. Note that R(e,) is nonsingular, so its columns already form a Jordan base. If R(e,) = 
mat(b,,b,,b;), then we see Ab, = b,, Ab, = 2b,, Ab; = 2b, + b,. So we have the following 
Jordan normal form 


2. Here for no c the columns of R(c) form a Jordan base. R(e,) contains one non-zero vector 
e, and Ae, = 2e,. The remaining R(e,) give 


Ae, = 2e,, Ae, = 2e, + e,, Ae, = 3e,4. 


Obviously so because A was already given in Jordan normal form. The natural base is a Jordan 
base and A itself is the corresponding Jordan normal form. 

The inspection may sometimes be simplified by looking at Ax,(t) or AR(c). What we do is in 
fact the singling out of the elementary divisors and in a rather easy way. 


146. MISCELLANEA 


The Learning of this People is very defective; consisting only in Morality, History, Poetry and 
Mathematicks; wherein they must be allowed to excel. But, the last of these is wholly applied to 
what may be useful in Life; to the Improvement of Agriculture and all mechanical Arts; so that 
among us it would be little esteemed. And as to Ideas, Entities, Abstractions and Transcendentals, 
I could never drive the least Conception into their Heads. 


—Jonathan Swift, Gulliver’s Travels, 
(A Voyage to Brobdingnag). 


One of the founders of the theory of splines and optimal approximations. (See p. 337.) 
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Note added in proof: I should like to thank David Gale, David Hoffman, and Erwin Lutwak for 
interesting comments and references concerning the problem posed at the end of this paper. In 
particular, a paper of Tudor Zamfirescu (Proc. Amer. Math. Soc., 80, 3 (1980) 455~—457) studies 
the number of contact points of a convex curve with its circumscribed circle and shows that in the 
sense of Baire categories, “most” convex curves have exactly three contact points. On the other 
hand, an earlier paper of Peter Gruber (Math. Ann., 229 (1977) 259-266) shows that in the same 
sense, “most” convex curves are not C’. Thus, the Baire category approach is not appropriate to 
the four-vertex problem. Similarly, it is easy to see that in the C'-topology, the set of convex 
curves with three points of contact is dense in the set of C! convex curves. However, in the 
C*-topology there is an open set of convex curves with just two points of contact. In fact, any 
convex curve that contacts its circumscribed circle in exactly two antipodal points, and whose 
curvature at those points is strictly greater than that of the circumscribed circle, has a neighbor- 
hood (in the C?-topology) with the same property. Thus, the problem of finding an appropriate 
measure for this question is a subtle one. 
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Arthur Sard (1909-1980). The picture was taken in 1964. 


TREES AND POWER-SUMS 


A. H. STONE 
Department of Mathematics, University of Rochester, Rochester, NY 14627 


1. Introduction. Everybody knows that 3° + 4* = 5*. Many people must have noticed that 
33 + 4° + 5° = 6°, and must have been disappointed to find that 34 + 44 + 54 + 64 # 7*. Here 
we Claim that the “right” sequel is 


1 
T= 644+ 544 24 
obtaining these equalities as special cases of a general identity. The result comes, unexpectedly, 
from a problem about counting trees. 


2. Graphs. We rapidly recall some basic graphtheoretic notions and terminology, referring to 
[1], for instance, for further details. A “graph” G can be thought of as a subset of 3-dimensional 
space, consisting of a finite non-empty set V of points, the “vertices”, together with a finite set E 
of simple arcs, the “edges’’, each edge having its two end-points at vertices (with which the edge is 
“incident’’). Two edges can intersect only at common end-points. It should be remarked that some 
authors ({1], for instance) allow the two end-points of an edge to coincide, so that the edge 
becomes a loop; but this situation does not arise in the present paper, and we have excluded it for 
simplicity. But we do need to allow “multiple edges”; that 1s, several edges may join the same two 
vertices. 

The “degree”, d(v), of a vertex v in a graph G, is the number of edges of G that are incident 
with v. A vertex of degree 1 is an “end-vertex”, and the (unique) edge incident with it is the 
corresponding “‘end-line’”’. It 1s easy to see that the removal of an end-vertex v and its end-line e 
from a connected graph—that is, the replacement of V by V\ {uv} and E by E\ {e}—leaves a 
connected graph (for paths connecting other vertices are unaffected). (See Fig. 1.) 


Fic. 1. The vertex v is an end-vertex with end-line e; f, g are multiple edges. There are 3 cycles (two pentagons and 
one digon). 


A. H. Stone: | was born in London, England, and studied mathematics at Cambridge and Princeton Universities. 
I worked on aerodynamics at the Carnegie Institute of Washington, took up a fellowship at Trinity College, 
Cambridge, lectured at Manchester University, and since then have been at the University of Rochester. My family 
(wife, son, daughter) consists entirely of mathematicians. My research is centered on general topology; my other 
interests include graph theory, classical music and geometrical puzzles. 
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A “tree” is a connected graph that contains no simple closed paths (“cycles’’). In particular, a 
tree can have no multiple edges. Every tree with two or more vertices has at least two end-vertices 
(for instance, the end-points of a simple path of greatest length); the removal of an end-vertex and 
its end-line from a tree will evidently leave a tree. On the other hand, the removal of an arbitrary 
edge e from a tree 7 will disconnect the tree (else e would be part of a cycle in T) into exactly 2 
connected components (since every vertex can be connected by a path to one or other of the 
end-points of e without using e). 


3. Spanning subtrees of K,. A “complete graph” on n vertices, K,,, is a graph with n vertices 
each two of which are joined by exactly one edge; thus K,, has exactly (>) edges. A “spanning 
subtree’ of a graph G is a tree whose vertices are the vertices of G and whose edges are among the 
edges of G. A famous theorem of Cayley [2] asserts that the number of different spanning subtrees 
of K, is n"~*. We sketch one proof of this (due to Priifer [4]), since we shall need to use the same 
method in a more complicated situation. The cases n = 1, 2,3 are trivial, so we assume throughout 
that n > 4. 

Suppose the vertices of K,, are labelled 1,2,...,”. For convenience of expression we do not 
distinguish between vertices and their labels. We associate with each spanning subtree T of K,, its 
“Prifer symbol” P(T); this will be a sequence of nm — 2 integers ¢,,¢,,...,¢,>, where each f, 1s 
one of 1,2,..., (repetitions allowed). P(7) is defined as follows. T has an end-vertex; let s, be 
the least end-vertex (that is, the end-vertex of T with smallest label), and let e, be the 
corresponding end-line of 7. We define t, to be the (label of the) vertex at the other end of e. 
Now delete s, and e, from 7, leaving a tree 7’, and repeat the process on 7?: discard the least 
end-vertex s, of T', and its end-line e,, to leave a smaller tree T*, and define rt, to be the other 
end-point of e,; and so on. The process is stopped when ¢,,_, is defined; the remaining tree T”~ * 
then has just 2 vertices, and of course just one edge, the single edge joining them. Since at each 
stage there are at least two end-vertices available for removal, and the one removed (s,) from T'~* 
is the smallest one, the vertex n is never removed; thus the vertices of T”~* are ¢, 5 and n (or, if 
t,-> =n, then n — 1 and n). 

The symbol P(7) determines the spanning tree T uniquely. For we obtain T (= T°) from T’ 
by noting that s, is the smallest label that does not occur among ¢,,f5,...,¢, 7; and T° consists 
of T! together with the vertex s, and the edge s,¢,. Similarly T’~! consists of 7’ together 
with the vertex s, and edge s,t,, where s, is the smallest label not occurring among 
Sis 83500055, qy bis tia ty+ «ssl,» Thus we can work back from T"~* to T. 


Fic. 2. Tree, with Prifer symbol 23443883. 
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Moreover, an arbitrary sequence ¢,,/,,...,¢,., where each f¢, is in the set {1,2,...,n}, 
determines a spanning subtree T of K,, of which it is the Prifer symbol. For the process just 
described allows us to construct successively T”~*,7"~?,..., 7° = T; the point is that joining on 
a new end-vertex and end-line to a tree produces a tree. Hence 7" *,7"-°,...,7° thus 
constructed will all be trees; and T°, having n vertices, will be a spanning subtree of K,,. 

Thus the number of different spanning subtrees of K, is the number of different Prifer 
symbols, n”~ 7. (See Fig. 2.) 

We shall need one further property of the Priifer symbol, easily derived from the foregoing 
construction (see, for instance, [1, p. 35]): For each vertex v and spanning subtree T of K,,, the 
number of times v occurs in P(T) is d(v) — 1, where d(v) is the degree of v in T. 


4. A question and an answer. Now consider the question: Of the n"~* spanning subtrees of K,,, 
how many contain a given edge e? 


A first answer is as follows. Say the end-points of e are uw and v. If T is a spanning subtree 
containing e, the removal of e splits it into two trees, say 7, and 7,, 7; having some set V, of 
vertices (including u but not v) and 7, having the complementary set V, of vertices (including, v 
but not uw). For a given V,, say with r elements (where 1 < r < n — 1), the number of possible 
trees T, is r’ *, by Cayley’s theorem; and similarly the number of subtrees 7, of K,,, having V, as 
vertex-set, is (n — r)"-"~*. Any T, can be combined with any 7, and the edge e to produce a 
spanning subtree T of K,. Further, for a given r, the number of possibilities for V, is ("= >), 
Thus the answer to the question is 


n-1 


ol anes eC ee 


r=] 


Fic. 3. A graph (G) and its contraction (G - e). 


5. A second answer. Another method is as follows. Form the “contracted” graph “K,, - e” 
from K,, by “shrinking” the edge e to a point. (See Fig. 3.) That is, we discard e and identify its 
end-points u,v. The resulting graph can be described as being the complete graph K,,_, together 
with certain extra edges; each edge joining the vertex u (= v) of K,_, to a different vertex w 1s 
replaced by two edges, corresponding to the edges uw, vw of K,. If T is an arbitrary spanning 
subtree of K,, containing the edge e, the graph T - e produced by this contraction is again a tree 
(it consists of the trees 7, and 7, of § 4, with the vertices wu and v identified), and it spans K,, - e. 
Conversely, every spanning subtree of K,, - e arises in this way from a unique spanning subtree of 
K,, that includes the edge e. Thus the number of trees asked for is simply the number of spanning 
subtrees of K,, - e. Now, if we ignore for the moment the doubling of the edges incident with u, 
these trees correspond to the Prifer symbols on the n — 1 vertices of K,_,. However, the 
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doubling means that when u has degree d in the tree—that is, occurs d — 1 times in the Prifer 
symbol—there are 2% different trees with this symbol, since in reconstructing the tree we have the 
choice of either of two edges on d occasions. The number of these Priifer symbols with exactly r 
occurrences of u is (” ; 3) Cn — 2)"""~?, because an arbitrary set of r of the n — 3 places in the 
symbol will be occupied by u’s, and the other n — r — 3 places can be filled arbitrarily with the 
other n — 2 vertex labels. Thus the number required is 


n—-3 


y (” 7 3\ar(n _ ne 


r=0 
which, by the binomial theorem, is 2(2 + (n — 2))"~? = 2n"~?, 


6. The identity. Equating the two answers, we obtain the identity (proved for n = 4,5,6, ...) 
1 — — 2 r— n—r—-2 n- 
5h (" i} *(n—1r) =n"?, 


The terms with r = 1, r = 2, in the sum ¥ here are, respectively, (n — 1)""? and (n — 2)"~°. 
Also, because of the symmetry property (” n— =?) =(" 7 >) where r + s =n — 2, the sum is twice 
the sum of its first (n — 1)/2 terms if n is odd; if m is even we modify this by halving the middle 


term. Thus we can rewrite this identity as follows: 
If nis odd, sayn = 2k +1, 
n"-3 =(n—1)"""> +(n—2)" > 45>: +(” 7 PY rn a) ee ee 


n—2\,K-2 k-1 
+(7 7 t)k (k + 1) 


If n is even, say n = 2k, 


“3 =(n-1)"” +(n—2)"> 4 a + ("PF ) er 2(n ry o. 


The values n = 5,6,7,8 give, respectively, 


§2= 42 43?, 6 = 59443433, 74 = 644 544 44 
8 = 7+ 6° + 35° + 34°. 


7. Remarks. (1) Another (shorter but more highbrow) derivation of the answer 2n"~°, in § 5, 
can be obtained from the Matrix Tree Theorem (see for instance [1, p. 219]), which allows one to 
express the number of spanning subtrees of K,, that do not contain the edge e as a determinant 
that is not hard to evaluate. 

(2) It would be interesting to have a direct elementary proof of the identity in § 6. At any rate, 
I do not know of one. 
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THE FOUR-OR-MORE VERTEX THEOREM 


ROBERT OSSERMAN 
Department of Mathematics, Stanford University, Stanford, CA 94305 


The four-vertex theorem states that a smooth Jordan curve in the plane has at least four 
vertices. A vertex is a local maximum or minimum of the curvature. Thus, an ellipse has exactly 
four vertices, at the ends of the major and minor axes. This theorem is frequently proved, under 
the additional assumption that the curve is convex, in introductory differential geometry ((2], [5], 
[6], [7], [13], [16], [21]) as an early instance of a theorem requiring global rather than purely local 
arguments. 

The four-vertex theorem (Vierscheitelsatz, Théoreme des quatre sommets) has a long history, 
starting in 1909 with Mukhopadhaya [18], who stated and proved it for convex curves. There 
followed a succession of different proofs, generalizations, and analogies (see the References for a 
sample), including an interesting recent contribution due to Gluck [9], who proved a kind of 
converse. It is therefore somewhat surprising that the argument presented here seems not only to 
be new, but also to have a number of advantages over the usual proofs: 


1. It makes immediately obvious geometrically why the result should be true. 

2. It works not only for convex curves, but with only a little extra effort for arbitrary Jordan 
curves. 

3. It is a direct proof, rather than the usual argument by contradiction. One consequence is 
that curves with only four vertices are seen to be special in certain ways; a large class of curves 
(even restricting to the convex case) must have six or more vertices. 


The essence of the proof may be distilled in a single phrase: consider the circumscribed circle. 
In fact, one way to formulate the result would be the following. 


THEOREM 1. Let y be a smooth (C*) Jordan curve in the plane. Denote by C the circumscribed 
circle about y. Then 

1. y O C contains at least 2 points; 

2. if y O C contains at least n points, then y has at least 2n vertices. 


One could in fact make the second statement more precise: 


THEOREM 1’. In the notation of Theorem 1, if R is the radius of C, and if y Q C contains at least 
n points, then either a whole arc of y lies on C, or else y has at least n vertices where the curvature k 
satisfies k < 1/R, and at least n vertices where k > 1/R. 


We shall discuss at the end of this paper the question of the expected number of points on 
y 1 C. Note that an immediate corollary of Theorem 1 is that whenever y M C contains an 
infinite number of points (as it well may), y must have an infinite number of vertices. 

The proof of Theorem 1 depends on three elementary and general geometric lemmas, as well as 
one lemma particular to the problem: Lemma 4 below. 


LEMMA 1. Let E be a compact set in the plane containing at least two points. Then among all 
circles C with the property that the closed disk bounded by C includes E, there is a unique one of 
minimum radius R > 0. 


Robert Osserman received his Ph.D. at Harvard University under the supervision of Lars Ahlfors. Since then he 
has been on the faculty of Stanford University, with temporary or visiting positions at the University of Colorado, 
New York University, Harvard, and the University of Warwick. During 1960-61 he was Head of the Mathematics 
Branch of the Office of Naval Research; he was a Fulbright Lecturer at the University of Paris (Orsay) in 1965-66 
and held a Guggenheim Fellowship in 1976-77. He spent the past year (1983-84) at the Mathematical Sciences 
Research Institute in Berkeley. Mathematically, his interests have always been geometry-related, starting with 
geometric function theory and Riemann surfaces, through minimal surfaces to general differential geometry, 
isoperimetric inequalities, and most recently, smooth ergodic theory. 
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DEFINITION. The circle defined in Lemma 1 is called the circumscribed circle about E. 


LEMMA 2. If C is the circumscribed circle about E, then any arc of C greater than a semicircle 
must intersect E. 


Note. The proof of Lemma 2, as well as the uniqueness of C follow immediately from the 
observation that assuming the contrary, one could find a smaller circle enclosing E. 


LEMMA 3. Let a smooth oriented curve y have the same unit tangent at a point P as a positively 
oriented circle C of radius R. Let « be the curvature of y. Then if k(P) > 1/R, a neighborhood of P 
on y lies inside C, while if k(P) < 1/R, a neighborhood of P on y lies outside C. 


We now derive Theorem 1 from these lemmas. Let y be a Jordan curve, C the circumscribed 
circle, and R the radius of C. The first statement in Theorem 1 follows immediately from Lemma 
2. To prove the second statement, let P,,...,P, be points of y N C. If these points are ordered 
cyclically along y, we obtain n arcs y,,..., y, of y, each bounded by a pair of points on y 1 C. 


P; 


P, 
Fic. 1 


Assertion. Each of the arcs y, either lies on C, or else contains a point Q, such that the 
curvature x of y satisfies 


(1) K(Q,) <=. 


Before proving this assertion, let us note why the theorem is an immediate consequence. First 
of all, we assume that y and C both are positively oriented, so that the interior is to the left. Then 
at any point P, of y M C, the two curves have the same orientation and y lies locally inside (or 
on) C. It follows from Lemma 3 that 


1 
=>: 
(2) K( P,) R 
Since (2) holds at each endpoint of y,, it follows from (1) that « has a minimum at some interior 
point QO? of y,, and that 
1 
Vac 
(3) Ke ( Q; ) R 
We thus obtain vertices satisfying (3). On the other hand, each arc yz of y between successive 
Q, contains at least one point P, of y M C. In view of (1) and (2), there is an interior point Pz of 
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y, where k is a maximum, and 


1 
, >, 
(4) K-( P “) R 
We thus get m more vertices, thereby proving Theorem 1’, and hence Theorem 1. (We have 
ignored the possibility that one of the y, lies on C, in which case every point of y, is trivially a 
vertex.) 
It remains to prove the Assertion above. We formulate it as a separate lemma. 


LEMMA 4. Let y be a positively oriented Jordan curve, C the circumscribed circle and P,, P, 
points of y \ C. Let y, be the ( positively oriented) arc of y from P, to P,. Then either y, coincides 
with the circular arc P,P, or else there is a point Q, on C satisfying (1), where R is the radius of C. 


Proof. By Lemma 2 we may assume that the positively oriented arc of C from P, to P, is 
included in a closed semicircle; if not, by Lemma 2, there is a point Pj between P, and P, such 
that the arc of C from P, to P5 does le in a (closed) semicircle, and we may apply the argument 
below to the subarc yj of y, from P, to Pj. The corresponding point Q, of yj satisfying (1) will 
also he on yy. 

For convenience of referral, assume that C is centered at the origin, and that P,, P, lie on the 
same vertical line in the right half-plane, with P, above P, (Fig. 2). 


~ 


FIG. 2 


There are two possibilities. Either y, coincides with the circular arc P; P,, or else there is some 
point Q on y, that lies strictly inside C. Consider first the case where y is convex. If we translate 
the circle determined by P,,Q, P, to the left, there will be a last moment at which it intersects y,. 
Let C’ be the corresponding position of the circle, and let Q, be a point of the intersection 
CG’ y,. Since the radius R’ of C’ satisfies R’ > R, and since y, lies locally outside C’ at Q,, it 
follows from Lemma 3 that 


1 1 
KO) <a <R- 


This proves the lemma, and hence the theorem, for the case of convex curves. 


Precisely the same argument holds for general Jordan curves, with one additional caveat: we 
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must use the Jordan property to guarantee that y, has the same orientation as C’ at Q,. (In fact, 
for non-Jordan curves that need not be the case, and the lemma, as well as the theorem, need not 
hold; see Fig. 3.) 


Fic. 3 Fic. 4 


Under the assumption that y, has no self-intersections, the closed curve, consisting of y, 
followed by the arc C, of C going in the positive direction from P, to P,, is a Jordan curve whose 
interior is a domain D included in the interior of C. Note that the positive orientation induced on 
y, as boundary of D coincides with its original orientation as part of y, since at the points P, and 
P,, both coincide with the positive orientation of C. Once again, there are two cases to consider. 
Either y, coincides with the arc of C from P, to P,, or else y, contains a point Q strictly inside 
C. In the latter case, we may choose Q to the nght of the vertical line through P, and P,. (See 
also Remark 1 following the proof.) Then the circle determined by P,QP, has radius R’ > R. 
Translating this circle to the left, we again find a circle C’ containing a point Q, of y, such that 
all further translates of C’ to the left fail to intersect y,. (See Fig. 4.) It follows that the interiors of 
C and C’ intersect in a domain A that is included in D. Thus, both y, and C’ have the same 
orientation at Q,, and we may apply Lemma 2 as before to deduce 


K(Q,) <a <% 


This proves Lemma 4 and Theorem 1 for arbitrary Jordan curves. 


REMARK 1. A slight modification of the argument above produces sharper quantitative results. 
Consider all circular arcs from P, to P, lying inside C. Let C” be the one farthest to the left 
intersecting y,, and let Q” be a point of y, A C”. There are three cases, depending on whether 
O” is to the right, to the left, or on the vertical line P, P,. In the last case, the argument above 
shows that «(Q”) < 0. In the other two cases, C” is a proper circle of radius R”. If Q” is to the 
right of the line P, P,, then again 

y 1 1 
K(Q") < aa < R: 
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Fic. 5 


If QO” is the left, one has the stronger result that 
1 
KO") < —Z, <0. 
For this last, one notes that at Q”, the positive orientation of y, coincides with the negative 
orientation of C” (Fig. 5). 


REMARK 2. It might seem natural to carry out a basically equivalent “dual” approach, using 
inscribed, rather than circumscribed circles. On closer examination, however, the use of inscribed 
circles is considerably less straightforward. In fact, even their definition requires some care, and 
they are generally not unique. A paper of Jackson [12] contains a proof of the four-vertex theorem 
along those lines. He uses a proof by Erdos of the existence of specially adapted inscribed circles 
(p. 568). He also proves a result (Lemma 4.1) more general than our Lemma 4, making use of the 
Gauss-Bonnet theorem. 


REMARK 3. As we said at the outset, the usual proofs of the four-vertex theorem show that the 
presence of fewer than four vertices would lead to a contradiction. Such proofs give no hint as to 
the actual number of vertices present, either on a given curve, or “in general’’. It follows from 
Theorem 1 that a curve with only four vertices must intersect its circumscribed circle in only two 
points. By Lemma 2, those two points must be antipodal points of the circle. Clearly, that is a 
fairly special property, even within the class of convex curves. Further properties that must be 
satisfied by curves with only four vertices have been derived by Jackson [12] and others. We are 
thus led to two questions, each of which may be considered either for the class of smooth convex 
curves, or more generally, for closed Jordan curves. 


1. Is it more likely for a curve to have only four vertices, or to have at least six vertices? 
2. Is it more likely for a curve to intersect its circumscribed circle in only two points, or in at 
least three points? 


Intuitively, one may expect a “tripod” effect; that is, the circumscribed circle is most likely to 
touch the curve at exactly three points (see Fig. 1). We are thus led to formulate the following 
precise problem. 


Is there a natural measure on the space of all smooth closed curves (either convex or Jordan)? In 
terms of such a measure, what are the relative sizes of the sets of curves which intersect their 
circumscribed circles in (a) exactly two points, (b) exactly three points, (c) more than three points? 
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Note added in proof: I should like to thank David Gale, David Hoffman, and Erwin Lutwak for 
interesting comments and references concerning the problem posed at the end of this paper. In 
particular, a paper of Tudor Zamfirescu (Proc. Amer. Math. Soc., 80, 3 (1980) 455-457) studies 
the number of contact points of a convex curve with its circumscribed circle and shows that in the 
sense of Baire categories, “most” convex curves have exactly three contact points. On the other 
hand, an earlier paper of Peter Gruber (Math. Ann., 229 (1977) 259-266) shows that in the same 
sense, “most” convex curves are not C*. Thus, the Baire category approach is not appropriate to 
the four-vertex problem. Similarly, it is easy to see that in the C'-topology, the set of convex 
curves with three points of contact is dense in the set of C' convex curves. However, in the 
C*-topology there is an open set of convex curves with just two points of contact. In fact, any 
convex curve that contacts its circumscribed circle in exactly two antipodal points, and whose 
curvature at those points is strictly greater than that of the circumscribed circle, has a neighbor- 
hood (in the C*-topology) with the same property. Thus, the problem of finding an appropriate 
measure for this question is a subtle one. 
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Arthur Sard (1909-1980). The picture was taken in 1964. 


THE RESULTANT OF SEVERAL VECTORS: AN UNUSUAL INSIGHT 


J. E. SZYMANSKI 
Department of Physics, University of York, Heslington, York, YOI 5DD, United Kingdom 


Of the many basic problems arising commonly in mathematics the one that I am about to 
consider is one of the very few that is so simple that a student with only an elementary knowledge 
of trigonometry can comprehend fully the steps taken at each level of its development. I hope to 
show, however, that even this simple treatment gives a good illustration of how an invariance to 
linear transformations, clearly implicit in the problem on physical grounds, manifests itself in the 
mathematics. It will also be seen how several points relating to computational mathematics can be 
introduced and how, by performing several lines of algebra, we gain new insight into an 
apparently trivial problem. 

The problem in question is, essentially, given several vectors of known magnitudes and 
directions, what is the magnitude and direction of the resultant? The exact solution, of course, is 


Y 


L, 


FiG. 1. The resultant vector of magnitude ||c|| and direction @ for two vectors of length |ja|| and |/b|| at angles a and 


B. 


J. E. Szymanski: I received my degree in Mathematics at York in 1980, but then changed my allegiance and 
moved to the Physics Department where I spent my D. Phil. period working on total-energy calculations of 
molecules and solids under J. A. D. Matthew. I am currently concentrating on semiempirical methods of modelling 
the relaxation and reconstruction of solid surfaces, the mathematical basis of which involves the solution of large (of 
order ~ 250) systems of coupled nonlinear equations. My leisure interests include the history of science, local 
history and applied horticulture (and pest control!). 
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trivial, but consider in more detail the special case of only two vectors. Fig. 1 illustrates this case, 
and immediate inspection of the diagram leads to 


(1) tan@ = L,/L, 

and 

(2) lel = (23 + 13)", 
where 

(3) L, = |lallsin a + ||b||sin B 
and 

(4) L, = |la||cos a + |[bI|cos B. 


These expressions are exact but cumbersome, the evaluation of |le|| and @ involving the 
calculation of four sines and cosines in addition to an arctan, a square root and ten arithmetic 
operations. One obvious way of improving this slightly is to rotate the coordinate axes through an 
angle a, thus removing the need to evaluate sina and cosa at the small cost of two more 
arithmetic operations. Similarly, one could divide (3) and (4) by ||b||, so that only the ratio |la||/||b]| 
enters into the problem. These are both improvements, but it is possible to do better. To show this 
I first present a remarkably good approximate version of (1), and in justifying this approximate 
expression show how the exact expression can be improved. 


TABLE 1 

llal| [{b/I a B Y bn.op lle|| lle llapp 
7 3 2 6 3.200 3.200 9.995 9.995 
7 3 42 6 31.405 31.200 9.591 9.585 
7 3 42 66 49.140 49.200 9.817 9.816 
7 3 78 66 74.407 74.400 9.954 9.954 
11 9 78 66 72.602 72.600 19.892 19.892 
4 1 78 66 75.608 75.600 4.982 4.982 
9 1 78 66 76.806 76.800 9.980 9.980 
199 1 78 66 77.940 77.940 199.978 199.978 
7 3 120 0) 94.715 84.000 6.083 5.394 
2 5 0) 90 68.199 64.286 5.385 5.238 


The approximate version of (1) in question is 
(5) 8.59 = (lalla + |IbILB)/Clall + |IbID- 


This expression will clearly be accurate when a and B (and hence @) are small angles and it is 
valid to replace all five of the trigonometric expressions by their small angle approximations. An 
examination of Table 1, however, leads to some surprises. The table lists the values of @ and 6,,,,, 
for a range of |lal|, ||b||, a and 6 (all angles in degrees), and the first line of the table indicates that 
(5) is indeed a good approximation if a and Bf are small. The rest of the table shows, however, 
that the approximate expression remains remarkably accurate over a wide range of input values, 
the exception being the final two lines of the table. At first, it is not clear from (1) why this should 
be the case. There is no obvious reason why, if power series expansions of all the trigonometric 
functions are inserted into (1), (5) should continue to be so accurate. To see the way forward we 
must look again at (5), and rewrite it in the form 


(6) Dapp = 2(a + B) + 3( a — B)(lall — IPI) /Clall + IPI) 
(7) = $+ DR, 

defining new variables 
(8) S 

(9) D=3(a— 8), 
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and 


(10) R = (lal — |IbI) /Cilall + IIb ID. 
Note that, for finite |la|| and ||bl|, R€(—1,1) and 6, © (min(a, 8), max(a,f)). Assume 
a # B, else the problem 1s trivial. 

Can (1) be written in a similar fashion? Consider the numerator 


(11) |lallsin + |[b||sin B = 3((lall + |[b||) (sin + sin B) + 3(|lall — |[b||)(sina — sin B) 


(12) = (|lal| + ||b||)sin Scos D + (|lal| — ||b||)cos S sin D. 
Likewise, the denominator is 

(13) llallcos a + ||b||cos B = (|lal| + ||b||)cos Scos D —(|la]| — ||b||)sin S sin D, 
and (1) becomes 

(14) tan@ = (tanS + RtanD)/(1 — RtanStanD). 


If (a — B) © (-7,7), then D €(—7/2,7/2) and it is possible to define a unique angle 
D' € (-2/2, 7/2), satisfying |D’| < |D|, such that 


(15) tan D’ = Rtan D. 
(14) immediately becomes 
(16) tan@ = tan(S + D’), 


so that, if a and B € (—7,7], it follows that S € (—7,7) and 8 € (—7,7) and we can write 
immediately 


(17) 6=S+D’, 
since 6 — S € (—2/2,7/2). This gives 
(18) 6 — S = arctan( R tan D). 


We have thus reduced (1) to the stage where it exhibits the same symmetry as (7). Equation (18) 
still gives 0 exactly in terms of |lall, |b||, a and B, but the dependence on the absolute magnitudes 
of the angles has been removed into the variable S. In fact, the number of important variables in 
the problem has been reduced to two. The invariance of the answer to a rotation of the coordinate 
axes, not immediately obvious from (1), is reflected in the fact that the deviation of @ from the 
arithmetic mean of the angles a and B depends only on the relative magnitude of the vectors and 
the angle between the vectors. Similarly the invariance to a uniform, linear scaling is reflected in 
the fact that R depends only on |la||/||b||. It is also clear that (18) is computationally much easier 
to handle than (1), the calculation now only requiring two non-arithmetic operations. 

Having established (18), we find that the derivation of the approximate expression given in (7) 
is now straightforward. The standard power series expansions of tan and arctan [1] lead to 


arctan(Rtan D) = DR(1 + D?(1 — R?)/3(1 + D?[2 — 3R?]/5 


(19) 
+ D*[17 — 60R? + 45R*] + ---)) 


9 


where it is being assumed that |Rtan D| < 1. This means that this expression can only be 
guaranteed for all R if D € [—7/4, 7/4]. An inspection of Fig. 2, however, seems to indicate that 
the range of validity increases as |R| > 1. This is due to the presence of the (1 — R*) factor in all 
the higher order terms of the expansion. The figure shows the true accuracy embodied in the 
approximation of retaining only the first term of the expansion. The apparently poor results 
indicated in Fig. 2 can be easily reconciled with the good results contained in Table 1. The second 
line of the table would indicate a value of 6,,. which underestimates 9 by only 0.65%. This, in 


app 
fact, corresponds to an exact value of S = 24 and underestimating arctan(R tan D) by 2.77%. It 
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should be pointed out that the worst percentage discrepancy indicated in Fig. 2 corresponds to an 
estimate of 4.5°, where the correct result is 5.71°. Indeed, the absolute error is worse for R = 0.5, 
where, for D = 45°, 26.57° is estimated as 22.5°. 


D(°) 
10 20 30 40 
R= 0.9 
—10 
100( 6, — 9) “ 
(0-—S) (4) R = 0.5 
R=0.1 
— 20 


Fic. 2. The “true” accuracy of the approximation discussed in the text, defined by 100*[(6,,, — S) — (@ — S)]/ 
(9 —S). 


To conclude the detailed discussion above, two points remain which need to be mentioned. The 
first point is that at present there is no estimate of the length of the resultant; the second point is 
that I have only discussed the case of two vectors. As regards the length, if (5) is to be used to 
calculate an approximate value of 0, then L, and L,, defined by (3) and (4), will never be 
calculated. This renders (2) useless. The only information available is a (hopefully accurate) 
estimate of the angle of the resultant. This is usually enough. If, instead of calculating the 
projections of the vectors onto the coordinate axes, we simply approximate their projections onto 
the resultant by 


(20) lellanp = Cllall + WI) — (lal pp — &)° + II Gp» — B) )/2, 


then an estimate for ||c|| is obtained which is generally very reasonable. The final two columns of 
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Table 1 show the correct values of |lc|| and those estimated by this method for a range of input 
parameters. 

As regards the case of many vectors, it would be too much to expect a formula such as (5) to 
hold generally, and indeed, if a series of magnitudes and angles in the range (— 7, 7) are chosen at 
random, its performance is not so good. Yet, for reasons that the analysis above makes clear, if the 
vectors are spread over a limited angular range of 120°, say, the results can be remarkable. The 
n-vector approximate equations are 


(21) Oop — » wt, | » lla; || 
i=1 i=1 
and 


(22) lellanp =D lial — 0-5 D |laslI( Bop — &)°- 
t=1 1 


[= 


As an example, ten vectors with magnitudes in the range [1,10] and angles in the range [0, 120] 
were generated at random. The vectors were 


V1: magnitude = 5, angle = 79°, 
V2 : magnitude = 2, angle = 98°, 
V3 : magnitude = 5, angle = 88°, 
V4 : magnitude = 8, angle = 118°, 
V5 : magnitude = 10, angle = 39°, 
V6 : magnitude = 3, angle = 117°, 
V7: magnitude = 8, angle = 21°, 
V8 : magnitude = 7, angle = 66°, 
V9 : magnitude = 4, angle = 40°, 
V10 : magnitude = 6, angle = 93°. 
TABLE 2 
0 Papp lle llellapp 
Vil + W2 84.398 84.429 6.922 6.921 
V1+.---4+V3 85.909 85.917 11.916 11.916 
Vi+---+V4 98.718 98.750 19.171 19.163 
Vi+.--- 4+V5 79.089 78.833 25.707 25.538 
Vi+-:-: + V6 82.846 82.303 28.134 27.933 
Vi+--:+V7 70.381 70.341 32.680 32.248 
Vi+-:- +V8 69.609 69.708 39.663 39.230 
Vi+-:-:+V9 66.986 67.423 43.186 42.734 
Vi+--: +Vi0 70.087 70.069 48.649 48.198 


These were added sequentially, V1 + V2, V1 + V2 + V3, and so on until V1 + V2 + --- +V10, 
and the results are shown in Table 2. The quality of the results makes the approximate formulae 
potentially useful for applications where many vectors are to be summed in such a fashion with 
absolute accuracy not necessarily demanded. Such a situation arises in the field of direct methods 
in crystallography [2], where the exact tangent formula (the n-vector extension of equations 1-4) 
forms one of the crucial equations. In this specific example the formula is used as part of an 
iterative cycle, so that only approximate results are required. Unfortunately, the accuracy is so 
unimportant in this case that a typical approach used at present to save computer time is to 
simply round all angles to the nearest degree and consult a table of sines and cosines held in the 
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computer memory. There would thus be no real effort saved by using a linearized approximate 
tangent formula such as (21). 

Any justification of the accuracy of these formulae for several vectors would, of course, lose the 
clarity of the mathematics that is a feature of the two-vector case. Nevertheless, the approxima- 
tions discussed here, and the insight that can be gained from analysing them, form a good 
example of how even the simplest of problems can lead to a useful formula and teach a great deal 
about a general approach to mathematics. 


I would like to thank N. Anderson for pointing out the approximation and acknowledge useful discussions with 
H. Wright. 
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THE INCLUSION-EXCLUSION PROBABILITY FORMULAS BY TAYLOR’S THEOREM 


SOLOMON LEADER 
Department of Mathematics, Rutgers University, New Brunswick, NJ 08903 


Tedious combinatorial arguments in the proofs of the inclusion-exclusion probability formulas 
can be avoided by exploiting the fact that these formulas give the linear transformations between 
the coefficients of a polynomial and the coefficients of its Taylor expansion about 1. Proof of the 
formulas by this generating function technique requires only elementary calculus, basic properties 
of mathematical expectation, and the key perception that the probability sum s, in the formulas is 
just the expectation of (" ) Let P denote probability and FE expectation. 


THEOREM. Given a random variable N with values in {0,1,...,m}, let p, = P(N =k) and 
r, = P(N > k). Defines, = EL”). That is, 
(1) =D (J) ford = Oty. 

=J 

Then 
(2) y -Ecy “(is fork = 0,1,..., 
(3) mE DE) ser fork = 01. 
and 
(4) = (Yn fori= Otis. 
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If N is the number of events among A,,..., A,, which occur (counting multiplicity if the events are not 
all distinct), then 
(5) s, = P(A, ++: A,) forj =1,2,.... 

1l<iy< +--+: <i.<m 


The basic inclusion-exclusion formulas are the complementary cases k = 0 of (2) and (3): 
-y(- 1)’s, and 1% = y (-1)’s,, 
7=0 
Even for these cases the proofs are nontrivial. (See 4) The case j = 0 of (4) gives E(N) in the 
form s,; = L7_ or, that is sometimes handier than its counterpart s,; = Ly'_, kp, in (1). As 


systems of linear equations (2) is just the inversion of (1), and (3) the corresponding inversion of 
(4). The equivalence of (1) and (2) is given by the following lemma. 


LEMMA 1. For any polynomials of the form 
(6) f(t) = Le pyt* and g(t) = Ls,(t- 1)’, 
k=0 j=0 
each of the conditions (1), (2) is equivalent to 


(7) f(t) = g(t) forall t. 


To prove Lemma 1 note that the jth derivative of t* 
k(k-1)---(k-j+1)t*? = jf) 
So for f given by (6), 


f(t) =i! x4 pat! 


=J 


Thus the expansion of f about 1 has f(1)/j! = oe (Hp as the coefficient of (¢ — 1)’. 
Hence, (1) and (7) are equivalent. Similarly 


g(t) =k! E (J) yr 


So the expansion of g about 0 has 


g() _ - Ey (i]s 


as the coefficient of ¢*. Hence, (2) and (7) are equivalent. 
To get (3) and (4) from (1) by Lemma 1 note that 


(8) Pr=e-1— "es % =1fork<0,andr,=O0fork >m 
So 
(¢-D)Vnth= Vint) = Ven iat*— Le net 
k=0 k=0 k=1 k=0 
= Vina nd -— m= L pyt® -(. — po) 
k=1 k=1 


LY pt -1=f(t)-1 
k=0 
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by (6). Since 5s) = 1 and s,,,, = 0, (6) gives 
g(t) -1= ys (=D =D Ys yei(t — 1)’. 
Thus, (7) is equivalent under (8) to 
(9) wt = Ys .3(t- 1)’ forall ¢. 
k=0 yO 
Applying Lemma 1 with p,,s,, (1), (2) replaced respectively by r,,5,,1, (4), (3) we get the 


equivalence of (3), (4), and (9). Hence, the equivalence of (1), (2), (7), (9) implies the equivalence 
of (1), (2), (3), (4). To get (5) we use a simple combinatorial lemma. 


LEMMA 2. Let 


(10) N=X,+--:+X,,, 
where each term X, takes its values in {0,1}. Then 
N 
(11) [’) = > X, °° X, forj=1,2,.... 
1l<ij<:++<iu.<m 


N is the number of terms in the sum (10) of value 1. Similarly the sum in (11) is the number of 
terms of value 1. Such a term in (11) has all j factors chosen from the N terms X, in (10) of value 
1. So there are just . such terms. Hence (11). 


To prove (5) let X, indicate A,. That is, X, = 11if A, occurs and 0 otherwise. (See [2].) Then 
(10) gives the number N of events 4,,...,A,, which occur (counting multiplicity). Moreover, 
E(X, ++: X,) = P(A, ++: A4;) because X, ++: X; indicates A; --- A, . So we get (5) by 
distributing FE into (11). 
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ON TRANSCENDENTAL LINEAR OPERATORS 


A. CHARNOW AND E. CHARNOW 
Department of Mathematics, California State University- Hayward, Hayward, CA 94542 


1. Suppose V is a vector space over a field K. As usual, the set of polynomials with coefficients 
in K is denoted K[x]. A linear operator T on V will be called transcendental over K if every 
nonzero polynomial f in K[x] has the property that f(7) #0; ie. f(T) is not the zero 
transformation. When the dimension of V is finite, a very elementary argument [2, p. 191] can be 
used to show that there are no transcendental linear operators on V. 

If the dimension of V is infinite, however, a transcendental linear operator does exist. As an 
example, let B be a basis of V, let A = {v,|i = 1,2,3,...} be a subset of B indexed by the 
positive integers, and let C be the complement of A in B. Define T to be the partial shift 
operator on V satisfying T(u,) = v,,, for v, € A and T(w) = w for w Ee C. If f is any nonzero 
polynomial in K[x], then f(T)(v,) # 0, so f(T) # 0. 

If the dimension of V is not only infinite but also greater than or equal to the cardinality of K, 
we have an even more dramatic change from the finite dimensional case. It is the purpose of this 
note to prove the following: 


THEOREM. If V is an infinite dimensional vector space over a field K, and if the dimension of V is 
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tion of the Conics, an attempted reconstruction of the lost Book VIII of Apollonius’s Conics. Ibn 
al-Haytham (965-1041) is immortalized for his construction-solution of the now-called "problem of 
Alhazen (the medieval form of Al-Hasan)," from Book V of his Optics. For the specialist in the his- 
tory of mathematics and science. JK 


Od — 


Joseph W. Dauben. Biblio. of the Hist. of Sci. & Tech., V. 6. Garland, 1985, xxxix + 467 pp. 
[ISBN: 0-8240-9284-8] An annotated bibliography of primary and secondary sources in the history of 
mathematics, organized by period, by discipline, by geography, and by other useful special 
categories. Contains over 2000 items, each with brief commentary. Part of the Smithsonian Series 
of Bibliographies of the History of Science and Technology. An invaluable resource to scholars and 
others interested in the history of mathematics. LAS 
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Combinatorics, S, P. Combinatorics and Algebra. Ed: Curtis Greene. Contemp. Math., V. 34. AMS, 
1984, x + 318 pp, $30 (P). [ISBN: 0-8218~5029-6] Proceedings of the AMS-NSF conference on combina~ 
torics and algebra held at the University of Colorado, June 5-11, 1983. Ten of the papers (includ~ 
ing an introductory survey) impinge on the relationship between representation theory (especially of 
the symmetric group), and combinatorics (the Young tableaux). Remaining papers cover a wide range 
of topics: hyperplane arrangements, convex polytopes, finite lattices. LCL 


Algebra, S(17-18), P. Lecture Notes in Mathematics-1093: Lectures on Formally Real Fields. Alex-~ 
ander Prestel. Springer-Verlag, 1984, xi + 125 pp, $7.50 (P). [ISBN: 0-387-13885-4] These repub- 
lished lectures, given at the Instituto de Matematica Pura e Aplicada in Rio de Janeiro in 1974, 
serve as an introduction to the theory of real fields and its connections with valuation theory and 


quadratic form theory. LCL 


Algebra, T(18: 1), S, P. Linear Algebra Over Commutative Rings. Bernard R. McDonald. Pure & Appl. 
Math., V. 87. Dekker, 1984, viii + 544 pp, $69.75. [ISBN: 0-8247-7122-2] Starting from a fairly 
solid background in ring theory, the treatment serves as an introduction to and a survey of the 
present general theory using commutative rings as scalars. Chapters on free modules, endomorphism 
rings, projective modules, and the theory of a single endomorphism, through results of Bass and 
Kozul. Exercises, extensive bibliography, index. JS 


Algebra, S(18), P. Quadratic and Hermitian Forms. Ed: C.R. Riehm, I. Hambleton. Conf. Proc., V. 4. 
AMS, 1983, xviii + 338 pp, $36 (P). [ISBN: 0-8218-6008-9] A collection of fifteen of the survey lec~ 
tures, including historical remarks by Pfister, presented at a CMS conference on quadratic and her~- 
mitian forms at McMaster University in July of 1983. List of participants and technical lectures by 
title. JS 


Algebra, T(18: 1), S, P. Incline Algebra and Applications. 2Z.-Q. Cao, K.H. Kim, F.W. Roush. Ser. 
in Math. & Its Applic. Halsted Pr, 1984, xiii + 165 pp, $36.95. [ISBN: 0-470-20116-9] "An assembly 
of all known published and unpublished results in incline algebra" relating to Boolean and fuzzy 
algebra including recent work of Cao. Discussion of basic concepts is followed by chapters on topo- 
logical inclines, asymptotic forms, and applications. Lists of open problems, index, references. 
JS 


Real Analysis, T(17-18: 2), S, P, L. Real Analysis: Modern Techniques and Their Applications. 
Gerald B. Folland. Wiley, 1984, xiv + 350 pp, $34.95. [ISBN: 0-471-80958-6] After a brief prolo- 
gue, the treatment is fairly self-contained, proceeding through measure and integration, point set 
topology, and functional analysis, touching much of the fundamental theory en route. Concludes with 
chapters on Fourier analysis, probability, integration on groups and manifolds. Chapter notes, 
references, exercises. JS 


Complex Analysis, T(17-18: 1), S, P, L. Complex Analysis: A Functional Analysis Approach. D.H. 
Luecking, L.A. Rubel. Universitext. Springer-Verlag, 1984, vii + 176 pp, $16 (P). [ISBN: 0-387- 
90993-1] An interesting and unusual second course in functions of a complex variable. The algebra 
of holomorphic functions on a domain, and its dual, rather than individual holomorphic functions, 
are the main objects of study. Nearly all results of standard graduate course are covered. 20 
chapters, each with exercises. PZ 


Complex Analysis, T(18: 1, 2), S, P. Coherent Analytic Sheaves. Hans Gravert, Reinhold Remmert. 
Grund. der math. Wissenschaften, B. 265. Springer-Verlag, 1984, xviii + 249 pp, $36.50. [ISBN: 0- 
387-13178-7}] Sheaves are algebraic objects which represent the possibility of patching together 
local solutions to analytic problems. Coherence involves the possibility of passing from pointwise 
to local properties of solutions. Here sheaf theory in complex analysis is covered from its begin~ 
nings with Leray, H. Cartan, and Serre in the early fifties to the present, from main theorems to 
recent developments. PZ 


Complex Analysis, $(18), P*. Kiyoshi Oka, Collected Papers. Ed: R. Remmert. Transl: R. 
Narasimhan. Springer-Verlag, 1984, xiv + 223 pp, $33. [ISBN: 0~-387-13240-6] Ten fundamental papers 
(1936-1962) and two brief notes by one of the founders of function theory of several complex vari- 
ables. Commentary (in French) by H. Cartan--another pioneer of the theory--follows each paper. A 
guide to Oka’s slightly nonstandard terminology is included. PZ 


Humerical Analysis, P. Efficient Solutions of Elliptic Systems. Ed: Wolfgang Hackbusch. Notes on 
Num. Fluid Mech., V. 10. Friedr. Vieweg & Sohn (US Distr: Heyden & Son), 1984, 154 pp, $20 (P). 
[ISBN: 3-528-08084-1] Proceedings of a January 1984 seminar held at the University of Kiel. LAS 


Functional Analysis, $(18), °P. Amenable Locally Compact Groups. Jean-Paul Pier. Wiley, 1984, x + 
418 pp, $44.95. [ISBN: 0-471-89390-0] Detailed introduction to and overview of amenable locally com- 
pact (a.l.c.) groups. Presents principal characterizations of a.l.c. groups and discusses impor- 
tant examples of amenable groups as well as recent research results in this area. Substantial his- 
torical notes in each section and an extensive bibliography. No problems. BH 


Analysis, P. Elliptic Boundary Value Problems. Ed: Lev. J. Leifman. Transl: J.R. Schulenberger. 
AMS Transl. Ser. 2, V. 123. AMS, 1984, 268 pp, $75. [ISBN: 0-8218-3082-1] Translation of seven 
papers by V.G. Maz”ya, B.A. Plamenevskii, et al. LAS 


Analysis, T*(16: 1), S*, L*. Integral Transforms and their Applications, Second Edition. 8B. Davies. 
Appl. Math. Sci., V. 25. Springer-Verlag, 1985, xv + 411 pp, $28 (P). [ISBN: 0~387-96080-5] 
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Presents the classical integral transforms of Fourier, Hankel, Laplace, Mellin, as well as several 
convolution and integro-differential operators. Numerous applications, including heat and wave 
equations, diffraction problems, and representations of special functions. Provides simple proofs 
of inversion formulae for the Fourier and Laplace transforms, and other results often avoided by 
undergraduate texts: good supplement to courses in differential equations. Exercises. Prere- 
quisites: some real and complex analysis. (First Edition, TR, November 1978.) YN 


Analysis, $(18), P. Group Representations and Special Functions. Antoni Wawrzyfczyk. Math. & Its 
Appl. D Reidel, 1984, xvi + 688 pp, $119. [ISBN: 90~277-1269-7] A study of special functions via 
harmonic analysis on homogeneous spaces. Classes of special functions give rise to certain elements 
of Lie group representations; algebraic properties of the groups reflect analytic properties of the 
classes of functions. Begins with a general introduction to Lie groups and group representations. 
With exercises. PZ 


Geometry, P. Few-distance Sets. A. Blokhuis. CWI Tract, No. 7. Math Centrum, 1984, iv + 70 pp, 
Dfl. 10,80 (P). [ISBN: 90-6196-273-0] The concept of few-distance sets stems from considering a set 
of points on a sphere such that the number of distinct distances occurring between pairs of the set 
is minimal. An extensive theory of few~distance sets has evolved for Euclidean d-space, hyperbolic 
Space and for more esoteric spaces. This tract, presenting results of the most recent research, 
shows the interplay between geometry, combinatorics, algebra, operator theory and analysis in 
answering the questions in the field. SS 


Geometry, S*, L*. The Fourth Dimension: Toward a Geometry of Higher Reality. Rudy Rucker. Houghton 
Mifflin, 1984, xi + 228 pp, $17.95. [ISBN: 0-395-34420-4] A light-hearted but nontheless serious 
exploration--a "fantastic voyage"--of the fourth dimension, blending metaphysics and fantasy with 
geometry and physics: from Flatland to spacetime, from the occult (spirits in the fourth dimension) 
to the fantastic (time travel and telepathy). The promiscuous mixing of fact and fiction may feed 


public confusion about pseudoscience vs. science. LAS 


Optimization, P. Lecture Notes in Mathematics-1091: Multifunctions and Integrands. Ed: G. 
Salinetti. Springer-Verlag, 1984, v + 234 pp, $11 (P). [ISBN: 0-387-13882-X] Proceedings of an 
international conference held in Catania, Italy in June 1983 to explore recent results in the theory 
of multi-valued functions in the context of scientific models whose solutions are not necessarily 
unique. LAS 


Control Theory, T(16-18), P. Matrices in Control Theory, Revised Edition. S$. Barnett. Robert E 
Krieger, 1984, xiv + 192 pp, $14.50. [ISBN: 0-89874-590-X] Solutions to exercises have been deleted, 
errors have been corrected, new references and a few new topics have been added to the 1973 original 
edition(TR, April 1974). LCL 


Probability, T*(15-16: 1), L. An Introduction to Stochastic Modeling. Howard M. Taylor, Samuel Kar- 
lin. Academic Pr, 1984, x + 399 pp, $28. [ISBN: 0-12-684880-7] Designed to bridge the gap between 
elementary probability theory and intermediate level courses in stochastic processes. Topics 
covered include Markov chains in discrete and continuous time, Poisson processes, renewal theory, 
branching processes and queueing systems. Contains many examples and real-life applications. Good 
problem sets. RSK 
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Method of Intensively Monotone Operators. Ashot V. Kakosyan, Leo B. Klebanov, Joseph A. Melamed. 
Springer-Verlag, 1984, x + 175 pp, $9.50 (P). [ISBN: 0-387-13857-9] Intensively monotone operators 
preserve pre-ordering; here, they act on characteristic functions (Fourier transforms of densities). 
Their fixed points (invariant densities) give several characterizations, e.g.: if a random variable, 
X, has the same distribution as weighted averages of randomly many independent random variables, 
each distributed as X, then X must be normal. Clear enough to be accessible to advanced undergradu~ 
ates, but typographical errors follow the strong law of large numbers. YN 


Ce ae ne 


Szynal, A. Weron. Springer-Verlag, 1984, v + 373 pp, $18 (P). [ISBN: 0-387-13388-7] Proceedings of 
an August 1983 conference held in Lublin, Poland. LAS 


Probability, P. Lecture Notes in Mathematics-1080: Probability Theory on Vector Spaces III. Ed: D. 


Statistics, P*. The Collected Works of John W. Tukey, Volume I, Time Series: 1949-1964. Ed: David 
R. Brillinger. Stat. & Prob. Ser. Wadsworth, 1984, ixv + 689 pp, $39.95. [ISBN: 0-534~03303-2] 
First volume of a projected series, divided into subject areas, covering Tukey’s enormous contribu- 
tions to statistics. One of two volumes dealing with time series, this work includes previously 


unpublished papers as well as published ones, with comments on each by the editor. RSK 


Statistics, T(13), S. The Statistical Exorcist: Dispelling Statistics Anxiety. Myles Hollander, 
Frank Proschan. Popular Stat., V. 3. Dekker, 1984, xi + 247 pp, $21.50. [ISBN: 0-8247-7225-3] A 
unique introduction to the everyday use of statistics in decision-making, sampling, analyzing data, 
and estimating probabilities. The text, liberally punctuated with cartoons, quotations, and anec- 
dotes, consists of twenty-six appropriately directed vignettes (none of which makes any use of alge~ 
braic formulas) followed by related exercises. LCL 


Computer Programming, S(15-17), P*, L. Ada for Experienced Programmers. A. Nico Habermann, Dewayne 
E. Perry. Addison-Wesley, 1983, xvi + 479 pp, $20.95 (P). [ISBN: 0~201-11481-X] An introduction to 
Ada covering the major features of the language together with their relevance to software engineer- 
ing. Most chapters present a problem statement and both Pascal and Ada solutions. A particularly 
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valuable book for Pascal programmers who wish to learn Ada. AO 


Computer Programming, S. Nudges, IBM Logo Projects. Steve Tipps, et al. Holt, Rinehart & Winston, 
1984, ix + 273 pp, $16.95 (P). [ISBN: 0-03-000224-9] Nudges is an excellent introduction to IBM 
Logo. Contains many examples, hints, etc., as well as problems to solve. Written in a clear, 
easy-to-read style. TR 


Software Systems, S, L, P. A User Guide to the UNIX System. Rebecca Thomas, Jean Yates. Osborne 
McGraw-Hill, 1985, ix + 716 pp, $18.95 (P). [ISBN: 0-88134-109-6] A greatly expanded revision of the 
1982 original edition (TR, June-July 1982), emphasizing UNIX system concepts in the context of 
excellent tutorials on the most commonly used UNIX commands. Several useful appendices list system 


references~-commands, tables, literature. LAS 


Software Systems, S(15-17), P. CLU Reference Manual. B. Liskov, et al. Springer-Verlag, 1981, 190 
pp, $14.95 (P). [ISBN: 0-387-91253-3] Serves as both an introduction to the programming language CLU 
and a reference manual for the language. The first three printings of this book appeared as Volume 
114 in the "Lecture Notes in Computer Science" series. AO 


Computer Science, T(18), S, P. Algorithms, Software, and Hardware of Parallel Computers. Ed: J. 
Miklo¥ko, V.E. Kotov. Springer-Verlag, 1984, 395 pp, $32. [ISBN: 0-387-13657-6] There has been an 
explosive growth in the construction of parallel computing system in the last 10 years~-e.g., CRAY, 
ETA, and Control Data. However, there has not been a parallel growth in the understanding of how to 
most effectively use these computers--i.e., programming languages, techniques, and algorithms. This 
text contains 12 research monographs covering different areas of on-going research in algorithms for 
parallel and array processing systems. These areas include parallel languages, correctness of 


parallel programs, operating systems for parallel computers, and scheduling techniques. MS 


Applications, P. Lecture Notes in Biomathematics-55: Modelling of Patterns in Space and Time. Ed: 
W. J&ger, J.D. Murray. Springer-Verlag, 1984, viii + 405 pp, $23 (P). [ISBN: 0-387-13892-7] 
Proceedings of an interdisciplinary workshop held in Heidelberg, July 1983, covering models of 
dynamic patterns such as morphogenesis, chemical oscillations, fluid patterns, cell cultures, etc. 


LAS 


Reviewers 


MA: Melissa Anderson, St. Olaf; FA: Fahrad Anklesaria, Macalester; DA: David Appleyard, Carleton; 
RB: Richard Brown, Carleton; JNC: Judith N. Cederberg, St. Olaf; CEC: Clifton E. Corzatt, St. Olaf; 
JD-B: John Dyer-Bennet, Carleton; JRG: Jennifer R. Galovich, St. Olaf; SG: Steven Galovich, Carle- 
ton; BH: Bruce Hanson, St. Olaf; PH: Paul Humke, St. Olaf; RSK: Richard S. Kleber, St. Olaf; JK: 
Joseph Konhauser, Macalester; LCL: Loren C. Larson, St. Olaf; RM: Richard Molnar, Macalester; RWN: 
Richard W. Nau, Carleton; LN: Linda Ness, Carleton; YN: Yves Nievergelt, St. Olaf; AO: Arnold 
Ostebee, St. Olaf; TR: Teresa Reardon, St. Olaf; AWR: A. Wayne Roberts, Macalester; MS: Michael 
Schneider, Macalester; JS: John Schue, Macalester; SS: Seymour Schuster, Carleton; JAS: J. Arthur 
Seebach, Jr., St. Olaf; KS: Kay Smith, St. Olaf; LAS: Lynn Arthur Steen, St. Olaf; MT: Michael 
Tveite, St. Olaf; MU: Milton Ulmer, Carleton; TAV: Theodore A. Vessey, St. Olaf; MW: Martha Wallace, 
St. Olaf; FLW: Frank L. Wolf, Carleton; PZ: Paul Zorn, St. Olaf. 


Section Reports 


An asterisk (*) by the title of a paper indicates that copies of the paper are available from 
the author. Papers presented under special sponsorship as part of joint meetings are so noted in 
parentheses. 


Northeastern Section 


The fall meeting of the Northeastern Section was held on November 16-17, 1984 at Western New 
England College in Springfield, Massachusetts. There were 225 registrants. 


Invited Addresses: 


"Role of Computers in Discrete Mathematics," by John G. Kemeny, Dartmouth College. 

"Role of Discrete Mathematics in the College Curriculum," by Martha Siegel, Towson State Univer- 
sity. 

"When Mathematics Says No! The Nature of Impossibilities in Mathematics," by Philip Davis, Brown 
University. 

"Applications of Discrete Mathematics,” by Fred S. Roberts, Rutgers University. 


Contributed Papers: 


"Open Questions About Clique Iterations,” by Reverend Bruce Hedman, University of Connecticut at 
Hartford. 
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"Hybrid and Purebred Partially Ordered Sets," by C. Edward Sandifer, Western New England College. 
Panel Discussion: 


"Computer Science vs. Data Processing--Two~Year and Four-Year Science Curriculum," by Joseph 
Menard, Community College of Rhode Island (Moderator); Nelson Rich, Monroe Community College; 
Ken Lane, Colby College. 


Student Paper: 


"A Student Perspective on the Teaching of High School Computer Courses," by Mike Mair, Colby Col- 
lege. 


Workshops: 


"Exercises in Computer~-Enriched Learning," by Leo F. Kuczynski, Southern Connecticut State 
University. 

"Teaching Mathematics Using APL," by Edward LeCuyer, Jr., Western New England College. 

"Introduction to Modula~2," by Hoyt Warner, Western New England College. 

"Growing Mazes With Unique Solutions,” by Steven Snover, University of Hartford. 


Southern California Section 


The fall meeting of the Southern California Section was held at San Diego State University on 
November 9-10, 1984. It was a joint meeting with the American Mathematical Society. 


Invited Addresses: 


"Why Don’t You Just Get a Bigger Computer,” by Everett Bull, Pomona College. 
"Symbolic Mathematics by Computer, New Opportunities for the Mathematician," by Fred Dashiell, 
Inference Corporation. 


Short Presentation: 


"Reminiscences of an Immigrant Mathematician," by Michael Golomb, Purdue University and San Diego 
State University. 


The Mathematical Association of America 
The Sixty Eighth Annual Meeting of the Association 
Anaheim, California 


The Sixty Eighth Annual Meeting of the Association was held from January 11 through January 13, 
1985, in conjunction with meetings of the American Mathematical Society, the Association for Sym- 
bolic Logic, the Association for Women in Mathematics, and the National Association for Mathemati- 
cians. The abstracts for the Invited Addresses are as follows: 


The Search for Randomness, by Persi Diaconis, Stanford University. 


How many times must a deck of cards be shuffled until it is close to random? It was argued 
that the answer is "seven." The analysis involves work of Shannon and Reeds and joint work with 
David Aldous. The arguments use "new wave" tools called stopping times which pervade the conversa~- 
tions of young probabilists. The tools involve simple symmetry arguments and handle problems that 
Fourier analysis can’t touch. 


Algorithms, Geometry, and GL(n,Z), by Helaman R.P. Ferguson, Brigham Young University. 

The talk began with the classical Euclidean algorithm (n = 2) and viewed it geometrically and 
algebraically. Two perpendicular lines were visualized geometrically in the plane intersecting at 
the origin. These two lines lie among the points of the plane with integer coordinates. The 
feature of the classical Euclidean algorithm was to construct pairs of integers as points in the 
plane on or close to this pair of lines. The Euclidean algorithm was algebraically regarded as a 
process to construct matrices in GL(2,Z), the 2 x 2 matrices with integer entries and determinants + 
1. Both of the latter properties relate to the geometry of the two lines and pairs of integers. 
The stage was then set for a generalization of the classical Euclidean algorithm to a space of n 2 2 
dimensions. Two perpendicular lines were replaced by a line and its perpendicular hyperplane. This 
configuration was situated among the n~-tuples of integers. The generalization of the Euclidean 
algorithm which was considered here gave an iterative construction of n-tuples of integers on or 
close to the hyperplane and line pair: Algebraically this translated into constructing matrices in 
GL(n,Z), the n x n matrices with integer entries and determinant + l. Then n-tuples of integers on 
or close to the hyperplane or lines were a column or rows of a matrix of GL(n,Z) or its inverse 
(respectively!!!!). A proof of the existence of such iterative algorithms was given; the tools were 
matrix multiplication and mathematical induction. The proof was shorter than the case n = 2 of Book 
X of Euclid’s Elements, indeed shorter than this abstract! A few applications of such algorithms 
Biven, say, decimal representations of real numbers were discussed: construct integral linear 
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combinations of real numbers which are zero or small; find minimal polynomials of algebraic numbers, 
excude small minimal polynomials; give simultaneous independent rational approximations to sets of 
real numbers; factor rational numbers given logarithms of primes. 


Toolkit for Nonlinear Dynamics, by John M. Guckenheimer, University of California, Santa Cruz. 


This was an expository lecture about methods developed to study the geometry of solutions to 
nonlinear systems of differential equations. 


The Many Lives of Invariant Theory, by Gian-Carlo Rota, Massachusetts Institute of Technology. 


A survey of current work in invariant theory from the point of view of the nineteenth century. 
What are invariants for? Which of these are worth classifying? How does one compute them? Do ten-~- 
sors have canonical forms? 


Some Diophantine Problems, by Murray M. Schacher, University of California, Los Angeles. 


Some of the techniques and considerations leading up to Faltings” solution of the Mordell con- 
jecture were outlined. Applications to the Fermat problem and other questions in number theory were 
discussed. 


Combinatorial Set Theory and Its Applications to Topology, by Franklin D. Tall, University of 
Toronto. 


The field of set-theoretic topology has arisen in the past fifteen years as the result of the 
application of the new techniques of combinatorial set theory to the old field of general topology. 
The most notable phenomenon is the proliferation of independence results~~-the usual set~theoretic 
axioms used by mathematicians are provably inadequate to settle many of the major problems in the 
field. Some of the set~theoretic techniques and topological problems were discussed. The implica- 
tions of independence results (which are now appearing in other areas as well) for mathematics in 
general were also addressed. 


SA NS! AS SS aT 


Many problems in analysis can be formulated as questions about the continuity of linear opera- 
tors defined on certain function spaces. Such questions can be answered effectively if one can 
reduce them to the study of these operators on appropriate classes of simple elements phat generate 
the entire space. Examples of such reductions are furnished by orthonormal bases for L‘2) or by the 
collection of characteristic functions in L‘l1). During the past 15 years such spanning sets of sim- 
_ ple functions have been discovered for a large number of function spaces that arise in analysis. 
Using these simple "building blocks" one can explain recent advances in analysis to mathematicians 
who are not analysts. 


Trusting Computers, by Joseph Weizenbaum, Massachusetts Institute of Technology. 


Special Sessions 
Minicourses 


"The Teaching of Applied Mathematics," by W. Gilbert Strang, Massachusetts Institute of Tech- 
nology. 


"APL-~A Functional Computer Language for Mathematicians," by Garry A. Helzer, University of 
Maryland. 


"Teaching Problem Solving," by Alan H. Schoenfeld, University of Rochester. 
Applications of Discrete Mathematics," by Fred S. Roberts, Rutgers University. 
"Groups, Graphs, and Computing,” by sugene M. Luks, University of Oregon. 
"PROLOG," by Frederick Hoffman, Florida Atlantic University. 

"Linear Programming," by Charles E. Haff, University of Waterloo. 


"Teacher In-service Problems (a COMET minicourse)," by Eugene A. Maier, Mathematics Learning 
Center and Portland State University. 


"Constructing Placement Examinations," by Richard E. Prosl, College of William and Mary. 


Panel Discussions 


"How To Give A Successful Talk To Secondary School Students," sponsored by the Committee on 


Secondary School Lectures. 
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"Training Programs For Adjunct Faculty and Teaching Assistants-~An Informal Exchange," spon-~ 


sored by the Committee on the Teaching of Undergraduate Mathematics and organized by Bettye Anne 
Case, Florida State University. 

"Calculus Instruction: Crucial But Ailing," jointly sponsored by AMS and MAA and organized by 
Ronald G. Douglas, University of California, Berkeley, and Stephen B. Maurer, Swarthmore College. 


"Panel on Discrete Mathematics," led by Martha J. Siegel, Towson State University. 


"Panel Report of the Fifth International Congress on Mathematical Education," presided by 
Donald M. Hill, Florida Agricultural and Mechanical University. 


"Panel on Computers in Mathematics Education," organized by Christopher Nevison, Colgate 
University. 


"Panel on Project Equality," co-sponsored by MAA and the National Council of Teachers of 
Mathematics. 
Also presented: "Challenge of the Unknown," an American Association for the Advancement of Sci- 
ence project; "The Electronic Spreadsheet--A Creative Program For Mathematics and Mathematicians," 
by Deane Arganbright, Whitworth College; a special program on computer science was sponsored by the 
ACM/MAA Joint Committee on Retraining for Computer Science. The film "Can You Hear the Shape of a 
Drum?" by Mark Kac was shown in memorium. 


Contributed Paper Sessions 


Teaching Introductory Statistics: Topics, Trends and Techniques. Organizer: Ann Watkins, 
Pierce College. 


"Developing Understanding of Statistical Concepts Through Microcomputer Simulations," by Julian 
Weissglass, University of California, Santa Barbara. 


"Using Industrial Acceptance Sampling To Introduce Hypothesis Testing in an Introductory 
Statistics Course," by Benjamin Fine, University of California, Santa Barbara. 


"Teaching Introductory Statistics," by Clark Kimberling, University of Evansville. 


"A Public Opinion Survey Project for Introductory Statistics Students," 


University of Wisconsin-West Bend. 


by Gary L. Britton, 


"A Computer Based Laboratory for Introductory Statistics," by Elliot A. Tanis, Hope College. 


"Computer Graphics Simulations for Elementary Probability and Statistics," by Florence S. Gor- 
don, New York Institute of Technology, and Sheldon P. Gordon, Adelphi University. 


Making Mathematics Majors Marketable: Undergraduate Training for Nonacademic Careers. Organizer: 
Ann K. Stehney, Institute for Defense Analyses. 

"Making Mathematics Majors Marketable for a Military Service Career," 
U.S. Coast Guard Academy. 


by Ernest J. Manfred, 
"Teaching Mathematical Modeling to Undergraduates at a State School," by Patricia C. Kenschaft, 
Montclair State College. 


"Marketable Mathematics at an Undergraduate College," by Charles F. Peltier, St. Marys Col- 
lege. 


"Clemson’s System of Options for Mathematics Majors," by Stanley M. Lukawecki, Clemson Univer- 
sity. 


"Satellite Project Brings Problems of the Workplace into the Classroom," by Lee Badger, Weber 
State College. 


"Mathematicians at the Air Force Flight Test Center," by Henry F. Bunch, Air Force Flight Test 
Center, Edwards Air Force Base. 


"The Applied Mathematics Laboratory," by Martha J. Siegel, Towson State University. 


Strategies, Tactics, and Techniques in Teaching Lower Division and Remedial Courses. Organizer: Ann 
D. Holley, San Diego Mesa College. 


"Visualizing Functions Without Graphs," by Martin E. Flashman, Humboldt State University. 


"Using Elementary Graph Theory to Teach Non-mathematics Majors to Apply Mathematics in Problem 
Solving," by Sister Helen Christensen, Loyola College. 
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"Math As They See It: Unravelling Our Students” Thought Processes," by Deborah Hughes Hallett, 
Harvard University. 


"Teaching as Though Students Mattered," by Alvin M. White, Harvey Mudd College. 


"Teaching Estimation in Mathematics Courses for General Education," by James R. Smart, San Jose 
State University. 


"A Direct Approach to Integration,” by J.C. Smith, Virginia Polytechnic Institute and State 
University. 


"Techniques to Foster the Development of Problem Solving Skills in Lower Division Courses," by 
Ann M. Chisko, University of Cinncinnati, Raymond Walters College. 


"Word Problems a la Polya--Does it Work?" by Dan Kalman, Augustana College. 


"The Mastery Oriented, Self-paced Instructional System for Pre-calculus Mathematics at Colorado 
State University," by Kenneth F. Klopfenstein, Colorado State University, Fort Collins. 


"First Year Follow-up of a Modular Competency-based System for Teaching Remedial Algebra," by 
E. James Peake, Iowa State University. 


"Students Helping Students," by Betty Mayfield, Hood College. 
"Let the Computer Help," by Erik A. Schreiner, Western Michigan University. 
Teacher Training and Retraining. Organizer: Calvin T. Long, Washington State University. 


"Maximizing the Effectiveness of Teacher Retraining and Retention Efforts," Eunice Krinsky, 


California State University, Dominguez Hills. 


"Mathematics Through Computers: An Inservice Program Designed to Integrate Computers into the 
Teaching and Learning of Mathematical Problem Solving," by Barbara Pence, San Jose State University. 


"A Microcomputer Approach to Teaching Mathematical Abstraction to Teachers," by L.W. Linkous, 
University of Alabama. 


"Student Teacher Correspondence: A Transformation," by Trudy Cunningham, Bucknell University. 


Does Research in Mathematics Learning at the College Level Exist? Organizer: James J. Kaput, 
Southeastern Massachusetts University. 


"An Overview of New Directions in Research on the Cognition of Learning and Using Mathematics," 
by James J. Kaput, Southeastern Massachusetts University. 


"Representation of Mathematical Relationships in Four Countries: A Study of College Students” 
Mathematical Fluency," by Jack Lochhead, University of Massachusetts. 


"Calculus Students” Utilization of Graphs as Visual Representations," by Dena Patterson, North- 
ern Arizona University. 


"College Students” Understanding of the Function Concept," by Tommy Dreyfus, San Diego State 
University. 


"On Mathematics and Language: Why We Need to Teach the Language of the Mathematician When 
Teaching Mathematics," by Hadas Rin, University of California, Davis. 


"Reasoning, Practice, and Success-oriented Structure in Mathematics Achievement," by Seymour W. 
Pustilnik, New York City Technical College. 


"Heuristics and Strategies Used in Solving Equations Containing Elementary Functions," by 
Ronald H. Wenger, University of Delaware. 


"Two Mental Orientations Manifested by Algebra and Calculus Students," by Saleh I. Assad, Erie 
Community College, North Campus. 


"The Research Component in the Mathematics Major at Hollins College," by Caren L. Diefenderfer, 
Hollins College. 


"Research Oversold," by S.C. Bhatnager, University of Nevada, Las Vegas. 
Board and Business Meetings 
The Board of Governors met on January 10, 1985, and the business meeting was held on January 


12, 1985. The major items of business at these meetings will be reported to the membership in 
FOCUS. 
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by (6). Since 55 = 1 and s,,,, = 0, (6) gives 
()-1= ds eV =D Ys yri(t— 1)". 
Thus, (7) 1s equivalent under (8) to 
(9) yt = Ys4,(¢- 1)’ for all ¢. 
k=0 J=O0 
Applying Lemma 1 with p,,s,, (1), (2) replaced respectively by r,,5,,,, (4), (3) we get the 


equivalence of (3), (4), and (9). Hence, the equivalence of (1), (2), (7), (9) implies the equivalence 
of (1), (2), (3), (4). To get (5) we use a simple combinatorial lemma. 


LEMMA 2. Let 


(10) N=X,+-:: +X, 
where each term X, takes its values in {0,1}. Then 
N . 
(11) ["”) - Y X, °° X, forj=1,2,.... 
1l<i,< +++ <1.<m 


N is the number of terms in the sum (10) of value 1. Similarly the sum in (11) is the number of 
terms of value 1. Such a term in (11) has all 7 factors chosen from the N terms X, in (10) of value 
1. So there are just (") such terms. Hence (11). 


To prove (5) let X, indicate A,. That is, X, = 1 if A, occurs and 0 otherwise. (See [2].) Then 
(10) gives the number N of events 4,,...,A,, which occur (counting multiplicity). Moreover, 
E(X, +++ X,) = P(A, ++: A; ) because X, --- x; indicates A, --- A, . So we get (5) by 
distributing EF into (11). 
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ON TRANSCENDENTAL LINEAR OPERATORS 


A. CHARNOW AND E. CHARNOW 
Department of Mathematics, California State University-Hayward, Hayward, CA 94542 


1. Suppose V is a vector space over a field K. As usual, the set of polynomials with coefficients 
in K is denoted K[x]. A linear operator T on V will be called transcendental over K if every 
nonzero polynomial f in K[x] has the property that f(7) #0; 1e., f(T) is not the zero 
transformation. When the dimension of V is finite, a very elementary argument [2, p. 191] can be 
used to show that there are no transcendental linear operators on V. 

If the dimension of V is infinite, however, a transcendental linear operator does exist. As an 
example, let B be a basis of V, let A = {u,|i = 1,2,3,...} be a subset of B indexed by the 
positive integers, and let C be the complement of A in B. Define T to be the partial shift 
operator on V satisfying 7(v,) = v,,, for v, € A and T(w) = w for w € C. If f 1s any nonzero 
polynomial in K[x], then f(T)(v,) ¥ 0, so f(T) # 0. 

If the dimension of V is not only infinite but also greater than or equal to the cardinality of K, 
we have an even more dramatic change from the finite dimensional case. It is the purpose of this 
note to prove the following: 


THEOREM. If V is an infinite dimensional vector space over a field K, and if the dimension of V is 
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not less than the cardinality of K, then there exists a linear operator T on V such that f(T) is 
invertible for every nonzero polynomial f in K[ x]. 


The key to the proof of this theorem lies in two lemmas, the first of which is used only to prove 
the second: 


LEMMA 1. If K(x) is the field of rational functions (ratios of polynomials) over the field K, then 
as a vector space over K we have dim, K(x) = max{card K, No }. 


The set of K-linear maps from V to V is denoted by Hom, (V,V). 


LEMMA 2. Let V be an infinite dimensional vector space over a field K, and assume dim,V > 
card K. If K(x) is the field of rational functions over K, then there is a ring homomorphism 
a: K(x) ~ Hom, (V,V) such that a(r)=r-1, for allr © K, where 1, denotes the identity map 
on V. 


To see how the Theorem follows easily from Lemma 2, let a be the homomorphism described 
above, and set T = a(x). If f is any nonzero polynomial in K[x], then the application of a to 


ff’ =1=f''f yields 
{(T)a(f") = 1p =a fF") f(T). 
Thus f(7) is invertible, as desired. 


2. Proofs of the lemmas 


Proof of Lemma 1. Let S = {(x — a)" '|a © K}. The ‘linear independence of S over K is easy 
to verify. Choose a basis B of K(x) over K containing S. Then 


dim, K(x) = card B > card S = card K. 


Because dim, K(x) is not finite, dim, K(x) > card K implies dim, K(x) = dim, K(x) card K. 
From the infinite dimensionality of K(x) over K we may also conclude that dim, K(x)card K = 
card K(x) [3, Lemma 1, p. 245]. Therefore, it suffices to prove: 


(1) card K(x) = max {card K,&,}. 


Because a polynomial in K[x] of degree n is a sequence of n + 1 elements from K, we have [1, 
Example 8, p. 25] 


card K[ x] = 8.) card K = max {card K,&)}. 


Any element u € K(x) can be written uniquely as u = fg’, where f and g are relatively prime 
polynomials with g monic. The mapping u > (f, g) is an injection of K(x) into K[x] x K[x]. 
Hence 


card K(x) < card K[ x] card K[ x] = max {card K,&}. 
Obviously card K(x) > max {card K,N, }, so (1) is established. 


Proof of Lemma 2. We shall first show that the scalar multiplication on V can be extended in a 
manner which makes V a vector space over K(x). The hypothesis of this lemma is that 
max {card K,8,.} < dim, V. Therefore, by Lemma 1, the vector space K(x) over K satisfies 
dim, K(x) < dim, V, and since dim, V is infinite, we have dim, K(x) dim,V = dim, V. Let B 
be a basis of K(x) over K. Choose a vector space V’ over K(x) with dim,,,,)V’ = dim, V, and let 
B’ be a basis of V’ over K(x). If S = {bb'|b © B, b’ € B’}, then S can easily be shown to be a 
basis of V’ over K. It follows that 


dim, V’ = card S = card Bcard B’ = dim, K(x) dim, V = dim, V. 
Thus V and V’ are K-isomorphic. Let @ be a K-isomorphism of V onto V’. For s © K(x) and 


v € V, define sev € V tobe @ ‘(s(6(v))). Clearly rev = rv if r € K, so © extends the original 
scalar multiplication on V. Moreover, if s € K(x), s’ € K(x), and v € V, then 
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(ss’)ou 


o-*((ss’)6(v)) = @ (s(s'6(v))) =o (5($6°*(s'6(v)))) 
s0(p '(s’(v))) =5e(s’ov). 
The other vector space axioms are verified in a similar fashion; therefore V is a vector space over 
K(x). 

For s € K(x), let a, be the mapping of V into V defined by a,(v) = s°v for uv € V. Clearly, 
a, © Hom, (V,V). If «a: K(x) ~ Hom, (V,V) is defined by a(s) = a, for s € K(x), then a is a 
ring homomorphism satisfying a(r) = r-1, for all r © K, and the proof is complete. 
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DINI TYPE THEOREMS ON POSITIVE SERIES 


PIOTR BILER 
Institute of Mathematics, University of Wroctaw, pl. Grunwaldzki 2/4, 50-384 Wroctaw, Poland 


Consider a convergent series °°_, a, with positive terms a, > 0. Let 7, be the tail r, = 
~~ _,,a,, and let ¢ be a positive number. The classical Dini theorem [1] states that L?°_, a,/n,' 
converges if ¢ < 1 and diverges if ¢ > 1. Our aim in this paper is to extend Dini’s theorem by 
characterizing all functions F: Rt— R* such that D°_, a, F(r,) converges for every positive 
convergent series L°°_, a,. Theorem 1 accomplishes this goal. The complementary results for 
divergent series are contained in Theorem 2. 


THEOREM 1. Let F be a positive function on R*. The following conditions are equivalent: 


(1) For any convergent positive series Xa,,, the series La, F(1r,) is also convergent. 
(2) There exist a § > 0 and an increasing positive function f on (0,8]| such that 


F(x) < (f(x) —f(y))/A* — y) forall0<y<x <8. 


(3) There exist a 8 > 0 and a decreasing function h on (0,8) such that h is integrable on (0,8) 
and h majorizes F: F <h. 


Proof. Observe that (3) implies (1) trivially: 


Ya FUm) < L(t tevrdh(m) < fh) de < 00 


n=m 


for any m such that 7, < 6. 


Now we assume that the condition (1) holds. First we shall show the existence of a 6 > 0 such 
that La, < 5 implies La, F(7,) < 1. If no such 6 exists, then there is a positive sequence (a, ,) 
satisfying 

00 n(1) 
6 = La,, <2 and Ya, 4 F(% 3) > 1 
n=l] n=l 
for suitably large n(1), where r, , = L*_,, a,,.,. Likewise there is a positive sequence (a, >) with 
the property 


00 n(1) 
6, = )}) a,» < min} 2~’,6, — )) a,, 
n=] n=l] 
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Adding up these last inequalities, we get £°_,7, < b, < f(1) and so fj h < oo. The proof of 
Theorem 1 is now complete. 


Below s,, denotes the partial sum s, = U7, _,a,, of a series L°_, a,. 
THEOREM 2. Let F be a positive function on R™. The following conditions are equivalent: 


(1’) For any divergent positive series La,, the series La, F(s,,) is convergent. 
(2’) There exist a 8 > 0 and a decreasing function f on [8, 00) such that 


F(x) < (fly) —f(x))/(x — y) forall <y <x. 


(3’) There exists a decreasing function h, integrable on (8, 0), 8 > 0, which majorizes F: F < h. 


The idea of the proof is similar to that in Theorem 1. If (1’) holds, there exists a 6 > 0, perhaps 
large, such that the inequality s,, > 6 implies UL? ,, 4, F(s,) < 1. The function f is now defined 
on [6, 00) by f(x) = suph*_ 4,4, F(s,), the supremum being taken over all divergent positive 
series La, with ©”, a, = x. If (2’) holds, we put 


g(x) =, int (f(») ~/)) Ax 9), 


and then the function h(x) = sup, ., g(t) is decreasing and integrable on (6, oo). All other details 
of the proof can be written as before. We leave them to the reader. 


Acknowledgement. The author wishes to thank Professor C. Ryll-Nardzewski for suggesting a possible generali- 
zation of the Dini theorems. 
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CLASSIFICATION OF SURFACES 


C.E. BURGESS 
Department of Mathematics, University of Utah, Salt Lake City, Utah 84112 


We use “surface” synonymously with “compact connected 2-manifold without a boundary”, 
1.e., a surface is a compact connected metric space that is locally homeomorphic with the 
Euclidean plane E’. The purpose of this note is to prove the following classification theorem, first 
discovered in the early part of the twentieth century. (See comments and references in [3, p. 53].) 


THEOREM 1. Every surface is homeomorphic with a space obtained by removing a finite number of 
disjoint disks from a 2-sphere and replacing each of them with a Mobius band or a punctured torus. 


The first step of the proof is to model the given surface with a polygonal disk D whose edges 
are identified in pairs. The existence of such a model depends on the fact that every surface can be 
triangulated. (See comments and references in [3, p. 52].) The remainder of the proof usually rests 
on a tedious process of cutting and pasting operations, performed on D, that produce another 
disk D’ so that the given surface results from identifying, in pairs, the edges of D’ in such a 
manner that all vertices of D’ are identified at a single point and any two identified edges are 
adjacent on the boundary of D’. (Discussions of this procedure can be found in any of the four 
references.) 

In this note, we give an alternative proof by using induction on the number of edges of the disk 
D. For this, it will be convenient to use the concept of “a connected sum of two surfaces’’. (See 
pp. 8-10 in [3] and the first definition below.) 


DEFINITIONS AND NOTATION. Let M, and M, be two disjoint surfaces and D, and D, be 
disks in M, and M,, respectively. Let M = (M, — Int D,) U(M, — Int D,), where M, — Int D, 


350 C. E. BURGESS [May 


Fic. 1.2 


FIG. 2 


Call 
b 


FIG. 3 


and M, — Int D, are identified (sewn together) on their boundaries, i.e., for some homeomor- 
phism h of Bd D, onto Bd D,, M is the quotient space 


[( Mm, — Int D,) UCM, — Int D,)] /{( x, h(x)) |x © Bd D,}. 


The surface M is called the connected sum of M, and M, and is denoted by M,#™M,,. (It is well 
known that M,#M, is independent of the choice of D, and D, and that 


M,#( M,#M;) = ( M,#M,) #M,, 


but it is not our purpose in this note to elaborate on these properties.) 

A 2-sphere, denoted by S, is a space that is homeomorphic with the graph of x* + y* +z? =1 
in E>. 

A Mobius band is a space obtained by identifying, or sewing, two opposite edges of a 
rectangular disk as indicated in Fig. 1.1. Equivalently, a M6bius band is obtained by identifying 
two adjacent edges of a triangular disk as indicated in Fig. 1.2. To see that a Mobius band results 
in 1.2, draw a diagonal in the rectangle of 1.1, form the Mébius band by identifying the two ends 
as indicated, and then cut along the simple closed curve that results from identifying the end 
points of this diagonal. 

A projective plane, denoted by P, is a space obtained by sewing a Mobius band and a disk 
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Fic. 4.1 Fic. 4.2 


Fic. 5 


together on their boundaries, as in Fig. 2. Equivalently, a projective plane is obtained with the 
identification indicated in Fig. 4.2. 

A torus, denoted by 7, is a space homeomorphic with the Cartesian product S' x S', where S? 
denotes a circle. A torus is obtained by identifying the four edges of the square disk as indicated 
in Fig. 3. 

Let D(n) denote a disk with n edges on its boundary, where n is even, and let M(n) denote a 
surface that is obtained by identifying, in pairs, the edges of D(n). 

Two identified edges of a disk are called a twisted pair if the identification involves the same 
direction for the two edges around the boundary of D. The pairs labeled with c and d in Fig. 1 
and with a in Fig. 6.1 are illustrations of twisted pairs. 

Two pairs of identified edges of a disk D are called separated pairs if the two edges in one pair 
separate the two in the other pair on the boundary of D. The pairs labeled with a and b in Figs. 3 
and 7.1 and with b and c in Fig. 6.1 are illustrations of separated pairs that are nontwisted. 

The second step in the proof of Theorem 1 is to present two lemmas that are used in the 
inductive procedure described in the fourth step. 


LEMMA 1. If M(2) is a surface obtained by identifying the two edges of a disk D(2), then M(2) is 
either a sphere or a projective plane. 


There are two different identifications as indicated in Fig. 4, with the two resulting surfaces— the 
sphere and the projective plane—labeled S and P. 


LEMMA 2. If A is an annulus with n edges (n even) on one component C, of its boundary, then 
any space obtained by identifying these n edges in pairs is homeomorphic with a punctured M(n), 1.e., 
there is a disk D in M(n) such that the resulting space is homeomorphic with M(n) — Int D. 
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This is easily seen by identifying the boundary of a disk D with the other component C, of the 
boundary of A to obtain a disk D’ with boundary C,, where D’ = D U A. Identify the edges of 
D’, in pairs, to obtain a surface M(n) that contains D. The special case where the identification 
of the edges of D’ produces a torus is illustrated in Fig. 5. 

The third step in the proof is to notice that Theorem 1 can be re-stated as follows: 


ALTERNATIVE STATEMENT OF THEOREM 1. If M is a surface, different from a sphere, then 
M = M,|#M)# --: #M, where for each i, M, is either a projective plane or a torus. 


The fourth, and final, step is a proof of this alternative statement, using induction on the 
number of edges in the disk D obtained in the first step. Let n be a positive even integer such that 
for any even integer k, where 2 < k <n, any surface M(k) resulting from identifying, in pairs, 
the edges of a disk D(k) is a connected sum as required in the conclusion of the alternative 
statement of Theorem 1. (See the description above of the notation D(k) and M(k).) We wish to 
show that any surface M(n) is such a connected sum. Let D(n) be a disk with the edges 
identified to obtain M(n). We assume, by Lemma 1, that n > 4. The inductive argument is 
separated into four cases, with some overlapping among them. 


CASE 1. There is a twisted pair in the identification of the edges of D(n). Identify the two 
edges in such a twisted pair to obtain a Mébius band B with n — 2 edges on its boundary. There 
is an annulus A in B such that one component of the boundary of A is the boundary of B. (See 
Fig. 6 for an illustration of this where n = 6.) Let J denote the other component of the boundary 
of A. Notice that the closure of B — A is a Mdbius band and that J is its boundary. Identify the 
edges in the boundary of B as specified for D(n). By Lemma 2, it is now easy to see that 
M(n) = P#M(n — 2), where J becomes the identified boundaries of the two disks that are 
removed to obtain the connected sum of P and M(n — 2). 


CASE 2. There are two separated pairs of edges of D(n) that are nontwisted. If n = 4, then 
M(n) is a torus. (See Fig. 3.) If n > 6, it is easy to see, by use of Fig. 7, that a punctured torus 
results from identifying the edges in two separated pairs of edges that are nontwisted. By Lemma 
2, M(n) = T#M(n — 4). 


CASE 3. There is a nontwisted pair of adjacent edges in D(n). A disk D(n — 2) is obtained by 
identifying these two adjacent edges as indicated in Fig. 8. Thus M(n) = M(n — 2). 


CASE 4. There is a nontwisted pair of nonadjacent edges of D(n) that does not separate any 
other identified pair. An annulus A is obtained by identifying the edges in some such nontwisted 
pair, as indicated in Fig. 9. It is easy to see that n > 6. There are two positive even integers h and 
/ such that h + /=n-— 2, where h denotes the number of edges in one component of the 
boundary of A and / the number in the other component. Each edge in each of these two 
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components must be identified with an edge in the same component. By Lemma 2, 
M(n) = M(h)#M(1) = M(n - 2). 


REMARKS. The one-sided property cf a Mébius band can be used to show that the connected 
sum of three projective planes is topologically the same as the connected sum of a torus and a 
projective plane. This can be used, as in [3, pp. 26-29], to obtain the following more specific 
classification of surfaces. 


THEOREM 2. Any surface different from a sphere is either a connected sum of a finite number of 
tori or a connected sum of a finite number of projective planes. 


The inductive procedure in the proof of Theorem 1, combined with Theorem 2, furnishes the 
following information about the number and types of surfaces that can be obtained from the 
various identifications of the edges of a given disk. 

If k is odd, then (3k + 1)/2 topologically distinct surfaces can be obtained from D(2k). The 
connected sum of k projective planes is the only such surface that cannot be obtained from 
D(2k — 2), where k > 1. 

If k is even, then (3k + 2)/2 topologically distinct surfaces can be obtained from D(2k). In 
this case, both the connected sum of k projective planes and the connected sum of k/2 tori can 
be obtained from D(2k) but not from D(2k — 2). 
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AN INEQUALITY CONCERNING MINORS OF A SEMIDEFINITE MATRIX 


BORIS KIMELFELD 
Konakousky pr. 15, apt. 23, Moscow 125565, USSR 


A B 
Let S -|( RoC 
its submatrices of size m by m, m by (n—m), and (n—™m) by (n-—™), respectively 
(0 < m<n). Here M* denotes the complex conjugate of the transpose of a matrix M. 


be a Hermitian n by n matrix of complex numbers, where A, B, and C are 


THEOREM. If S is semidefinite (either positive or negative) then 
(1) ldet S| < |det A - det C|. 
For definite S equality in (1) holds if and only if B = 9. 


Inequality (1) was proved originally by E. F. Beckenbach and R. Bellman in [1], Chapter 2, 
§ 10 and 14, but their proof is based on a representation of the determinant as a multiple integral. 
In this note we give an alternative proof of (1) which is purely algebraic. 
2 2 1 
REMARK. The example of S=|2 2 1] with m= 2 shows that the word “definite” in the 


1 1 2 
last assertion of the Theorem cannot be replaced by “semidefinite”. 


Using the Theorem for m = 1, and proceeding inductively, one derives the following well-known 
corollary. 
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COROLLARY. If S is semidefinite, then \det S| is less than or equal to the absolute value of the 


product of all elements of the main diagonal of S. For definite S the equality occurs if and only if S is 
diagonal. 


Proof of the theorem. Since (1) holds trivially when det S = 0, and both sides of (1) do not 
change when S is replaced by —S, we may reStrict ourselves to the case when S is positive 
definite. In this case C and, consequently, C~' are positive definite, and S = U*SU, where 


Em 0 ~_ (A 0 , 
u=( | | § =| | and A =A — BC”'B*. 
CRY EB) 0 C 


(Here E, denotes the identity matrix of order k.) Since detU = 1, we have det S = det S = 
det A - det C. So (1) is equivalent to the inequality det A < det A, and equality in (1) holds if and 
only if det A = det A. Positive definiteness of S implies that both A and A are positive definite 
and, consequently, can be reduced simultaneously to the diagonal form, meaning that there exists 
an m by m matrix Q with detQ =1 such that O*AQ = diag(a,,a,,...,a,,) and Q*AQ = 
diag(a,,a,,...,4,,). (In [2], Chapter 4, Sect. 12, this fact is proved for real symmetric matrices, 
but the proof holds equally for Hermitian matrices.) The differences a, — 4, (i = 1,2,...,m) are 
the eigenvalues of the Hermitian matrix R = Q*(BC 'B*)Q = (B*Q)*C_'(B*Q), which is 
positive semidefinite because of the positive definiteness of C~'. Hence a, > 4, > 0 for all 
i= 1,2,...,m, and therefore 


det A = detO*AO =a,a,-::a 
This proves (1). 

Now we see that the sign of equality in (1) holds if and only if all eigenvalues (a, — @,) of the 
Hermitian matrix R vanish, that is, if and only if R=0. It remains to show that R= 
(B*Q)*C” '(B*Q) = 0 only if B = 0. To prove this, suppose that B # 0, so B* # 0, and there 
exists an (n — m)-dimensional column-vector x such that y = B*x # 0. Letting z = Q~'x, we 


have z*Rz = y*C_'y > 0 because of the positive definiteness of C~', and consequently R + 0. 
This completes the proof. 


> 4,4, --- G, = detQ*dQO = det A. 
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MISCELLANEA 
147. The Mathematics of Fiction 


[The writer] must think out completely, as coolly as any critic, what his fiction means, or is 
trying to mean. He must complete his equations, think out the subtlest implications of what he’s 
said, get at the truth not just of his characters and action but also of his fiction’s form, 
remembering that neatness can be carried too far, so that the work begins to seem fussy and 
overwrought, anal compulsive, unspontaneous, and remembering that, on the other hand, mess is 
no adequate alternative. He must think as cleanly as a mathematician, but he must also know by 
intuition when to sacrifice precision for some higher good, how to simplify, take short cuts, keep 
the foreground up there in front and the background back. 


-John Gardner, The Art of Fiction, 
Alfred A. Knopf, New York, 1983, p. 7. 
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A HEURISTIC FOR THE POISSON INTEGRAL FOR THE HALF PLANE AND SOME CAVEATS 


RIDGLEY LANGE 
Department of Mathematics, Central Michigan University, Mt. Pleasant, MT 48859 


ROBERT A. WALSH 
1203 1/2 Washington Avenue, Golden, CO 80401 


Let H(x, y) be a real-valued function depending on the real variables x and y defined on the 
upper half plane y > 0. Let AH denote the Laplacian H = H,, + H,,,. In this note we want to 
indicate an intuitive solution to the Dirichlet problem on the half plane accessible to a student 
who has had a one semester course in complex variable. Specifically, we give a new heuristic to 
derive the Poisson integral solution of the boundary value problem 


(1a) AH(x,y) =0, —0 <x<o;y>Q0, 
(1b) H(x,0) =f(x), —0 <x <0, 
where f(x) denotes the boundary values of the (unknown) harmonic function H on y > 0. If f is 


bounded and piecewise continuous on (— 00, 00), the well-known solution H(x, y) to (1) at (x, y) 
is given by the integral formula (y > 0) 


f(s) dé 
(2) f(x,y) -2 f° Oe 
~0(x—£) +y? 
We show how to construct (2) from simple cases of Problem (1) whose solutions can be verified by 
inspection. 

For example, suppose boundary values (1b) are defined by the step function f = 1 on (— 0,0] 
and f=0 on (0,00). For (x, y) in the upper half plane, it is easy to see that H(x,y)= 
7 arg(x + iy) is harmonic and satisfies the boundary condition. If a is any real number and 
f=1o0n(—o,a] and f = 0 on (a, oo), then the corresponding solution is 


(3) A(x, y) = —arg(z ~ a), 


where we write z = x + iy. Next suppose f = 1 on [a, b] and vanishes off [a, b]. The linearity of 
A implies that the solution for (1) is 


(4) A(x, y) = =[arg(z — 6) — arg(z — a)]. 


Equation (4) is our basic building block for the heuristic for Equation (2). 

Now let f be bounded and piecewise continuous on (— 00, co) and fix z= x + iy, y > 0. Let 
A <0 < B and partition [A, B] into a subintervals [é;_,,&,], 7 = 1,2,...,7,& <& < ++: <&),, 
such that no such interval contains a discontinuity of f in its interior. Let f (&, ) be a constant 
approximation to f on the jth interval. In this way we approximate f on [A, Bl by a sum of n 
step functions. Using the linearity of A again, we may reasonably say that the Riemann-Stieltjes 
sum 


(5) 


¥ #(&)[arelz - &,) — are(z - §)] 


1 
TOI 
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approximates the desired solution to (1), where at this stage we simply ignore the boundary values 
off [A,B] for A and B sufficiently far from zero. Letting n — oo in the sum (5) gives the integral 
approximation 


(6) A(x») ~ = J? 1(€)d(are(z ~ §)). 


Assuming further that our error is due to the finiteness of A and B, we predict the exact solution 
to be 


(7) A(x,y)= 2” fla alare(z ~ 8). 


To see that this leads to (2), we perform the following computation using the usual principal 
branch of the logarithm: 


(8) d(arg( z ~ €)) = Geare(z ~ €) dé 


= Im 5 In(z — 2) dé 


= Im| =] dé. 
Now (8) can be rewritten as 
. ydé 
9 d — £)) = 2 __, 
(9) (are(2 8) = 


and (2) follows from (7) and (9). 
As a function of z the integral over [A, B] in (6) is, according to (8), the imaginary part of the 
analytic function 


hence (6) represents a harmonic function on y > 0. Moreover, an easy estimate by means of the 
boundedness of f on (— oo, co) and identity (9) shows that the integrals (6) actually converge to 
integral (2) uniformly on discs in z in y > 0. This suggests that (2) represents a harmonic solution 
to Problem (1). 

Tacit in this last heuristic step is that a system of harmonic functions converging uniformly on 
neighborhoods of their common domain have a harmonic limit. This rather vague description can 
be made precise enough to give a proof that (2) is a harmonic solution. From the more familiar 
Poisson integral for a disc, it follows that h 1s harmonic on an open region D if and only if it 
satisfies the mean value property, 1.e., 


(10) h(z)) = = [a(x + re'®) dO 
0 


on each disc |z — z,)| < r contained in D. (See [1, p. 234], for details.) It is evident that the mean 
value property is preserved under uniform convergence, thus the existence of solution (2) as a 
harmonic one can be established. 

To complete a rigorous solution, it must also be proved that (2) represents the only bounded 
solution to (1). (There may be infinitely many unbounded solutions, e.g., if f = 0 in (1b), then 
(x,y) =y and H,(x, y) = 2y are distinct unbounded solutions to (1).) The uniqueness of the 
bounded solution may be established along the lines of [1], Exercise 2, p. 169, with the help of 
(10), and the elementary facts that a nonconstant harmonic function is nonconstant on each disc 
and has no local extrema on its domain. 

We close with some cautionary remarks about boundary behavior on the half plane. Using 
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representation (2) with elementary proofs, one can show that at each point x, of continuity of f 
in (1b) 


lim, (x99) = f(%0) 
y> 
and for a jump discontinuity of f at xo, 
; 1 
(11) Jim, H(x9,¥) = 5 L f°) +f) 


so the boundary limit is the average of the left and right-hand limits of f at x). Property (11) 
looks like one students learn in elementary Fourier expansions, but some new subleties are worth 
pointing out. For example, the limit in Equation (11) must be radial, or normal to the boundary; 
consider Equation (3) above with a = 0. Here (11) becomes (as we expect) 


, 1 1 
lim A(0,y) = lim, —arg(iy) = 5; 
y0* yo>o* 7 2 
but if (x, y) — (0,0) along the diagonal x = y, we get 
lim f(x,y) =+ lim are(x + xi) =+. 
(x.y) (0,0) 7 x—+0* 4 


For a discontinuity at x9 which is not a jump, the limit lim, _,.+H(x9, y) need not have a 
“reasonable” value. A student might be tempted to guess the radial limit at x, = 0 for the 
boundary function f(£) = sin’(1/é) for £ > 0 and f(é) = 0 for € < 0. By (2) if y > 0, 


sin’ (1 : 
A(0, y) = ain = . sin’ (1/&) dé, 
and the last integral can be written as 
; 2% dt 2 r~ cos(2t/y) dt 
12 H(0,y) =— -=-f[ ——_. 
(12) (09) za) r+) za te +1 


The value of the first integral in (12) is clearly 1, while the second can be found by contour 
integration or otherwise to be e °””. Hence 


lim H(0,y) =1, 
y>0" 


a result that “intuition” cannot account for. Moreover, if we take f(£) = sin’(1/é) for £ # 0, 
then the radial limit as (0, y) ~ (0,0*) is found by a calculation similar to (12) to be +! 
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148. MISCELLANEA 
Definition 
In fact, I am rather inclined to agree with those who maintain that the moments of greatest 
creative advancement in science frequently coincide with the introduction of new notions by 


means of definition. 


~Alfred Tarski, “The semantic conception of truth and the foundations of semantics,” 
from Philosophy and Phenomenological Research, vol. 4, 1944. 
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ELEMENTARY PROBLEMS 
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E 3088. Proposed by Solomon W. Golomb, University of Southern California. 
Show that, for every positive integer n, 


" k-klion 
y (i) a7. 
k=1 


nh 


E 3089. Proposed by Eliot T. Jacobson, Ohio University. 

Determine all positive even integers n having the following property: given any integer b with 
1 < b<n and gcd(b,n) = 1, there is a solution to the congruence (b — 1)x = 5 (mod n). 

E 3090. Proposed by Alfonso Villani, Seminario Matematico, Catania, Italy. 


Let f: [0, +00) — [0, + co) be a continuous function. Let h > 0 be given and let g: [0, + 00) 
— R be defined by 


f(x) 
f(x+h)-1- 


(a) Show thatif liiminfg(x)> 0, then lm f(x) =0. 
Xx + 00 xX + 00 


g(x) = 


(b) Show that if hmsup g(x) <0, then lm f(x)= +0. 
x7 + 


x— +00 0° 


(c) Assume that lm g(x) = (0. Is it possible for f to satisfy both liminff(x) = 0 and 
Xx + 00 


Xx +00 
lim supf(x) = +00? 
X— +00 
(d) How, in (a) and (b), can the assumption “f is continuous” be weakened? Can this 
assumption simply be omitted? 
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E 3091. Proposed by Calin P. Popescu, student, Bucharest, Romania. 


Equilateral triangle A BC is inscribed in triangle XYZ, with A between Y and Z, B between Z 
and X, and C between X and Y. Show that X¥4 + YB+ ZC < XY + YZ + ZX. Is equality 
possible? 


E 3092. Proposed by Rick Luttmann, Sonoma State University. 


Problem A-6 on the 1982 Putnam Competition was: 


“Let o be a biyection of the positive integers, that is, a one-to-one function from {1,2,3,...} 
onto itself. Let x,, x5, x3,... be a sequence of real numbers with the following three properties: 


(1) |x, | 1S a strictly decreasing function of n; 
(11) |o(n) —n|-|x,|~ 0Oasn— oo; 
(111) Lim, ook =1%k = 1. 


Prove or disprove that these conditions imply that 
n 
lim Y xg) = 1.” 
no? OO k=] 
Solve the problem with (11) replaced by: 


(11)’ |o(2) — n+ |XQ¢n)| 7 0 as n > ©. 


SOLUTIONS OF ELEMENTARY PROBLEMS 
The Extended Erdés-Mordell Inequality 


FE 2462 [1974, 281]. Proposed by Huseyin Demir, Middle East Technical University, Ankara, 
Turkey. 


Let P be a point interior to the triangle A,A,A,. Denote by R; the distance from P to the 
vertex A,, and denote by 7, the distance from P to the side a, opposite to A,. The Erdés-Mordell 
inequality asserts that 


R, + R, +R; >2%(4+nH4+ 47). 


Prove that the above inequality holds for every point P in the plane of 4,A,A, when we make 
the interpretation R, > 0 always and r, is positive or negative depending on whether P and 4A, 
are on the same side of a; or on opposite sides. 


Editorial note: Professor Clayton W. Dodge, Department of Mathematics, University of Maine 
refereed the “solutions” submitted to this problem in 1974 and found that there were no solutions. 
Since that time Professor Dodge himself has solved the problem. His solution appears in Crux 
Mathematicorum, vol. 10, no. 9, November 1984, pages 274-281. 


Another Manifestation of the Cauchy-Schwartz Inequality 


FE 2959 [1982, 498]. Proposed by Jack Garfunkel, Flushing, New York. 


Triangle ABC is inscribed in a circle. The medians of the triangle intersect at G and are 
extended to the circle to points D, E and F. Prove: AG + BG + CG < GD + GE + GF. 


Solution by P. Boente, Universitat Mannheim, West Germany. We prove a generalized form of 
the proposition, namely: Let P,, P,,...,P,, be points of an n-sphere S, and G their center of 
gravity. If the extensions of P,G intersect S at Q;, then 
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Proof. We denote the center of S by M, its radius by r and by (.,.) and ||.|| the natural inner 
product in R” and its associated norm, respectively. Then we have: 


mr = + (PM) = YPM |? 
i=1 i=1 


= Vi \|PG + GM|| 
i=1 

= ¥ (PG) +2| & PG, GM|+m(GM)y 
i=1 i=l] 


m(GM)° + ¥) (2G)’, 
i=1 
since Lit iP.G = 0, by the definition of G, whence 
lo 2 2 ; 
my L (PG) =r°—(GM)°=PG-QG forall j, 
by the chords theorem. Division by P,G and summation yields 


m ‘om 


“(P.G)'Y (RG) 


m 
GQ; = 
=1 j=l i=1 


| 
™ 


J 


m 


¥ (RG) *E (ROY 
y 


ro) Y (PG) 


i=1 


~1 m 2 
G| = ra | 
MM \ j=] 


i 
m 


V 
5 
f 


WV 

3 
a 
iMs 

a 


where the former inequality is given by the relation between the arithmetic and harmonic mean 
and the latter one is a well-known application of the inequality of Schwarz. 
Also solved by 27 other readers and the proposer. A number of readers referred to E 2505 [1974, 1111; 1976, 


59-60]. A number of readers also noted that equality occurs if and only if triangle 4 BC is equilateral. 


An Inequality for Acute Triangles 


E 2968 [1982, 697]. Proposed by George Tsintsifas, Thessaloniki, Greece. 


The points Aj, 45, A3 lie on the sides A,A,;,A3A,,A,A, of an acute angle triangle A,A,A;, 
respectively. Show that 


2)\ a’cosA, > > a,cosA,, 


where a,, a, a, are the sides of the tnangle A,A,A, and aj,a‘, a4 are the sides of the triangle 
A} A5 A. 
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Solution by O. P. Lossers, Eindhoven University of Technology, Eindhoven, The Netherlands. 
We put 
A\ A, = pi, Aj A3 = G1, A9A3 = Pr, AZA1 = G2, AZA1 = D3, AZA2 = 93. 


Since the distance between Aj and A’ is at least equal to the distance of their orthogonal 
projections onto A,A, we have 


a, > a, — g,COS A, — p,cosA,, 
a > ay — g,cOS A; — p3Cos A,, 
a4 > a, — q,COS A, — p,cosA,. 
Hence, since cos A, > 0, 
2aicos A; > Xa,cos A; — a,cos A,cos A, — a,cos A,cos A, — a,cos A,cos A,. 
With the circumradius of AA,A,A, as the unit of length we find 
2a,cosA, = Xsin2A, = 4sin A,sin A,sin A,, 
and 
a,cos A,cos A, + a,cos A,cos A, + a,cos A,cos A, 
= 2cos A,(sin A,cos A, + cos A,sin A,) + 2 sin A,cos A,cos A, 
= 2sin A;(cos A; + cos A,cos A, ) 
= 2sin A,sin A,sin A; = 42a,cosA,. 
This leads to 
22a'cos A, > 2a,cosA,, 


with equality if and only if in each of the inequalities for aj, a4, a, equality occurs, that is if and 
only if Aj, A, and A4 are the midpoints of the sides A, A,, A3A,, 4,45, respectively. 


Also solved by J. Dou (Spain), I. E. Leonard, W. A. Newcomb, K. L. Stellmacher and the proposer. 


ADVANCED PROBLEMS 


Solutions of these Advanced Problems should be mailed in duplicate to Professor D. H. Mugler, Department of 
Mathematics, University of Santa Clara, Santa Clara, CA 95053, by September 30, 1985. The solver’s full 
post-office address should be on each sheet. 


6496. Proposed by Ryszard Szwarc, University of Wroctaw, Poland. 


Let T be a bounded operator with spectral radius 1 on a Hilbert space H, and let c > 1. Prove 
that there is an invertible operator A on H such that ||ATA~+|| < c. 


6497. Proposed by Richard Askey, University of Wisconsin. 
Let 0 < g < 1, Rea > O and Reb > 0. Show that 


(—1t5 d)0(-4/t3 Goo t T(a + b) 
and 


(2) peti ide de. —lopq KOTO) 
0 (HB q)ol-d/tao  * 1—q T(a+b) ’ 
where 
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(o@) 


(x; q)o:= [] GQ —- xq"), 


n=0 


T,(x):=(93 geo(l — 4)” */( 45 Q)ees 


and 
[10 a= 0-4) ¥ Ha")a" 


These extend the gamma function identity 


ia dt _ T(a)T(d) 


(1+2)?(1+¢)4 Tard) 


to g-gamma functions (for properties see R. Askey, Ramanujan’s extensions of the gamma and 
beta functions, this MONTHLY, 87 (1980) 346-359). 


SOLUTIONS OF ADVANCED PROBLEMS 
Degrees of Irreducible Polynomials over a Field 


6043 [1975, 766; 1977, 304]. Proposed by Brian Peterson, University of California, Berkeley. 


Let P be a nonempty proper subset of the primes. Consider algebraic extensions F of the 
rationals Q with the property 


(*) Every x in F has degree over Q divisible only by primes in P. 


A Zorn’s lemma argument shows that there exist maximal extensions satisfying (*). Is such a 
maximal extension unique up to isomorphism? 


Solution by M. J. Pelling, University College, London. The published solution (1977, 304) is 
invalid since if u = 10u, = 10(275° + 5%v)'””, then v = —2? + u?/1075° © Q(u) contrary to 
assertion. One can even show that the solver’s method never works for P = {5} since it is 
impossible to have two irreducible quintics f(x), g(x) over Q such that g(x) factors over QO( wu), 
where uw is a root of f(x), into the product of an irreducible quadratic and an irreducible cubic. 

However, the idea can be made to work for P = {7}, since there do exist irreducible 
polynomials f(x), g(x) over QO of degree 7 such that g = g,g, factors into the product of an 
irreducible cubic g(x), and an irreducible quartic g,(x) over Q(u) where u is a root of f(x). 

To prove this let G be the simple group of order 168 which may be obtained as the Galois 
group of a suitable irreducible polynomial f(x) over Q of degree 7, with G acting as a doubly 
transitive permutation group on the roots u,,...,u, of f(x). It is known that G can be obtained 
as a doubly transitive permutation group on 7 symbols in essentially one way only, so that with 
the roots of f(x) suitably numbered the 7 quantities 


VU; = Us + Ug + UZ Us, = Uy + U3 + Us 
Uy = U3, + Ug t+ Us Us = Uy + Ug + Us 
U3 =U, + Uy t+ Ue Ug = Uy + U3 + Ue 


Uz =U, + Un + Uy 


are permuted (doubly transitively) amongst themselves by the action of G. The v, are therefore 
the roots of an irreducible polynomial g(x) over Q of degree 7, and since the subgroup of G 
stabilising u, has, acting on the u,, orbits { vs, u.,v7} and { 01, Vy, v3, U, }, it follows that v., ug, v7 
are the roots of an irreducible cubic g(x) and U,, v5, v3, v, are the roots of an irreducible quartic 
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g,(x) over Q(u,). Hence a maximal extension of Q satisfying (*) for P = {7} and containing a 
root of f(x) cannot be isomorphic to a similar extension containing a root of g(x). 

There are also choices of P for which the maximal extension F is unique up to isomorphism: 
obviously this happens if P is the set of all primes, since then F is the algebraic closure of Q, but 
it also happens if P omits just one prime p. To prove this, suppose F, F’ are two maximal 
extensions with isomorphic subfields K, K’, respectively, and let t € F\K so that ¢ will satisfy 
an irreducible polynomial h(x) over K of degree not divisible by p. If s:K— K’ is the 
isomorphism, the conjugate polynomial 4*(x) is irreducible over K’ and must have a root ¢’ in 
F’: for if h* = h,h, --+ h, 1s the factorization of h° into irreducibles over F’, then some h, has 
degree not divisible by p, but if deg(h,) > 1, we could adjoin a root of h, to F’ and get a larger 
field satisfying (*) so contradicting maximality of F’. 

The isomorphism s can therefore be extended to one of K(t) and K(t’), mapping ¢ to ¢’, and 
from this an application of Zorn’s lemma shows that F, F’ are themselves isomorphic. 


The Derivatives of x* 


6392 [1982, 429; 1984, 60; 1984, 372]. Proposed by C. Ward Henson, Bruce Reznick and Lee A. 
Rubel, University of Illinois. 


Is there some x, > 0 such that if f(x)=x*, then f'(x)>0 for all x > x, and all 
n=0,1,2,...? 


Apology. The original published solution by Paul R. Chernoff [1984, 60] was mistakenly judged 
to be incorrect [1984, 372]. The editors wish to apologize to Paul Chernoff and to thank Bertram 
Walsh for pointing out that Chernoff’s solution is valid since the Taylor series of f(x) = x* about 
xX = X, > 0 in fact converges to f(x) in a neighbourhood of xo. 


Stability of Subspaces under Normal Linear Transformations 


6432 [1983, 402]. Proposed by Michael Barr, McGill University. 


Problem 4 on p. 348 of Jacobson’s Basic Algebra I reads, “Call a linear transformation normal 
if it commutes with its adjoint [with respect to a symmetric or alternating, nonsingular bilinear 
form on the vector space V—not its classical adjoint]. ...Show that if U [a subspace of V] is 
stabilized by a normal linear transformation T, then U~ is stabilized by T.” 


(a) Show by example that the assertion 1s false. 

(b) Show that if the form is anisotropic on a finite dimensional space (no nonzero vector is 
orthogonal to itself—this requires the form to be symmetric) and the ground field is perfect, then 
T is diagonalizable over the algebraic closure of the ground field. 

(c)* Is it the case that if U is stabilized by a linear transformation T normal with respect to an 
anisotropic form (over a perfect field, if necessary), then U~ is stabilized by T? 


Solution to part (a) by the proposer. Let the bilinear form be given by (u,v) = u'Av, where A 
is a nonsingular n X n matrix and u and v are n-dimensional column vectors. Take n = 4 and 


0 01 0 1 00 0 
_10 0 0 1 _1|0 1 0 0 

A=!) 9 0 of ?=10 0 01)? 
0 a 0 O 0 0 1 0 


where a is chosen as 1 or —1 according as the example is to be symmetric or alternating. Since 
A~-1=aA, the adjoint T* = aAT'A = aATA so that normality reduces to ATAT = TATA. 
Since T~-1 = T and A~! = aA, this reduces to (AT)* = I which is easily calculated. Thus T is 
normal. If {v,, v5, 03,04} is the standard basis, then (v,)~ = (U1, 02, v4) and T(v,) € (v,) while 
T (01) * = (01, 02,03) F (01)~. 
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Solution to parts (b) and (c)* by Daniel Zelinsky, Northwestern University, Evanston, Illinois. 

(b) According to the usual theory, if the minimum polynomial of 7 is II p* where the p,’s are 
distinct, monic, irreducible polynomials over the ground field F, then V is the direct sum of 
T-stable subspaces 

W,= {v © Vip(T)"(v) = 0} 
and each W, is necessarily stable under any transformation that commutes with T (e.g., under 7%, 
if 7 is normal). We prove that if 7 is normal, then every e, is 1. If not, pick f such that 
e,/2 <f <e,, and write g = p/. Then p* divides g’, so g(T)* acts as 0 on W.. But the minimum 
polynomial of the restriction of T to W, is p?, which does not divide g, so q(T) is not zero on 
W.. The defining property of 7* implies (u, g(T*)v) = (q(T)u, v) for all u,v in V, so, for all v 
in W,, 
(q(T) q(T*)v, q(T*) q(T) 0) = (a(T) 4(T) a(T*) 0, q(T) 0) = 0 
because g(T*)v € W, (since W, is T*-stable) and q(T)’ is zero on W,. By the anisotropic 
property and g(T*)q(T) = q(T) q(T*) (which follows from TT* = T*T), we know q(7T*)q(T)v 
= (0. Now do it again: 
(q(T) v, q(T) v) = (v, 4q(T*) q(T) v) = (v,0) = 0, 

so g(T)v = 0. Hence g(T) is zero on W,, a contradiction. This proves e, = 1. Since the ground 
field F is perfect, it now follows that the minimum polynomial of T factors as a product of 
distinct linear factors over the algebraic closure F of F. Thus T is diagonalizable over F. 


(c)* Let V be the space of 4-tuples with entries in the field F of rational numbers. Define 
linear transformations on V by 


T([x,y,2,t]) =[y,—-x,t, -z] and S([x,y,z,t]) =[y,-x, -t,2]. 

Then ST = TS, but there are 7-stable subspaces that are not S-stable, for example ([x, y, x, y]: 
x, y © F). If we construct an anisotropic inner product on V for which S = 7*, we shall have a 
counterexample to (a) and (c) because U* is not stabilized by T if U is not stabilized by 
T*(= S). 

Define the inner product of [x, y, z,¢] and [x’, y’, 2’, t’] to be 

xx’ — yy’ + 3xy’ + 3yx’ + Tzz’ + Ttt’. 
It is easy to check that S = 7%, that is, (7(u), v) = (u, S(v)) for all 4-tuples u and v. This inner 
product is anisotropic because x? — y* + 6xy + 7z? + 7t* cannot be zero for rational numbers 
x, y,Z,¢t that are not all zero. If it were, then by clearing denominators, the same quadratic form 
would be zero for integers x, y, z,¢ with no common factor. Then 
(x + 3y)° — 10y? = 0(mod7). 
But —10 is not a quadratic residue mod 7, so x + 3y and y are both divisible by 7; hence x and 
y are. Dividing x* — y* + 6xy + 7z? + 7t? by 7, then reading again mod 7, we get 
z* + t* = 0(mod7) 

and, since —1 is not a quadratic residue either, both z and ¢ are divisible by 7. Thus x, y, z, ¢ all 
have a common factor 7, which is a contradiction. 


Parts (a) and (b) were also solved by Edward T. Wong. Note that the counterexample to part (c)* affords another 
counterexample to part (a). 


Avoiding Meromorphic Functions 


6437 [1983, 485]. Proposed by Bhaskar Bagchi, Indian Statistical Institute, Calcutta. 


Given n meromorphic functions f,, f,,...,f, on a planar region U, show that there exists a 
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holomorphic function g on U such that f(z) # g(z) for 1 <j <n and for all z in U. 


Solution for the case n = 1 by the proposer. Given a meromorphic function f on U, we can write 
f =fi/f, where f,, f, are holomorphic with no common zeros. Let A be the set of all zeros of f,. 
For a € A, let n(a) be the order of the zero of f, at a. It suffices to show the existence of a 
nonvanishing holomorphic function h on U such that 


(1) KO (a) =ff’(a) for 0<k <n(a),aeA. 


For then g = (f, — 4)/f, is holomorphic on U and f — g =h/f, is nonvanishing. 
An easy computation shows (since f,(a) # 0 for a € A) that there are complex numbers 
a,(a) for0 < k < n(a), a € A, such that whenever a holomorphic function h, satisfies 


(2) h(a) =a,(a) for 0<k<n(a),a€A, 


then h = exp(h,) satisfies (1). But, by a well-known consequence of the Mittag-Leffler theorem, 
there is a holomorphic function h, on U satisfying (2) (see, for example, Walter Rudin, Real and 
Complex Analysis, Second Edition, McGraw-Hill, p. 327). This completes the solution for the case 
n=l. 


Observations by Lee A. Rubel, University of Illinois, Urbana, Illinois. The assertion is false in 
general. In case U=C, let f,(z) = 0, f,(z) =1, and f,(z) =z. Then if the entire (or even 
meromorphic) function g has a graph that never hits the graphs of f, and f,, g must be a 
constant (by Picard’s Little Theorem) and so g(z))=/;(z)) for suitable z). In the paper, 
Interpolation and unavoidable families of meromorphic functions, by myself and Chung-Chun Yang, 
in the Michigan Math. J., 20 (1973) pp. 289-296 (see Theorem 3, p. 294 ff.), we show that no two 
meromorphic functions f, and f, have the desired property. Essentially the same proof works if 
U is the plane with finitely many punctures. 


Also solved by Michael von Renteln (West Germany). 
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Lars Valerian Ahlfors: Collected Papers, Volume I, 1929-1955, Volume IT, 1954-1979. By Lars 
Ahlfors. Contemporary Mathematicians. Edited by Gian-Carlo Rota. Birkhauser, Boston, 
1983. Vol. I, xix + 520 pp. $65.00 Vol. II, xix + 515 pp. $65.00 


CLIFFORD J. EARLE 
Department of Mathematics, White Hall, Cornell University, Ithaca, NY 14853 


Geometric function theory was born on December 26, 1851, when Riemann presented his 
inaugural dissertation. Today, more than 130 years later, the subject is still flourishing, thanks in 
large part to Lars Ahlfors, this century’s premier geometric function theorist. 

As its name suggests, geometric function theory emphasizes the geometric properties of the 
mappings defined by complex analytic functions. Thus, a one-to-one analytic function is a 
conformal map, and arbitrary analytic functions are, at least locally, branched covering maps. 
From this point of view, for instance, the complex exponential function w = e’ defines a universal 
covering of the punctured w-plane C* = C \ {0} by the complex z-plane. The horizontal lines 
y =2nqmi, n an integer, divide the z-plane into parallel strips, each of which is mapped 
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conformally onto the complement D of the positive real axis in C*. The z-plane can be 
reconstructed from infinitely many copies of D by assembling them so that crossing the positive 
real axis corresponds to moving from one of the parallel strips to one of its neighbors. This point 
of view has far-reaching consequences, leading inevitably to the introduction of Riemann surfaces 
and analytic mappings between them. 

The best way to convey the flavor of modern geometric function theory is to discuss some of 
Ahlfors’ theorems, and the obvious place to start is with the Ahlfors theory of covering surfaces, 
for which he received the Fields medal in 1936. Let f be a nonconstant analytic function defined 
in the entire plane, and let D be any region in the plane. A bounded plane region G such that f 
maps G onto D in an n-to-one manner (counting multiplicities) is called an island over D, and n 
is called its multiplicity. Thus, if G is an island of multiplicity n, the map f: G — D is a branched 
covering of degree n, and if n = 1, then f is a conformal map of G onto D. If f omits some value 
Wy, there can be no islands over any region that contains w,. For instance, if f is the exponential 
function, there are no islands over the unit disk. A more interesting example is f(z) = sin z. This 
function has the property that f’(z) = 0 if f(z) = +1, so there are no islands of multiplicity one 
over the disks {w; |w — 1| < 4} or {w; |w + 1] < 4}. 

We wish to state two theorems of Ahlfors, which show that these examples represent the 
extreme case. The first theorem states that if the regions D, and D, are bounded, simply 
connected, and have disjoint closures, then there is an island over at least one of them. This result 
immediately implies Picard’s theorem that a nonconstant analytic function defined in the entire 
plane can omit at most one value. The second theorem is even more remarkable. It states that if 
we are given three bounded simply connected regions with disjoint closures, there must be an 
island of multiplicity one over at least one of them. The Ahlfors theory deals with more general 
situations, but these two theorems should be sufficient to illustrate its power. 

The use of ideas from differential geometry in geometric function theory is well illustrated by 
the Ahlfors extension of Schwarz’s lemma. The classical Schwarz lemma says that if the analytic 
function f(z) maps the open unit disk U into itself and f(0) = 0, then |f(z)| < |z| for all z in U. 
In 1916 G. Pick observed that the special role of the point z = 0 can be eliminated by using the 
hyperbolic (non-Euclidean) distance 


d(z,w) = tanh"! 


Z—W | 
1 — Zw 
in U. Pick’s version of the Schwarz lemma states that d(f(z,), f(Z>)) < d(z,, Z,) whenever f(z) is 


an analytic function that maps U into itself. Since the hyperbolic distance comes from the 
Riemannian metric ds = (1 — |z|*)~*|dz| on U, Pick’s theorem is equivalent to the inequality 


(2) - fC) P) <= 12?) -, 


valid whenever f: U — U is analytic. Ahlfors interprets that inequality as a statement about 
metrics of negative curvature in the following way. If A(z) is a positive smooth function in some 
region, the (scalar) curvature of the Riemannian metric ds = A(z)|dz| is defined by the formula 


K(z) = —A(z)~“A(logA)(z). 


Here A is the Laplacian. The Ahlfors extension of Schwarz’s lemma states that if A(z) is a 
nonnegative function in U such that K(z) < —4 whenever A(z) > 0, then A(z) < (1 — |z|*)7’ 
everywhere in U. The proof is astonishingly elementary. Pick’s inequality follows at once, since 


Mz) =f(Z)(1 - If) P) 


satisfies K(z) = —4 when A(z) > 0. In other applications the hard step is usually to construct an 
appropriate metric A(z)|dz|. Ahlfors makes this task somewhat easier by reformulating the 
condition K(z) < —4 in a way that does not require A(z) to be smooth, but such technical 
refinements need not concern us here. 
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The fact that these illustrative examples come from the mid-1930’s should not lead the reader 
to conclude that geometric function theory has stood still since then. On the contrary, the 1950’s 
and 1960’s saw rapid development in the theory of quasiconformal mappings and their application 
to the study of Teichmiiller spaces. Ahlfors and Bers were the leaders of this activity, and many of 
the papers collected here deal with these subjects. 

In 1964 Ahlfors published a paper about finitely generated Kleinian groups that brought this 
subject back to life and initiated a growing body of important research. Consider a finitely 
generated group G of Mobius transformations. Let D be the largest open subset of the Riemann 
sphere in which G acts properly discontinuously, and assume D is not empty. Then the quotient 
map D — D/G 1s a branched covering, and every connected component of D/G is a Riemann 
surface. The Ahlfors finiteness theorem asserts that D/G has only finitely many branch points 
and connected components and that each connected component is either compact or the 
complement of a finite subset of a compact Riemann surface. With the help of that deep theorem 
much can be said about the geometry of finitely generated Kleinian groups. More recently, 
Thurston’s theory of 3-manifolds has made Kleinian groups even more important. Much of 
Ahlfors’ recent work has dealt with extending to the n-dimensional case the analytic tools that 
have been so successful in studying these groups in two dimensions. The very last paper in these 
volumes deals with the proof of an important theorem of Dennis Sullivan about ergodic properties 
of n-dimensional Kleinian groups. It is clear that geometric function theory is still very much 
alive, and that the papers collected here are central to its modern development. 

One unexpected feature of these volumes calls for comment. The editors had the brilliant idea 
of asking Ahlfors to provide his own running commentary on these collected papers. His 
comments are always enlightening, and the reader who wants to learn more about geometric 
function theory would do well to start there. It goes without saying that those who already know 
and love the subject will find these volumes a treasured and indispensable addition to their 
libraries. 


Political and Related Models. Edited by Steven J. Brams, William F. Lucas, and Philip D. Straffin, 
Jr. Modules in Applied Mathematics, Volume 2. Springer-Verlag, New York, 1983. xx + 396 


PP. 


JOSEPH MALKEVITCH 
Department of Mathematics, York College, Jamaica, NY 11451 


The power of mathematics to solve problems in the physical and biological sciences and in 
engineering is widely known outside the mathematics community. Unfortunately, however, the 
mathematical techniques used to solve these problems lie beyond what can be easily described to 
laymen and nonadvanced students. The standard tools for applications in these areas often 
involve ordinary and partial differential equations, complex variables, Fourier analysis and special 
functions. This situation is unfortunate if one believes that it is valuable to teach applications to 
encourage interest and participation in mathematics by a broader group of people. However, in 
the years since World War II there has been tremendous growth in using mathematics to solve 
problems in nontraditional areas such as political science, management science, computer science 
and psychology. This is noteworthy not merely because persons outside the mathematics commun- 
ity (and to some extent within it) are not aware of these applications, but also because the 
methods in these applications usually involve discrete mathematics. This makes it possible to 
teach significant mathematics, motivating the study by novel applications, to students with 
minimal mathematical preparation. Furthermore, the mathematics these applications lead to is 
elegant, important, often deep, and gives hope of bringing full circle the delicate interplay of 
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applications-oriented and pure mathematical developments that are so much a part of the history 
of mathematics. We hope the books in this series, especially Volume 2, will make more people 
aware of these exciting developments. 

Volume 2 consists of fourteen articles written by people who generally have reputations both as 
researchers and expositors. The volume focuses primarily on models in political science, with a 
smattering of other subjects also being treated. The lead article, “The Process of Applied 
Mathematics” by Maynard Thompson, gives a brief, useful, albeit somewhat “standard” discus- 
sion of the modeling process. The article centers on two examples, one from physics and one from 
psychology, which is unfortunate since the flavor of the applications in political science is rather 
different. 

The strength of this book lies in the material in: Chapter 2 (“Proportional Representation” by 
Edward Bolger), Chapter 3 (“Comparison Voting” by Steven J. Brams), Chapter 4 (“Modeling 
Coalitional Values” by William F. Lucas and Louis J. Billera), Chapter 9 (“Measuring Power in 
Weighted Voting Systems” by William F. Lucas), Chapter 10 (“To the (Minimal Winning) Victors 
Go the (Equally Divided) Spoils: A New Power Index for Simple n-Person Games” by John 
Deegan, Jr., and Edward W. Packel), Chapter 11 (“Power Indices in Politics’ by Philip D. 
Siraffin, Jr.), Chapter 12 (“Committee Decision Making” by Peter Rice), and Chapter 14 (“The 
Apportionment Problem” by William F. Lucas). These articles, augmented by Chapter 5 (“Urban 
Wastewater Management Planning” by James P. Heaney), cover a fairly broad range of applica- 
tions to political science and deal with a rich collection of ideas. The nonpolitical-science articles 
round out the volume, and though the best of them (for example, Chapter 8, “How to Ask 
Sensitive Questions Without Getting Punched in the Nose’ by John C. Maceli) contribute 
interesting ideas, the tend to dilute the “unity” of the rest of the book. The “unity problem” was 
certainly an inevitable consequence of the fact that nearly all the articles which make up the four 
books in the series arose from two writing projects sponsored by the Mathematical Association of 
America with support in part from the National Science Foundation. Considering that no attempt 
was made to coordinate what was originally written, the modules go together remarkably well. 

Since the book is aimed at teachers and students, nearly all the chapters have exercises, “notes 
for the instructor,” and bibliographies. The exercises are very well chosen. There are some 
mechanical problems, but there is particular emphasis on more open ended problems and projects. 
The union of the references in these papers is very fine and will enable one to explore in many 
directions. (References are given to both general works and research papers.) The notes to the 
instructors are of a rather gereral kind and are unlikely to be of great value. More valuable would 
have been an index which, unhappily, none of the four volumes has. 

Minor quibbles aside, these papers constitute a fresh, well-written, fascinating account of how 
mathematics is being used to obtain exciting, useful results in political science and other areas. 
This book deserves to be in the personal library of all mathematicians looking for new teaching 
materials and intellectual stimulation. 


Chess Skill in Man and Machines. Peter W. Frey (Editor). Second Edition. Texts and Monographs 
in Computer Science, edited by David Gates. Springer-Verlag, New York, 1983. xiv + 329 pp. 


HANS BERLINER 
Computer Science Department, Carnegie-Mellon University, Pittsburgh, PA 15213 


Before saying anything else, it should be noted that this is by far the best book on computer 
chess. It is the result of a seminar series held at Northwestern University in 1974. This series 
brought together a number of technical people who were actually developing computer chess 
programs, and others who were either good chess players or had studied the psychology of chess 
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playing. Among the former were the outstanding duo at Northwestern, David Slate and Larry 
Atkin, whose programs dominated computer chess competition during the 1970’s. Their account 
of the workings and development of their program is the cornerstone of this volume. Their chapter 
is a very detailed exposé of the innards of the best machine at the time, and how it got to be that 
way. Enough detail is provided to allow a programmer to build a program to mimic large parts of 
CHESS 4.5 (as the Northwestern program was then called), and this article has, no doubt, been 
used amply in the construction of the many microprocessor programs that are commercially 
available now. 

There are two other parts of the first edition that I also like very much. One is a chapter by 
Neil Charness on the psychology of chess and the experimental evidence that supports this. This is 
also far and away the best article of its kind. The other chapter that I liked was one by the volume 
editor, Peter Frey, on how to understand the workings of a chess program. This is a patient and 
thorough treatment that is easy to understand, and one from which the interested layman can 
learn a lot. This core of three fine papers was supplemented by others that were either not so fine, 
or which had less lofty aims, or had been published in essentially the same form elsewhere. 
However, these did add a number of aspects to the whole, including a treatment of the history of 
computer chess tournaments up to 1975, two fine appendices of games of CHESS 4.5 and 4.6, and 
an overview of some approaches that did not work out. In any case, the three excellent papers 
were clearly worth the price of the book. 

The second edition contains all of the first plus two new chapters and a large new appendix of 
computer games. One new chapter is Joe Condon and Ken Thompson’s account of how Belle, a 
once mediocre program, was transformed into the most powerful chess machine of today with the 
relentless refinement of parts of the program by the addition of special purpose hardware to do 
the computations faster and in most cases in parallel. This special purpose parallelization has two 
aims. It allows machines to 


(1) Avoid the cost of “general purpose’ computation in favor of computations that do the 
minimal necessary amount of work, and 

(2) Where possible, have several machines operating in parallel on different aspects of some 
task that must be performed, thus avoiding the standard serial costs of computation. 


I found this chapter very worthwhile, both technically and as an overview of what must be done to 
achieve the levels that Belle has reached. 

The other new chapter is by David Wilkins and discusses his program that uses plans to guide 
the search. This is very interesting work which unfortunately cannot be judged at this time, since it 
was a doctoral dissertation subject which showed promise but has not been pushed further. 

Many excellent computer games have now been played. A good sample of these is in the new 
appendix and offers insights into what the standard type of program, searching much deeper than 
at the time the volume first came out, can now do and not do. All these games are annotated, 
some by more than one commentator. For some games, computer printout data is also provided. 
All this makes them very useful both to interested chess players and computer chess afficionados. 

This is an overview of what worthwhile there is to be found. However, it is interesting to 
examine this book from another perspective: that of the practitioner versus the theorist. From this 
point of view the theorist (meaning the academic) comes off very badly. 

Since people in glass houses shouldn’t throw stones, let me start by explaining that I 
participated in the original symposium, but had already committed my contribution to another 
volume before the idea for this book was broached. I was clearly among the theorists, though I 
had built one moderately good chess program (for its time). My paper was one that had a number 
of “promising” ideas, which were duly exposed, and the virtues of which were extolled. 
Unfortunately, the program that they were embodied in never played well, though I still cherish 
some of the ideas and believe they will eventually find their place in the scheme of things. 

This general tone, of the academic who must praise his research while either not having the 
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time or being otherwise unwilling to expose it to the cauldron of test by trial, permeates a number 
of papers in this volume. This is in contrast with the papers of Slate and Atkin, and Condon and 
Thompson, both the products of superachievers who consistently downplay their accomplish- 
ments. Computer chess is a hard-nosed arena in which to test your skill. Clever ideas are no good 
unless they can be brought into a well-rounded vehicle that will perform correctly a high 
percentage of the time. Cleverness fades into mediocrity without this. 

Somehow, I see something tragicomic in this. Is it necessary for an academic to overpresent his 
ideas in order to have people pay attention? Can the academic simply not afford the time to test 
his product, or is he afraid that such tests will show him to be a rather mediocre pretender to 
excellence? These appear to be important questions. 

Some highly regarded academics whom I know would say that the good work will survive and 
the chaff will be forgotten. That is certainly a humane and balanced notion. I yearn for the time, 
however, when it will be possible for an academic to portray the weaknesses of his own work as 
clearly as the strengths. 


Applied Statistics: A Handbook of Techniques. By Lother Sachs. Springer Series in Statistics. 
Translated by Zenon Reynarowych. Springer-Verlag, New York, 1982, xvi + 706 pp. 


GREG CONSTANTINE 
Department of Mathematics, Indiana University, Bloomington, IN 47405 


You may find this a bit unusual, but the book by L. Sachs contains no proofs; it describes, just 
as its title suggests, a handful of techniques. It provides us with recipes for performing statistical 
tests. Should such books be written? Are they of any use to anyone? I can’t answer either question 
with just yes or no. 

A book of recipes... That does not entail much intellectual challenge—it’s bound to be dull, 
and dull it is. But then one of the best meals I ever had was prepared by a friend of mine just the 
other day. Alas, no gourmet cook is my friend. Yet, a veritable gourmet piece was created (believe 
me, I can tell) by faithfully following a recipe from a cookbook, and on a first try. 

Should we really present mathematics in the form of recipes? It depends on the audience. 
Think of Pdélya’s Enumeration Theorem. It counts the number of patterns of a certain weight 
which arise on a set under the action of a group, such as the number of ways one can place 16 
checkers on an ordinary chessboard—two arrangements being the same if one can be obtained 
from the other by a rotation of the chessboard. The proof of this theorem entails a knowledge of 
group theory, generating functions, and a great deal of counting. It is enjoyable for a mathemati- 
cian to go over it in detail. As is well known, the theorem also has a great number of practical 
uses. For example, a chemist might on occasion want to use it. Should we then require him to 
spend lots of time beforehand in order to understand all the details of the proof? Instead, we 
could just present the theorem to him in the form of a clear recipe. Something (roughly) like this: 
Ingredients: Set S and group G. (1) Find the cycle structure of G as a permutation group on set S 
(then throw group in garbage can). (2) Obtain from (1) the polynomial P, (x,,...,x,,) called the 
cycle index. (3) Substitute the number ©, (w(r))' for x, in P;—and out pops the number of 
nonequivalent patterns under G using the weight function w (which is what the chemist wants). It 
works: don’t knock it till you try it. 

It is much faster for a chemist to learn how to implement a recipe (such as the one above) than 
to master the details of a proof. And statistics often lends itself to results that can efficiently be 
formulated as recipes. Without a doubt there is great need for the use of statistics in business, 
economics, engineering, medical and social sciences, and other fields. This is the audience to 
which Sachs’s book is addressed. These professionals are not statisticians; 1t seems appropriate 
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then to evaluate the book from their point of view. 

As such, it is a book in which the statements are clear, unambiguous, and often illustrated with 
examples. It describes statistical concepts with great care, emphasizing aspects which are often 
overlooked or misinterpreted by practitioners. And (although it invites to some extent comparison 
with a cookbook) it offers useful insights and a great body of statistical results that are of much 
help to many people. The chapters on experimental design, however, could be expanded 
substantially and presented in a more unified way using matrix notation. The book is excellent for 
the audience intended. It is, however, not a book for mathematicians or statisticians. The author 
himself points this out in the introduction. Let me put it this way: if the null hypothesis is that the 
book is good, the sample I got gives results that are not significant. 

I believe that books such as this one serve the useful purpose of making many statistical results 
accessible to researchers in other fields. It seems the answer to both questions that I initially raised 
is yes. 


Problems in Analysis. By Bernard Gelbaum. Problems Books in Mathematics. Series Editor P. R. 
Halmos. Springer-Verlag, New York, 1982. vii + 228 pp. 


HENRY HELSON 
Department of Mathematics, University of California, Berkeley, CA 94720 


This is the third in Springer’s series of Problem Books in Mathematics, edited by P. R. Halmos, 
whose own book of problems about Hilbert space perhaps started a new trend. What are books of 
problems for? Why do publishers print them? 

Books of problems are not alike. One Hundred Problems in Elementary Mathematics by H. 
Steinhaus (Pergamon Press, 1963) contains the wit and wisdom of a distinguished mathematical 
personality. The Olympiad collections from the Soviet Union have the specific purpose of 
identifying talent in high school students. These are two examples of problem books that are not 
closely related to the one under review. 

In the case of Halmos’ own book the answer to the questions above is not hard: the book is an 
authoritative account of a field of work that is currently expanding rapidly. It is useful to anyone 
interested in the field, from (almost) beginning graduate student to researcher. It is not only a list 
of problems; Halmos tells what the subject is about as he goes along, putting results in perspective 
and suggesting research directions. The problem format is a device that gives the author more 
choice as to the topics covered, and the way in which they are covered, than he would have under 
the constraints of an ordinary style of exposition. 

The book under review is not Springer’s first volume of problems in analysis. Its great 
predecessor is the Aufgaben und Lehrsatze aus der Analysis by Polya and Szeg6, 1925. That is a 
collection of problems and theorems, as the title says, and the book is still a marvelous way to 
study the subjects that it presents. The authors, in their preface, envisage its use as a guide for 
self-study. There is certainly some similarity between Aufgaben and the book by Halmos. 

The book by Polya and Szegé can be used in at least one very different way, however. Its 
subject matter begins with the elementary properties of complex numbers and infinite series, the 
stuff of which all mathematics was and is made. Many of the problems are excellent material for 
testing the ingenuity of advanced undergraduate and beginning graduate students, for informing 
ambitious but untried minds about themselves, for helping teachers decide who has a chance to be 
a mathematician. This function is particularly valuable nowadays when even mathematics majors 
at junior level may never have been asked to solve a problem that was not attached to a section in 
the text where the method of solution was expounded. Somewhere, some time, a serious student 
has to face the great frightening world where textbooks end. A good book of problems, like Polya 
and Szeg6, or Steinhaus or the Olympiad, can help get ready for that moment. 
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Gelbaum’s new book is different from any of these. It contains 518 problems, all stated very 
briefly, with the help of elaborate notation that is clearly explained at the beginning of each 
section. There is a glossary of symbols, and an Index/Glossary that contains clear, concise but 
accurate statements of a great number of important definitions and theorems. Each problem is 
solved, more or less completely; the solutions occupy most of the book. Many of the problems are 
hard (and not only the elaborate counterexamples we are asked to find!). Quite a few look like 
fun, but that does not seem to have been the criterion for inclusion. On the contrary, the book is 
decidedly serious. This is for students who want to learn analysis. 

Analysis here means what is taught in the usual graduate course on Real Variables. From that 
core the problems radiate into probability theory, ergodic theory, harmonic analysis and func- 
tional analysis. It is mostly analysis of the abstract kind; we find little about complex variables or 
differential equations. Banach spaces, measure theory, Lebesgue spaces (both abstract and on the 
line) are richly covered. There is no discourse, no advice, and surprisingly little reference from one 
problem to others. This is a problem book with a vengeance. 

The author does not tell how the book can be used, so the reviewer must venture an opinion. 
Teachers will find an inexhaustible supply of prelim problems, and problems to assign in the 
course on Real Variables. That course can undoubtedly be enriched by including problems outside 
the syllabus proper. Students will have a splendid way to prepare for those prelims, and if they 
want to they can learn a lot of mathematics at the same time. The Index /Glossary is a fine way to 
obtain information quickly. A lecturer who does not want to follow a text closely could very well 
use this book in place of a text, assigning problems from it but also referring to statements and 
proofs of theorems. All these suggested uses are closely connected to a course on analysis. For 
these purposes the book is beautifully written. It is careful, very condensed but clear, and contains 
a great deal of fundamental material. 

This is enough to ask of a book with this title, but it is not all we could hope for. Analysis has 
got very large, and yet today constitutes a smaller fraction of mathematics than it did a century 
ago. Many of our best mathematics students find analysis tedious, and who can blame them after 
the calculus courses we offer them? If they should have the sense to ask what analysis is, their 
instructors would be hard put to tell them. It is not that there is just one answer to such a 
question; but there should be at least one. In 1925 Polya and Szeg6 provided their answer, and we 
need a modern statement of comparable conviction. 

What is missing in Gelbaum’s book is any intimation of what he considers important or 
beautiful in analysis. We can see that measure theory occupies a great deal of space, but we are 
not helped to know why. There are no sequences of problems leading to an area of current 
research. Many problems are well-known theorems, of course, but they are not grouped into a 
theory. At the end, as at the beginning, we do not know what the author thinks analysis is. 

We welcome this admirable addition to the pedagogical literature, but fear that its use will be 
mainly in close connection with course work. Perhaps after all that is what the author intends. 


LETTERS TO THE EDITOR 


Material for this department should be prepared exactly the same way as submitted manuscripts (see the inside 
front cover) and sent to Professor P. R. Halmos, Department of Mathematics, University of Santa Clara, Santa 
Clara, CA 95053. 


Editor: 


In my note [1] I presented a greatly shortened and—I hope—clarified version of the standard 
theorem on convergence of Fourier series whose usual textbook presentation is modeled directly 
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on the work of Dirichlet. I want to point out that the proof in [1] is the series analogue of a 
corresponding argument of Ian Richards [2] for Fourier integrals. Anyone interested in basic 
Fourier analysis will enjoy reading Richards’ elegant little paper. 


References 


1. P. R. Chernoff, Pointwise convergence of Fourier series, this MONTHLY, 87 (1980) 399-400. 
2. I. Richards, On the Fourier inversion theorem for R', Proc. Amer. Math. Soc., 19 (1968) 145. 


Paul R. Chernoff 
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Editor: 


Minassian (this MONTHLY, August-September 1984, p. 456) asks why referees should know the 
identity of authors but not vice versa. He is far from the first to ask this question. Some years ago 
it was asked so insistently that the American Mathematical Society instituted a policy of “blind 
refereeing’. However, that policy has been abandoned, presumably because it was not found to 
help authors but did hinder referees. 

The disadvantages of blind refereeing are that, first, it is often difficult for authors to hide their 
identities (they may, for instance, refer to their own published work). Second, it is not easy to find 
appropriate referees who are willing to donate their time; consequently any information that will 
help the referee is useful. (For instance, one is less cautious when generally careful A says 
“obvious” or “trivial” than when careless B says the same thing.) Third, in spite of what 
Minassian seems to think, open refereeing can in fact work to the disadvantage of the established 
author at the prestigious address. We have seen referees’ reports to the effect that “this would be 
acceptable from a beginner but one expects better from so-and-so.” 

There are fairly obvious reasons why authors should not know the names of referees. 
Anonymity of referees avoids embarrassment; and few people are willing to act as referees if the 
authors are going to know who they are. Would Professor Minassian care to have to tell a 
personal friend, who did not show him a paper voluntarily, that he thinks it is a bad paper? 

We have, between us, handled (as editors) a few thousand manuscripts, and neither of us can 
remember a referee’s report that even hinted at rejection for nonmathematical reasons. Relatively 
few papers are rejected for actual errors; some are rejected for being badly written, or for having 
been anticipated, or for having been submitted to the wrong journal; most are rejected for not 
being sufficiently interesting. “Interesting” is a subjective term, and the only operational defini- 
tion is that a paper is interesting if a competent referee says it is interesting. This does not mean a 
referee cannot support a judgment that a manuscript is of “insufficient interest’, and referees 
ought to (and usually) do this. On the other hand, we believe the primary responsibility for 
demonstrating the value of a paper is the author’s. The introduction or first paragraph of a 
manuscript is the appropriate place to explain why someone else ought to care. 

Blind refereeing is almost meaningless if the editor knows who the author is. After all, the 
editor’s values are often reflected in the choice of referee or in the phrasing of the cover letter to 
the referee. This is acceptable behavior, for editors are chosen because they are thought to have a 
certain competence, and it would be stupid to forget one’s values at moments of decision. (Also 
one should not forget that editors are themselves authors!) The current refereeing system, being a 
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human endeavor, is not perfect. But we believe authors are fairly treated in the vast majority of 
cases. 

Authors who are afraid of prejudice can rebut the reasons for rejection, or they can ask for a 
second referee. They can ask to have their papers refereed with the author’s name omitted; most 
editors will honor such a request. Authors have been known to request not to have a particular 
referee; an editor would certainly take the hint. 

R. P. Boas 
Department of Mathematics 


Northwestern University 
Evanston, IL 60201 


J. J. Rotman 

Department of Mathematics 
University of Ilinois 
Urbana, IL 61801 


Editor: 


In the paper “Summing power series with polynomial coefficients” (this MONTHLY, 90 (1983) 
284—285), John Klippert proves the following interesting formula: 


f(x) = X aha(o)x*/( — x), 


where 


(o@) 


P 

f(x) = Li a(n)x", a(n) = Le nt, 
n=0 k=0 

and A is the well-known operator for finite differences. The formula is then illustrated by taking 

a(n) = n* + n + 1, which yields 


f(x) = (4 n 41x" =2x°A1 — x) + 2x/(1— x)? +1/(1 — x). 


n=0 


Klippert’s acceleration technique has the virtue of being accessible to a college sophomore, but it’s 
worth noting that the same result can be obtained more easily from the well-known Taylor-like 
formula: 


P 
a(n) = Y, [A‘a(0)/k!] nk, where n° = 1, nk#1 = n(n — 1)*. 
k=0 


In other words, {n*}, -y is a normal family of polynomials whose formal derivative is given by 
A. Hence, 


> y- [ Ata(0) /k1] nk x” 


n=0 k=0 


P 00 
» [ A‘a(0) /k!] x* y next k 
k=0 


n=0 


f(x) 
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and the formula is now proved. 
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© [A'a(0) et] xK(1/(1 — x)) 


An alternate simple proof may be given using the well-known Stirling numbers of the second 


kind. 


The method above allows direct solutions for some practical cases, such as for the example 


mentioned above: 


Y) (n? +n +1) x" 
n=0 


149. 


x [n(n —1) +2n+ 1] x” 
yx’ 


[o.@) [o.@) 
x* Von(n—1)x" 2 + 2x nx" t+ 
n=0 n=0 n=0 


2x*/(1— x) +2x/(1—x) +1/(1— x). 


Radu Nicolescu 
Computing Centre 
University of Bucharest 
125 Stefan Furtuna Street 
Bucharest, B 77116 
Romania 
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Is eager to explain 

Why mathematics is dreary 
Without category theory. 


Mark Kac 

Had lots 

Of facility 

With probability. 


If Max Zorn 

Had never been born 
We'd be in a dilemma: 
What to call his lemma? 


—R. P. Boas 
Department of Mathematics 
Northwestern University 
Evanston, IL 60201 
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RUBIK’S REVENGE: THE GROUP THEORETICAL SOLUTION 


MOGENS ESROM LARSEN 
Kobenhavuns Universitets Matematiske Institut, DK-2100 Kobenhavn a, Denmark 


Introduction. Recently Rubik and Ideal Toy have supplied us with an analog to Rubik’s Cube 
with four slices instead of three. It is sold under the name of Rubik’s Revenge; see Fig. 1. 

The advantage is that no solution is offered besides the toy. This is however a disadvantage too, 
because the “Revenge” is even more difficult to solve without mathematics than the Cube and 
might hence disappoint a few puzzlers. I shall return to these difficulties at the end. 

A general mathematician might think that this is just some more of the same. But such an 
abstract attitude provides no solution to the problem: fix this mess! And group theorists are used 
to consider groups individually; non-isomorphic groups are not much “alike’’. 


Fic. 1. Rubik’s Revenge. 


Terminology. In analogy with the Cube, see Singmaster [4], we label the six faces of the 
Revenge Front, Back, Up, Down, Right, Left, and we abbreviate these designations to the first 
letter; see Fig. 2. 


B 


D 
Fic. 2. Labels of faces. 


Mogens E. Larsen: I am 42 years old, lektor (latin for “associate professor”) at the department of mathematics of 
the University of Copenhagen, where I teach numerical analysis, applied mathematics, and occasionally complex 
variables, which last subject I studied at M.LT. in the year 1969-70. Fond of games and toys I founded the Go Club 
of Copenhagen in 1972, and I was among the first Rubik addicts, when professor Robert Fossum from Urbana left 
us with a cube in the tea room of the department in the summer of 1980. 
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face-turn 


FIG. 3 


slice-turn 


Fic. 4 


A move of one of the six faces is denoted with the capital letter beginning the name of that face 
followed by a +, — or 2. The “+” stands for turning the face 90° clockwise (seen from the 
outside), “—” for turning it 90° the other way and “2” for turning 180° (any way). Fig. 3 shows 
an example. This is the same as for the Cube. 

For the Revenge there is also the possibility of turning the slice next to the face. We shall 
denote the turn of an interior slice by the gothic capital letter of the adjacent face, together with a 
+, — or2 having the same meaning as above. Fig. 4 shows an example. 

An operation is denoted in the order in which the turns are supposed to be done. For example 

Frou 
means: “Turn the front face with the clock, then turn the left face against the clock and last turn 
the middle upper slice against the clock.” 

Finally we shall call the small cube-bricks of which the big cube is built cubinos. These are 
divided in three classes known respectively as corner-cubinos (8 of them), edge-cubinos (24 of 
them) and center-cubinos (24 of them) as shown on Fig. 5. 

The groups of possible operations are denoted by large capitals, while abstract groups are 
denoted by gothic capitals. 


The groups. The operations form a group in a natural way. The product of two operations is 
merely doing these two operations one after the other in the order of writing: 


(FY L-U-)(R*D?) = FLW RtD?. 
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This product is obviously associative. The identity is the operation E of doing “no turns”. 
We introduce the rules of reductions. (Symbol “F’” may be replaced by any of the other 12 
basic symbols.) 


FtF-=E 
F-Ft=E 
Ft Ft = F? 
F- F-= F? 
F*F* = E. 


For convenience we shall also write for the inverse 


(FYL-UW-) =UWtLtr. 


Ma a LL 
Sey 
sa 


oat 
Fic. 5. Cubinos. Fic. 6. Labels of cubino-faces. 


a La 
ff __/ 


X_N 


edge 


\_\ 
LA XN 


center 


Note that the inverse is obtained by putting the symbols in reverse order and changing + — to 
— +. This may easily be checked by reducing as in the following example. 


(FTL-U-)\UtLtF-)=E 
Let us denote the group of operations by &. This group is infinite and generated by the twelve 
elements of order 4: 
Ft Bt Ut Dt Rt Lt ot Bt Wer Ht Rt Qt 
Let us label the 96 cubino-faces with numbers from 1 to 96; see Fig. 6. (It does not matter how 
they are labeled.) 
Each operation corresponds to a permutation of these 96 figures, obtained by applying the 


operation to the labeled Revenge. Now, the product of two operations corresponds to the product 
of the corresponding permutations. So this correspondence is a homomorphism 


pp: G > Gog, 
where ©, is the symmetric group of permutations of n elements (see [1], p. 54; [3], p. 30). 


The problem is to get a hold of ». For example, given 7 € Go,, which represents a required 
pattern on the Revenge, find an operation X € © with 


= 9(X), 
means a construction of the required pattern from the solved state. Of course 7 must be in the 
range of y, else X does not exist. Let 


R = 9(G) C Sg. 
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Then R is the group of permutations obtainable by the possible operations. The structure of R can 
be investigated in several ways. 

One of the problems is to find the order of R (which is finite and less than 96! = 101°°). 

Another problem is to write down in a convenient way an operation from pm }(7) for any given 
a ER. 

Let « be the identity permutation which fixes everything, and 


E=p (ee) CG 
the kernel of gm. Then € is the group of operations leaving all cubino-faces unmoved. Thus we 
have a factor group 
R=6/E. 


It is not possible by any operation to turn any cubino-face around itself on the place where it 
is, because both types of operations, F" and %*, preserve orientation, that is, the corner nearest 
the center remains nearest the center; see Fig. 7. 


, 
f 


Fic. 7. Preservation of cubino-face orientation. 


But it is possible to permute the four equally coloured center-cubinos among themselves. This 
means that the order of R is bigger than the number of visible patterns. Therefore, let C, denote 
the subgroup of R permuting each of the six classes of four equal center-cubinos among 
themselves (i.e., C) consists of the invisible operations). This group C, is not normal in R, but 
nevertheless, the number of patterns of the Revenge is the number of cosets to C, in R. (This is to 
say, two patterns in R look the same, if one is obtained from the other by permutation of some 
equal center-cubinos of like color.) 


The structure of the group R. Each operation creates a permutation of the 56 cubinos and 
sometimes a turn of some of them too. But is it not possible to move any cubino to an arbitrary 
prescribed position. 

The 8 corner-cubinos are permuted among themselves and if possible turned individually. 

The 24 edge-cubinos are permuted among themselves and if possible flipped individually (we 
shall see that this flip is actually impossible). 

The 24 center-cubinos are permuted among themselves, but as explained above, they cannot be 
rotated individually. 

This means that each operation creates three simultaneous permutations of the three kinds 
mentioned above together with some turns and flippings. We shall now examine each of these 
kinds in more detail. 


The corner-cubinos 


First we consider the corners ignoring edges and centers. (We can imagine the Revenge with 
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edges and centers painted all black.) Then the most we can hope to do with the corner-cubinos is 
to obtain any permutation of them from ©, together with any 8-tuple of turns from 3, (the 
cyclic group of order 3). This means a total of 3°8! patterns. 

Group-theoretically we consider these patterns as forming a group of permutations, the 
so-called wreath product of 3; by Sg, written 


831 Gs, 
(see [1], p. 81; [4], p. 59). 
Intuitively the group 8; 1 GS, contains a subgroup leaving all corner-cubinos on their places, 
but turning them individually. This subgroup is normal and isomorphic to 33. The corresponding 


factorgroup consists of the permutations of the 8 corner-cubinos ignoring their possible turns, that 
1S 


(33 \ Gs)/83 = Gg. 


The possible operations create a subgroup § of 8, 1 Gg. This is the same as it is for the 
standard Cube, because the corner-cubinos are not affected by the slice-turns. From the analysis 
of the Cube (see [2], p. 50; [4], p. 17) we know that § has index 3 in 8, 1 Gg. Indeed, the sum of 
all turns of corner-cubinos must be divisible by 3. 

Next we consider the subgroup H of R consisting of those permutations obtainable by 
operations fixing edge-cubinos and center-cubinos. The group H is isomorphic to a subgroup of 
§. The group H is as above the same for the Revenge as for the Cube and hence of index 2 in §. 
It contains the subgroup of turns without permutations, T, 


T = 83, 


and the corresponding factorgroup is isomorphic to %,, the alternating group of even permuta- 
tions of 8 elements (see [1], p. 59; [3], p. 32), 


H/T = %,. 


Hence the order of H is 3’ - 4 - 8! 

The fact that H has index 2 in § means that we can only create an odd permutation of 
corner-cubinos at the expense of some odd permutation of either edge-cubinos or center-cubinos. 
We shall return to this problem in the following section. 


The center-cubinos 


First we consider the center-cubinos ignoring corners and edges. Then the most we can hope to 
do with the center-cubinos is to obtain any permutation of them from ©.,,, because they cannot 
be rotated, as already mentioned. 

The possible operations create a subgroup © of ©.,. We shall compute © below. 

Next we consider the subgroup C of R consisting of those permutations obtainable by 
operations fixing edge-cubinos and corner-cubinos. The group C is isomorphic to a subgroup of 
©, let us write 


Cc€cG,,. 
It turns out that C is isomorphic to the alternating group of even permutations of 24 elements, 
and that © is isomorphic to the symmetric group itself. 


Before computing these groups we shall recall a couple of notations from group theory. 
The commutator of two group elements X and Y is defined as 


( commutator ) [X,Y] = XYX"Y~ 
(see [1], p. 138; [2], p. 33; [3], p. 48; [4], p. 17). 


REMARK 1. Two group elements X and Y commute exactly when the commutator of them is 
the neutral element, [X,Y] = E. 
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REMARK 2. The inverse of a commutator is the “reverse” 
[X,Y] =[Y, x]. 

The conjugate of a group element X under the group element Y is 
( conjugate ) YXY-, 

(see [1], p. 13; [2], p. 31; [4], p. 13). 

REMARK 3. On the Cube and the Revenge we consider the conjugate of a “nice” operation X 
as follows. The operation X is for example a cycle of 3 cubinos. The operation Y moves 3 cubinos 
which we want to cycle to the locations where X operate. The irrelevant mess Y might create is 
cleared up by Y at the same time as the 3 wanted cubinos are returned to the original 3 places, 


but cycled. (This fact is reckoned in mathematics as the theorem that conjugate permutations have 
the same structure of cycles; see [1], p. 54.) 


REMARK 4. It is fairly easy to see that there is a possible permutation in R which moves 3 
distinct center-cubinos to any stipulated locations (no assumption is made about other cubinos). 
The same is independently true about edge-cubinos. 

We shall now return to the computation of the center-groups. 


THEOREM 1. C = Y,, and © = © 4,4. 


Proof. The group C contains the 3-cycle (1), see Fig. 8. 
(1) [sO] U]ec 


Fic. 8. A 3-cycle of center-cubinos. 


It is obvious from Remarks 3 and 4 that we can obtain any 3-cycle of center-cubinos by 
conjugation of this one. Hence C contains all the 3-cycles of center-cubinos. 

Any alternating group is generated by the set of 3-cycles (see [1], p. 61; [3], p. 33). Hence we 
must have the inclusion 


%,,cCCCCCG,,. 


Now, Ft e€ © is a cycle of length 4 and hence an odd permutation. So © is greater than U4, 
and we must have 


C=S,,. 


On the other hand, F*€R consists of 4 cycles of length 4 and hence is even, see Fig. 9. 
Indeed, it contains one 4-cycle of corner-cubinos, one 4-cycle of center-cubinos and two 4-cycles 
of edge-cubinos. Hence it can only give an odd permutation of center-cubinos simultaneously with 
an odd permutation of corner-cubinos. So, when corner-cubinos are kept fixed (the even 
permutation of nothing), then the center-cubinos must be permuted by an even permutation. 
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Fic. 9. The 4 4-cycles of F*. Fic. 10. Preservation of orientation. 


And %* € R consists of 3 cycles of length 4, so this is odd. But one of them is a 4-cycle of 
edge-cubinos and the remaining two are 4-cycles of center-cubinos. So, considered as a permuta- 
tion of center-cubinos %* is even. 

Conclusion. No operation in C can be an odd permutation. Hence 


C=%,. oO 


The order of € then is 24!, and for C it is 4 - 24!. 

The fact that C has index 2 in © means that we are only able to create an odd permutation of 
center-cubinos at the expense of some other odd permutation. From the proof it follows that the 
other odd permutation must be a permutation of the corner-cubinos. 

The last fact may be explained further. Consider the group $8 of permutations of corner-cubinos 
and center-cubinos while ignoring the edge-cubinos. The group ‘8 must be isomorphic to a 
subgroup of § x ©. The meaning of the restriction above is then that a pair of permutations 
(X,Y) © § X © belongs to ¥ if and only if the permutations X € § and Y € © have the same 
parity. This means that 8 is the subgroup of § x © of index 2 consisting of all operations that 
are even as permutations of these 32 cubinos. 

Still unsettled is the question of the group P consisting of those operations from ‘8 which fix 
all edge-cubinos. Obviously they include all pairs from H < C. We can thus say 


HxCcPcS, 


and as H X C has index 2 in §8, we know now that P is one or the other. 
When we have computed the edge-groups, we shall be able to show that 


P=. 
The edge-cubinos 


First we consider the edge-cubinos ignoring corners and centers. Then the most we can hope to 
do with the edge-cubinos is to obtain any permutation of them from ©,, together with any 
24-tuple of flips from 8,. It turns out that edge-cubinos cannot be flipped, so it is enough to 
consider the subgroups of G,, and we can avoid the wreath product. 


THEOREM 2. No operation flips any edge-cubino. 


Proof. If an edge-cubino is flipped, then the orientation from Fig. 7 must look like Fig. 10. But 
this is not obtainable, because the cubino-faces then do not preserve orientation. O 


REMARK. It is also true that if an edge-cubino is physically removed, the hidden “foot” will be 
revealed to be asymmetric. 
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This means that for any given location and any given edge-cubino, it can only be oriented in 
one way. Hence we can consider the group of operations on the edges as a permutation group & 
of the 24 edge-cubinos, 1e., 8 C G54. 

Next we consider the subgroup K of R of operations on the edge-cubinos fixing the center- 
cubinos and the corner-cubinos. 

We have from Theorem 2, that 


KCRCG,,. 
They all turn out to be equal. 
THEOREM 3. 
K=&8 =6,,. 
Proof. At first we prove that 
W,, CK. 


The group K contains the 3-cycle (2); see Fig. 11. 
(2) [2-[Lt,u-]] ek. 


[2° ,[L*,U-]] 
Fic. 11. A 3-cycle of edge-cubinos. 


It is obvious from Remarks 3 and 4 that we can obtain any 3-cycle of edge-cubinos by 
conjugation of this one. Hence K contains all 3-cycles of edge-cubinos and thereby %.,,. 

It is obvious that ® contains an odd permutation, 3 * € @ is a 4-cycle of edge-cubinos and 
odd. 

But %* indicates the existence of a 4-cycle in K too. %}* consists of a 4-cycle of edge-cubinos 
together with two 4-cycles of center-cubinos. A permutation of center-cubinos made of two 
4-cycles is even and hence belongs to C according to Theorem 1. This means that we can 
rearrange the center-cubinos with the help of a permutation from C, that is a permutation fixing 
all other cubinos. Doing this we are left with exactly one 4-cycle of edge-cubinos, which is the 
wanted odd permutation in K. So K is bigger than %,, and hence equal to 6,,. O 


The orders of K and & are both 24!. 

The fact that K=& (= ©,,) means that any permutation of edge-cubinos can be done 
without disturbing the others. This also means that any permutation in ‘8 can be done without 
changing the edge-cubinos. So from this follows, that 


P=. 
The structure of R 


The result in Theorem 2 says about R, that 
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R=K xP. 
Any possible permutation can be executed independently on the edge-cubinos and on the other 
cubinos. And from Theorem 1 we have 
HxCcPc§xG, 


where P consists of the even permutations. This is to say, that of the 3 subgroups containing 
H x C and of index 2 in § X ©, the group P is the one different from H x € and § x C. See 
Fig, 12. 


E 


FIG. 12. Diagram of center- and corner-groups. 
The order of R then is 
order(R) = order(K) - order(P), 
and 
order(P) = 2 - order(H) - order(C); 
all together 
order(R) = 24! - 2 - 3’ 3 - 8! 3 - 24!. 


To compute the number of patterns recall the group C, of invisible permutations. As Cp is a 
subgroup of C, it must consist of the even permutations of each of the six 4-tuples by itself. There 
are 4!° permutations of 6 sets of 4 elements each, and of these, half are even. The order of C, 
hence is 


. 6 
5 24°. 
The number of patterns of visible difference hence is 
37. gt. 24! 
546 = 177 62872 41975 57644 87697 82553 87965 78406 40000 00000. 


(This number includes a factor 24 from the group of physical turns of the cube (the hexahedron 
group; see [3], p. 37).) 
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How to solve a mess? The principle of solution may be based on a descending series of 
subgroups, each normal in the preceding one with a nice factor group. The use of the series shall 
be explained below. Let us define K’ = %.,, as the subgroup of K of even permutations, and E as 
the group of one element, the unit. We shall then give an example of such a series (with factor 
groups below): 


RDKxKHXCDKXTXC D KXEXCODOK XxEXCDEXEXCDEXEXE 
3B W T = 33 3> Wo, Wo, 


We consider the pattern (the mess) of the Revenge as a permutation obtained from the nice 
start position with one-coloured faces by some operation. This permutation belongs to R, and we 
want to find a way to write it as a series of the twelve generators of R, F*, 3", etc. 

If the permutation of corner-cubinos is odd, then we start with any face-turn, for example F*. 
This is to be considered as a representation of the non-unit of the first factor group 3,. Thus we 
reduce the problem to a permutation in K x H x C. 

Then we use the generators of H/T = %,, the 3-cycles of corners known from the Cube; see 
[2], p. 32; [4], p. 44, e.g., 

[F-,U*B*U ], 
to arrange the corner-cubinos on right places relative to each other. This means we have reduced 
the problem to a permutation in the group K < T x C. 


Now the generators of T = 33 turn the corners in the right directions. They are also obtainable 
from the Cube; see [2], p. 38; [4], p. 44, e-.g., 


[[Ft,D*P,u*]. 


This reduces the problem to a permutation in the group K X E xX C, and having arranged the 
corners we rejoice that permutations from this group keep corners fixed. 

If the necessary permutation of edge-cubinos is odd, then we need a generator of K/K’ = 3; 
1e., any odd permutation, e.g., 

o. 

After this reduction to the group K’ x E x C, we can replace the edge-cubinos completely by 
help of formula (2) and its conjugates. This replacement reduces our permutation to the group 
EXxXExC. 

This group is generated by the formula (1) and its conjugates. Using this we don’t need to 
reach E; it is enough to reduce to any permutation from C, (the difference is invisible). 

The descending series used here is vivid graphically, because more and more cubinos fall into 
place. The lower subgroups tend to have more complicated operations. A series can be constructed 
on the opposite principle, where the last stages use simpler operations. Watching the result is like 
magic. Nothing seems to be happening until the last few moments, when PRESTO! everything 
falls into place. 

The problems in the case of the Revenge are due to the two factor groups 3,. They are so easy 
to overlook, but to postpone these two “corrections” might double the work. Besides, they are 
hard to invent by oneself —a mathematical benefit from having heard about odd and even! 
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|arg f(z)| < as and f(0)=1, witha >1.Ineach > the set of support points is the same as 
EHF [5], [13]. 
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LINEAR METHODS IN GEOMETRIC FUNCTION THEORY 


T. H. MACGREGOR 
Department of Mathematics and Statistics, SUNY at Albany, Albany, NY 12222 


1. Introduction. This paper describes results in classical complex analysis in which convexity is 
the main theme. More generally, linear and topological ideas are central to the presentation. The 
emphasis is on finding the extreme points of families of analytic functions arising in geometric 
function theory and applying this information to the solution of extremal problems. In particular, 
such questions are discussed in the context of starlike and convex mappings of the open unit disk 
and other families defined geometrically. 

Some results mentioned are well known and others were obtained recently, a few only in the 
last couple of years. The approach represented by this development provides a way of deriving 
several established results from a coherent point of view based on a few general abstract 
principles, the most critical of which is the Krein-Milman theorem. This approach also leads to 
the solution of more difficult problems. The presentation here focuses on certain families and 
problems where there are complete and striking results. 

Several ideas used in this paper are geometric. Some only involve convexity facts about the set 
of complex numbers. A useful reference for such ideas in R”, Euclidean n-space, is [10]. Usually 
the objects to be studied here are sets of analytic functions, and thus certain concepts require a 
more abstract setting. A suitable one is that of a locally convex linear topological space. Chapter 5 
of the book [7] contains the facts needed about such spaces. Two additional useful references 
about convexity are [3] and [33], which are at a more elementary level. 

Linear methods are well established in areas of complex analysis, such as in the theory of H? 
spaces [8]. It was much more recently that such techniques had an influence on research in the 
theory of univalent functions. A number of results about linear problems and univalent functions 
are contained in [28], and a forthcoming monograph [14] has a similar orientation. This article 
represents a survey of these developments. 

The author began writing this paper in preparation for an invited address given at the 1983 
summer meeting of the Mathematical Association of America held in Albany, New York, on 
August 7-11. The title of the talk was “Convexity ideas in geometric function theory.” The author 
dedicates this paper to Vincent Cowling, who encouraged the author to give that talk and has been 
supportive of the author’s professional work for several years. 


2. Linear, topological structure of the space of analytic functions. Let ./ denote the set of 
functions that are analytic in the open unit disk A = {z: |z| < 1}. The set © is a vector space 
with respect to the usual addition of functions and multiplication of functions by complex 
numbers. A topology for # is described below; it corresponds to convergence which is uniform 
on compact subsets of A. This topology is chosen because it so intimately relates to classical 
extremal problems for .~. An important theorem of Weierstrass asserts that if { f, } 1s a sequence 
of functions analytic on a region D that converges uniformly on each compact subset of D, then 
f=limf, is also analytic in D [2, p. 176]. Moreover, { f/} converges uniformly to f’ on each 
compact subset of D. 

The topology for # is defined as follows. Let M(r, f) = max {|f(z)|:|zZ| <r} when fev 
and 0<r<1. Let {7,} be an increasing sequence of real numbers such that 0< 7, <1 
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included classes given by B. Epstein, E. Grosswald, H. Rademacher, and I. J. Schoenberg. MacGregor’s research is 
in complex analysis in the field of geometric function theory. He was a faculty member at Lafayette College from 
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(n = 1,2,...) and r, > 1. If f,g © PX and d,(f,g) = M(r,,f— 2g), then d is defined by 


(1) d(f,8) = Y Tea 


It is not difficult to show that d is a metric on W and the topology given by d is equivalent to 
convergence that is uniform on each compact subset of A [2, p. 221]. As usual for a metric space, 
f, 2 f means d(f,,f)— 0. More generally the vector space (D) of functions analytic on a 
region D can be described the same way. In particular, the metric for &(D) is obtained by 
replacing the sets {z:|z| < 7,} by an increasing sequence of compact sets whose union is D. 

A fundamental theorem of P. Montel is especially useful in determining topological properties 
of subsets of (D). If AC M(D), then F is said to be locally uniformly bounded provided that 
the following condition holds: if C is a compact subset of D, then there is a positive real number 
M such that |f(z)|< M when fe€F and z € C. Montel’s theorem states that each sequence 
{ f,} of functions belonging to a locally uniformly bounded family has a convergent subsequence 
[2, p. 224]. Montel’s theorem is a generalization of the Bolzano-Weierstrass theorem, which asserts 
that every bounded sequence of complex numbers has a convergent subsequence. In fact, the 
proof of Montel’s theorem involves a repeated application of the Bolzano-Weierstrass theorem. 
Since &@(D) is a metric space, Montel’s theorem leads to the following characterization of 
compact sets : a subset ¥ of (D) is compact if and only if F is locally uniformly bounded and 
whenever f, © A and f, > f then fe F. 

The space ./(D) also is locally convex. In particular for #, { f:|f(z)| < M when |z| < r}isa 
convex neighborhood of 0 for each M and r such that M > 0 and 0 < r < 1. Insummary, ~(D) 
has the properties of a locally convex linear topological space. Thus, the ideas and results from 
that general theory are applicable, and they shall be presented here generally in a form that 
applies to x. 


DEFINITION. If FC &, then f is called an extreme point of ¥ provided that fe F and the 
relations f=tg+(1—t)h,gE€F,heEF, and 0 <t¢ < 1, imply that g =h. 


In other words, f is not a proper convex combination of two distinct elements of F. Let EF 
denote the set of extreme points of #. A general problem discussed here is finding EF for 
certain subsets of 7. 


3. Functions having a positive real part. 


DEFINITION. Let # denote the subset of ¥ consisting of functions p such that Re p(z) > 0 
({z| < 1) and p(O) = 1. 


An example of a member of # is given by P(z) = (1 + z)/(1 — z), which maps A one-to-one 
onto the half plane {w: Rew > 0}. Clearly, F is a convex subset of .. Schwarz’s lemma [2, p. 
135] is used here and later on. It asserts that if $ is analytic in A, |$(z)| < 1 for |z| < 1 and 
(0) = 0, then |$(z)| < |z|. If p € 9, then = P~!¢p satisfies Schwarz’s lemma. This implies 
that if p € F, then | p(z)| < A + |z)/Q — |z)). Therefore F is locally uniformly bounded. Also, 
if p, € A(n = 1,2,...) and p, > p, then p € M and Re p(z) > 0 (|2z| < 1). Since p(0) = p, (0) 
= 1, the minimum principle for harmonic functions [29, p. 271] implies that we cannot have 
Re p(z) = 0, and hence p € #. Therefore Y is a compact subset of WH. 

The normalization p(0) = 1 is a convenient way of forming a compact set. Other functions in 
& having the property Re p(z) > 0 when |z| <1 are easily related to M through the map 
p > (p — iIm p(0))/Re p(O). The study of “positive harmonic functions” is essentially the same 
as the study of # because each such function wu is associated with an analytic function p by 
Rep = u. 

The family # is characterized as the set of functions p represented by 

T —it 
(2) p(z) =f" = ay (2) 


0 l-e'z 
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for |z| < 1, where p is a real-valued nondecreasing function on [0,27] and p(27) — (0) = 1. 
Equation (2) is usually called the Herglotz formula and is due to G. Herglotz [16] and F. Riesz 
[25]. An easy consequence of the fact that Re P(z) > 0 when |z| < 1 is that each function defined 
by (2) belongs to #, given the conditions on p. The converse is more difficult to prove. An 
argument depending on the Helly selection theorem is given in [9, p. 22] and [24, p. 30]. 

Equation (2) can be rewritten in terms of probability measures on 0A, the boundary of A. By a 
standard argument from measure theory, each real-valued nondecreasing function p on [0,27] 
defines a nonnegative measure p* on [0,27]. First of all, w* 1s defined on closed intervals by 
u*([a, b]) = w(b) — p(a) when 0 < a <b <2. Then p* is extended to the Borel subsets of 
[0,27] through the way such sets are related to closed intervals. If sf is normalized by 
u(t) = 5[e(t — ) + p(t + )], then the correspondence between nondecreasing functions and 
nonnegative measures can be shown to be one-to-one. The condition (27) — p(0) = 1 is the 
same as p*((0,27]) = 1; hence p* is a probability measure on [0,277]. It is convenient to regard 
u* as being defined on (suitable) subsets of dA and to write p instead of p*. In this way, the 
Herglotz formula can be stated as follows. 


THEOREM. p © # if and only if 


(3) p(zy=f = 


ixj=1 L — xz 


du(x), 


where p is a probability measure on OA. 


It is known that the correspondence between # and nondecreasing normalized functions is 
one-to-one, and consequently (3) gives a one-to-one map between # and the set of probability 
measures on JA. Let A denote the set of such measures. 

Suppose that p, qg, and r belong to Y and are associated with the probability measures p, v 
and A, respectively. If p = tg+(1—1t)r (<t< 1), then (3) implies that 


f 1+ xz , an(x) = f Lt XZ (x), 


jail — xz -1, 1 — xz 


where » = ty + (1 —1)A. Since the map #- A is one-to-one, w = ty + (1 — t)A. In other 
words, a convex decomposition of a function in # corresponds to the same decomposition of the 
associated measure. An exercise in measure theory shows that the extreme measures of A consist 
of the point masses, that is, the measures for which p({x}) = 1 for some x on 0A. This proves 
the following result. 


THEOREM. EY consists of the functions p such that p(z) = (1 + xz)/(1 — xz), where |x| = 1. 


The next thing to show is that the one-to-one correspondence between # and A may be 
regarded as a statement about the moments of probability measures. Suppose that p and q 
belong to # and are associated with the probability measures p and v, respectively, 


p(z)=1+ > p,z",and q(z)=1+ )) 4,2” ({z|< 1). 
n=] n=] 
Since 


1+ 2 oC 
P(z)=qeea14+ D2" (<0), 
n=] 


the equation (3) implies that 
Py, = 2f x" du( x) and q, = 2f x"dv(x) forn=1,2,.... 
|x|=1 |x|=1 


The functions p and gq are equal if and only if p, = q, (n =1,2,...). This shows that two 
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probability measures p and v are equal if and only if 
(4) [ x" du(x) = [ x" dv(x) (n =1,2,...). 
|x|=1 |x|=1 


The facts about # just described, including the determination of E#, are a foundation for 
obtaining a number of similar results for subsets of & in geometric function theory. 


4. Families of univalent functions. The purpose of this section is to consider certain subsets of 
& related to conformal mapping. A function f is called univalent in D if it is one-to-one in D; 
that is, if f(z,)=f(z,) and z,,z,€D imply that z, =z,. A detailed treatment of the 
information presented in this section and other facts about univalent functions are in the books 
[9], [23]. The book [29] is at a more introductory level and contains several results about univalent 
functions, especially in Chapter 12. The classic “problem-solution” volumes [22] by G. Pélya and 
G. Szeg6 also contain information about univalent functions (see part IV). Both [22] and [29] are 
useful general references for other aspects of complex analysis, including some of the facts 
mentioned earlier. 


DEFINITION. Let S denote the subset of # consisting of univalent functions f which satisfy 
f(0) = 0 and f’(0) = 1. 


The so-called Koebe function k(z) = z/(1 — z)* is a member of S. It maps A onto the 
complement of the closed ray {w: w is real and w < — 4}. This follows from the mapping 
properties of P(z) = (1 + z)/(1 — z) and the relation k = 4(P* — 1). If fe S and 0 < |x| < 1, 


then x f(xz) also is a member of S. In particular, z/(1 — xz)* belongs to S when |x| = 1 and 


maps onto the complement of a ray. These so-called Koebe functions are extremal for a number 
of problems over S. 

S is not convex, since a proper convex combination of two distinct Koebe functions is not in 
S. This follows simply from the fact that such a function has two poles of order two on OA. If f 
has a pole at a(|a| = 1) of order two, then f(z) = g(z)/(z — a)’, where g is analytic at a and 
g(a) # 0. From this relation, it can be shown that if N is a neighborhood of a, then f takes on all 
sufficiently large complex numbers except, perhaps, those in an arbitrarily small angular region 
{w: largw — al < e}, for z Ee NO A. It follows that a function with two poles of order two on 
dA takes on (most) large complex numbers at least twice, as z varies in A. 

Univalence is readily lost through convex combinations of functions. One way to construct 
suitable examples is to rely on the result that a univalent function has a non-vanishing derivative 
[29, p. 299]. (More generally, non-constant analytic functions have the property that they are n to 
1 locally, if the derivative has a zero of order n — 1 at a point [2, p. 133].) For example, without 
the normalization f’(0) = 1, the average of two univalent functions may be constant, as 42 
+ 4(—z) = 0 illustrates. Even with the normalizations of S, this approach provides suitable 
examples. For example, let f be a member of S with Re /f’(z,.) = 0 and |z)| < 1. Define g by 
g(z) = xf( xz), where x = exp (2iarg z,)). Then g © S,g'(z,) = f(z) and f’(z)) + g’(Z) = 0. 
Thus, } = 4f+ 4g has a vanishing derivative at z,, has the normalizations of S but is not 
univalent. (It is not difficult to construct functions in S such that Re f’(z,.) = 0 for some z,.) An 
even more striking example, due to A. W. Goodman [11], shows that there are two functions f 
and g in S such that $f + 4g takes on some value an infinite number of times in A. 

It is well known that S is locally uniformly bounded and, in fact, | f(z)| < |z|/(. — |z))? when 
feS and |z| < 1 [29, p. 326]. A theorem of A. Hurwitz asserts that if {f,} is a sequence of 
functions which are analytic and univalent in a region D andif f, > f, then either f is univalent 
in D or f is constant [29, p. 311]. If f, Ee S and f, — f, then f’(0) = 1, and thus f must be 
univalent in A. This verifies that S is compact. 

Consider next subsets of S$ consisting of starlike and convex mappings. A set D of complex 
numbers is called starlike with respect to w, provided that tw + (1 — t)w © D when w € D and 
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0 < ¢ < 1. In other words, if a point is in D, then so is the line segment connecting that point to 
Wo- 


DEFINITION. Let S* denote the subset of S consisting of functions f for which f(A) is starlike 
with respect to 0. 


The Koebe functions belong to S* since they map A onto the complement of a ray. Because 
S* cS, the set S* is locally uniformly bounded. Also, if f, € S* and f, > f, then f © S*. One 
way to show this is to appeal to the geometric fact that the limit of the starlike sets f,(A) is a 
starlike set. An alternative argument is based on the condition (5) given below. Therefore S* is 
compact. The example of a proper convex combination of distinct Koebe functions also serves to 
show that §$* is not convex. The property of belonging to S* is easily lost through convex 
combinations; thus it is somewhat unexpected that the techniques involving convexity considera- 
tions and described later are so effective with S$*. 

The family S* is analytically characterized by the following conditions: fe #, f(0) = 0, 
f’(0) = 1 and 


(5) Re{ Z} > 0 (|z| < 1) 

f(z) 
[29, p. 332-333]. The inequality (5) expresses the local fact that the angle of the vector f(re’’) 
increases with @. This follows from the relations 


) ) 
36 arg f(re'®) = 26 Im log f(re’”) 


~tm/ 2 )\ =m) LCC) i010) 2 Re | eee") 
= Im 39 log f(re")| = Im ire) ret = Re| (re) } 


The argument for the characterization uses the fact that if fe S* and 0<r7r <1, then /[{z: 
|z| < r}] is starlike with respect to 0. The last relation is a consequence of an appropriate 
application of Schwarz’s lemma. The argument also depends on showing that (5) together with 
f(0) = 0 and f’(0)  O implies that the curve w = f(re’’), 0 < 6 < 27, winds around the origin 
exactly once with an increasing angle. This property of the curve implies that f is univalent in { z: 
|z| < r} for each r withO <r< 1. 

Condition (5) implies a close relation between S* and F. If 

#2) (yey 

(6) p(z) 42) (\z| < 1), 
it follows that p € # if and only if f © S*. Moreover, (6) defines a one-to-one correspondence 
between # and S*. The argument for the latter fact requires verifying that (6) may be uniquely 
solved for f. To do this, first note that if f € S*, then f(z)/z is analytic and non-vanishing in A. 


Thus log f(z) 


is analytic in A, and 


4 jog M2) _ Pe) 1 


az Zz Z 


Z 


This implies that 


(7) f(z) = rexp{ [PO aw}, 


A use of (3) in (7), followed by an interchange of the order of integrations, yields the 
representation 


(8) f(z) = zexp| -2ftog(1 ~ x2) an(x)| (\21<1) 
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for function in $*, where p € A. 


DEFINITION. Let K denote the subset of S consisting of the functions f for which f(A) is 
convex. 


The function f(z) = z/(1 — z) belongs to K, and it maps A one-to-one onto the half plane 
{w: Rew > ~— 4}. Since K C S, it follows that K is locally uniformly bounded, and a geometric 
argument can be given to verify that K is closed. Therefore K is compact. Also, K is not convex. 
To see this, note that z/(1 — xz) belongs to K whenever |x| < 1, and X3_,t,z/(1 — x,z) is not 
univalent in A when x, are distinct, |x,| = 1, t, > 0 and X}_,¢, = 1. The non-univalence follows 
from the local behavior of a function at a simple pole. The function described by the sum takes on 
certain large values at least twice in A. 

The family K also has an analytic characterization. Namely, f € K if and only if fe”, 
f(0) = 0, f"(0) = 1 and 


2f"(z) 
(9) Re| ; +1} >0 (\z| < 1) 
f(z) 
[29, p. 336]. As a local condition, (9) is equivalent to the assertion that the tangent to the curve 


w=f (re’’), 0 < @ < 27, turns counterclockwise as 8 increases. This follows because the angle of 
the tangent to this curve is arg {izf’(z)} and 


10 eu 10 
re'’ fre”) n i). 
f'(re") 
. . _ 7 f(w) . 
A comparison of (5) and (9) shows that the map f— g defined by g(z) = i a dw is a 
0 


0 16 £7 H d SaeolG gr H 
56 arg { ire’ f’(re'®)\ = 3g Imlog | ire'*f’( re?) = Re| 


one-to-one correspondence from $* onto K. Note that the normalizations for S* and K imply 
the equivalent relation f(z) = zg’(z). 


5. The Krein-Milman theorem. The convex hull of a set F is defined as the intersection of all 
convex sets containing #. Likewise, the closed convex hull of #4, which is denoted by HF, is 
defined as the intersection of all closed convex sets containing ¥%. If ACH, then fe HF 
provided that f is the limit (in the topology of .) of functions having the form 


(10) do thes 
k=1 


where f, © F, t, > 0 and Lf_,t, = 1. This is a consequence of the fact that the functions 
defined by (10), with varying t,, f, and n, form the smallest convex set containing #. The set 
H¥ is compact when F¥ is compact. To prove this, it is only necessary to show that HF is 
locally uniformly bounded. This follows from the same property of #; that is, if |f(z)|< M 
when |z| < r and f & F, the same inequality holds when f € HF by first applying the triangle 
inequality to (10). 

The Krein-Milman theorem is of fundamental importance in questions concerning extreme 
points and linear extremal problems. It holds in any locally convex linear topological space X and 
consists of the following assertions [7, p. 439-446]. 


THEOREM. Suppose that Y is a compact subset of X. 


(11.1) If Y is non-empty, then EY is non-empty. 
(11.2) HEY = HY. 
(11.3) Jf HY is compact, then EHY C Y. 


As remarked earlier, the compactness of a subset ¥ of & implies the compactness of HF. 
This fact and (11.3) imply that EHAC F when F is a compact subset of #. 
In the following way, the Herglotz formula yields (11.3) in the case of the family #. If p(z) 
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; " Ll+x 
_ f + xz u(x) and » © A, then p is a limit of functions of the form }, eat 
— XK 


xj=1 lL — xz k=1 


n 


where |x,;=1, t, >0 and )) t, =1. This follows from the approximation obtained by 
k=1 
replacing pw by a convex average of point masses at x,. This is the same as the assertion that the 


set of all such finite sums is a dense subset of #. (One proof is based on the weak* compactness 
of A.) 

Suppose that X and Y are two linear topological spaces. A mapping #: X — Y is called a 
linear homeomorphism if it is a homeomorphism of X onto Y and satisfies Y(af + Bg) = 
aL (f) + BL(g) for all scalars a and f and all vectors f and g. If .% denotes the subspace of 


& consisting of functions f such that f(0)=0, then g(z) = we fw) dw defines a linear 
homeomorphism of 2% onto %. Let # denote this mapping. To show the continuity of %, 
verify first that if f, > f and g,(z)= [. fw) dw, then g, > g. Also, f(z) = zg’(z), and 


Weierstrass’ result implies that if g, > g, “then f, > f. Earlier, it was shown that #(S*) = 
and the general properties of linear homeomorphisms yield #(HS*) = HK and “(EHS y= = 
EHK. 


6. Extreme points of HS* and HK. In this section, HS*, HK and their set of extreme points 
are determined. First a family closely related to $* is studied. 

Let # denote the subset of © consisting of functions f for which f(0) = 1 and f(A) Cc F(A), 
where F(z) = 1/(1 — z)*. (We say that f is subordinate to F.) The set # is compact. To see 
that # is closed, note that if f, € 9 and f, > f, then f(0) = 1 and f(A) C F(A). The last two 
relations and the fact that non-constant analytic functions are open mappings [2, p. 132] show 
that f(A) c F(A). Also, F¥ is locally uniformly bounded because of Schwarz’s lemma applied to 
the analytic function F~! ¢ f. This yields | f(z)| < 1/(1 — |z|)? when |z| < 1 and fe #. 

If Y denotes the subset of described by the conditions g(0) = 1 and Re g(z) > 4(|z| < 1), 
then fe # if and only if f = g* where g € Y. Also, g € Y if and only if 2g -1 © F. The 
result about E# implies that FY consists of the functions g(z) = 1/(1 — xz) where |x| = 1. 

Suppose that f € EH#. Then, by (11.3), fe # and hence f = g*, where g € Y. To show 
that g € EY, assume that g € EY. Then 


g=tg,+(1-t)g, where 0<¢< 1, 2,2,€ Gand g, #g>. 


This implies that f = tf, + (1 — t)f,, where f, = gg, and f, = gg,. This contradicts fe FH# 
since gh € # whenever g,h © G. This latter fact may be regarded as a statement about complex 
numbers; namely, if Rew, > $ and Rew, > 4, then Re/w,w, > 4 (this follows by expressing w, 
and w, in polar form and later using the concavity of u(@) = logcos #). Therefore, EH#¥ consists 
only of functions having the form f(z) = 1/(1 — xz)*, where |x| = 1. It follows from (11.2) that 
each function f in H# is represented by 


1 


(12) A2)= faye te) (I<), 


where p € A. 
Suppose that f © S*. According to (8), f(z) = zexp {2q(z)}, where 


(13) a(z)= [lor z— awl) 


and p © A. The function Q(z) = log i LE ; belongs to K, as can be shown by a computation 
which verifies (9). Since (13) expresses the complex number q(z) as a convex average of values of 


Q, this implies that g(4) c Q(A). Therefore the function f(z)/z belongs to #. An application 
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of (12) shows that 


(14) =f Gop ae) 


— xz) 


for some p € A. 

Let # denote the set of functions represented by (14), where p © A. It follows that F is 
convex, and it can be shown (using the weak* compactness of A) that F is closed. Therefore 
HS* c F, Conversely, AC HS* since the Koebe functions z/(1 — xz)*(|x| = 1) belong to S* 
and F is the closed convex hull of these functions. Therefore, (14) characterizes the functions f 
in HS*. 

The map HS* — A given by (14) is one-to-one. This is a consequence of the fact that the 
coefficients of the power series for such functions are given by a, = nf)x"* du(x) 
(n = 1,2,...) and the result about the moments of probability measures described in Section 3. 
This one-to-one property shows that EHS* is the set of Koebe functions, with the same argument 
given for EF. 


THEOREM. HS* consists of functions f given by (14), where uw varies over the probability measures 
on 0A. EHS* consists of the Koebe functions f(z) = z/(1 — xz)*, where |x| = 1. 


The results about HS* transform to equivalent facts about HK through the linear homeomor- 
phism between S* and K, as described in Section 5. This leads to the following result. 


THEOREM. HK consists of the functions f represented by 


(15) f(z) = f 


[x|==1 1 — XZ 


Z 


du(x) (\z| <1), 


where pp» © A, and EHK™ consists of the functions f(z) = z/1 — xz, where |x| = 1. 


The kind of argument given above to obtain EHS* and EHK has been successfully applied to 
several families in geometric function theory, including the so-called typically real functions, the 
close-to-convex functions, and the functions with bounded boundary rotation [4], [5]. (In the case 
of functions of boundary rotation at most kz, the problem remains open when 2 < k < 4.) The 


argument given about H# is a particular case of a more general argument which shows that the 
+ CZ 


1—z 
consists of the functions F(xz) (|x| = 1), whenever |c| < 1, c # —1 and a > 1 [4]. The results 
about EHS* and EHK were first proved by L. Brickman, D. R. Wilken, and the author in [5]. 


set of extreme points of the closed convex hull of the functions subordinate to F(z) -( 


7. Applications to extremal problems. The question of finding the extreme points of a given set 
is interesting in itself, but this question is especially important in complex analysis because of its 
relation to the solution of extremal problems. In this section, a variety of extremal problems are 
described which can be solved using the information about the extreme points of families obtained 
earlier. 


DEFINITIONS. The term functional means a complex-valued function defined on &. A func- 
tional L is called continuous (on #) if L(f,) ~ L(f) when f, € # and f, > f. A functional L 
is called linear if 


L( af + Bg) = aL(f) + BL(g) 
when a,B € C and f,g EW. 
If |z)>|< 1 and nv is a nonnegative integer, then L(f) = f‘"(z,) is a continuous linear 
functional. The continuity of this functional follows from a repeated application of Weierstrass’ 


theorem, which shows that if f, - f, then ff > f( for n = 1,2,.... Another example of a 
continuous linear functional is given by the nth coefficient of the power series of functions in #. 


400 T. H. MacGREGOR [June-July 


More generally, if by, b,,...,5, is a finite sequence of complex numbers and L(f) = L7%_)b,a, 
when f(z) = ©%_,)a,z* (|z| < 1), then L is a continuous linear functional. This may be extended 


to infinite sequences {b,} provided that im, ids <1, and then L(f) = L¢_)b,a, char- 
acterizes the form of each continuous linear functional [9, p. 280]. 

A fundamental area of research in classical complex analysis is the solution of extremal 
problems. For the family ., one general form for extremal problems is described as follows : Find 
sup {ReJ(f): f © F } where J is a functional and AC W.If F is compact and J is continuous, 
then the range of ReJ is a compact set of real numbers, and thus ReJ(/f) has a maximum over 
¥#, Any function f in FA for which ReJ(f) = max {ReJ(g): g © F } is called an extremal 
function or a solution to the extremal problem. 

When the continuous functional is also linear, more can be said about extremal functions, and 
this relates to extreme points. 


THEOREM. If F is a compact subset of LS and L is a continuous linear functional on &, then 


(16) max ReL(f) = max ReL(f) = max ReL(f). 
fEEHF fEF fEHF 

The last two maxima in (16) exist because A and HF are compact. However, since EHF is 
not always compact, the existence of the first maximum is not immediately evident. Because of the 
definition of HF and (11.3), EHAC AC HF. Therefore, if we are interested in the problem of 
maximizing Re L over F, (16) permits us to solve the problem only over the smaller set EHF. 
This becomes a useful reduction when EHF can be found and is considerably smaller than F. At 
the same time, (16) shows that the extremal problem over A has the same maximum over the 
larger set HF. 

Here is an outline of the proof of (16). It is a simple abstract argument based on the 
Krein-Milman theorem. If 


M =max{ReL(f):fE€ HF} and G={fe HF:ReL(f)=M}, 


it can be shown that Y is compact and convex. By (11.1) and (11.3), Y has an extreme point, say 
g. It follows that g is an extreme point of H#, because the relations 


g=th+(1-—t)k, O<t<landh,k © HF 
imply that 
M = ReL(g) = tReJ(h) +(1 — t) ReJ(k) < tM+(1-1t)M=M, 


and thus h,k © Y. In other words, there is an element of EHF in the solution set Y. 

Here are two applications of (16), given to F and to §*. First, recall that EF consists of the 
functions P(xz), where |x| =1 and P(z)=1+%°%_,2z”. Therefore, the problem of finding 
max {Rea,:p © PY}, where p(z)=1+ L9_,a,z”, is reduced to finding max,,,_, Re(2x”), 
which equals 2. Second, if f€ S* and f(z) =z+2%_,a,z", then Rea, <n, because 


max {Rea,: f € EHS*} = max {Re(nx"~'):|x| = 1}. 


In general, linear extremal problems over #, S*, or K are reduced to specific computational 
problems of maximizing the real part of certain functions of x where |x| = 1. Several other 
families have the property that their set of extreme points may be parametrized by one or more 
real or complex variables over specific sets, and thus linear problems over these families also are 
reduced to specific computations. 

The theorems determining HS* and HK and the Herglotz formula are examples of Choquet’s 
theorem [21], which gives the representation of a continuous linear functional in terms of 
integration with respect to a probability measure on the extreme points of a set, for fairly general 
locally convex spaces. Since the extreme points of HS*, HK and # are parametrized over 0A, 
the representation of the functional L(f) = f(z) (\z| < 1) takes on the forms obtained earlier. 
Other examples of families in geometric function theory have their extreme points parametrized 
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on sets such as [0,1] or dA X 0A, and thus these families illustrate Choquet’s theorem with 
representations involving integrals over such sets. 
A real-valued function J defined on a vector space X is called convex if 


J(ix+(1—t)y) < W(x) +(1 — t)J(y) 
when x, y © X and 0 < ¢ < 1. The argument used to prove (16) shows that 
max { J(f):f © EHF} = max{I(f):f © F} = max (I(f):f © HF}, 


when J is continuous and convex on W and F¥ is a compact subset of x. For example, this 
applies to J(f) = |a,|, where a, is the nth coefficient of the power series for f, or more generally 
to J =|L| where L is a continuous linear functional. 

Another interesting convex functional is given by 


(1) ={ Pe) P a0)” 


where 0 < r <1, n 1s a nonnegative integer, and A > 1. The convexity of J follows from the 
Minkowski inequality. One application of these relations is that if f € K and F(z) = z/(1 — z), 
then 


(17) [Ur (re?) |\dé < [OF (re) |‘ do 
0 0 


for0<r<1,A>1 and n=0,1,... . The argument depends on the fact that EHK consists of 
the functions F(xz)/x (|x| = 1), and hence J is constant on EHK, as the change of variables 
xe’? — e'® shows. This result is contained in [17]. 

Another area of application concerns linear operators of order zero. In the context of the space 
WZ, L£ is a continuous linear operator provided that # is a mapping of # into W such that 
L(aft+ Pg) =aLl(f)+ BL(g) when a, B € C and f,g Ee #,and Y(f,) ~ L(f) when f, EH 
and f, — f. We say that # is of order zero if, in addition, 


LU f(z =LEFO) le] 


when f € # and 0 < |x| < 1; that is, & commutes with the change of variables z ~ xz. Some 
examples of continuous linear operators of order zero are those which assign to f the functions 
given by: 


f(z), Laf(2Y, freee dw, ef 1) dw, 


and the nth partial sum of the power series for f. It is easy to verify that these operators have the 
required properties. For example, the continuity of the first two operators is a consequence of 
Weierstrass’ theorem. In general, such operators are characterized by #[f(z)] = L9_9a,,5,2z", 
where f(z) = X£%_,a,,z” and the sequence {b, } satisfies lim, , ..|b,|'7” < 1 (see [18]). 

Suppose that F € Y, F£ is a continuous linear operator of order zero, and ¥ is a compact 
subset of w for which EHF consists of the functions of the form F(xz) where |x| = 1. If fe F, 
then the Krein-Milman theorem implies that f(z) = f\,,-, (xz) du(x), where © A. This 
integral can be approximated by suitable finite sums Lt, F(x,z). The linearity of & and the 
continuity of # yield the formula 


2UNM2) =f SEFC] da). 
Since # is of order zero, this formula can be written 


(18) PUM =) =f 2UFMeedulx), 
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Equation (18) expresses the complex number #[f](z) as a convex average of the numbers 
L£[F(-)]|,.- Therefore, #[ f(z) belongs to the closed convex hull of the set of numbers #[ F](w) 
where |w| < |z|. (We say that #[ f] is hull subordinate to #[ F].) 

As a specific example, (18) apples where #= Y and ¥& is the nth partial sum of a power 
series. Let p, be the nth partial sum of the power series of a function p in F and let P, be the 
nth partial sum of P. If 0 < r < 1, then p,[{z:|z| < r}] is contained in the closed convex hull of 
P,[{2:|z| < r}]. In particular, the largest value of r,, such that Re p,(z) > 0 when |z| < r, can be 
obtained by a specific computation using P,. Note that P(z) = (1 + z)/(1 — z), and this implies 
that P,(z) = (1 +z — 22""*)/(1 — z) for z ¥ 1. This yields the classical result that Re p,(z) > 0 
when |z| < 1/2 and n = 1,2,.... 

The diversity of continuous linear operators of order zero and the interesting examples of 
compact families # such that EHF consists of the functions of the form F(xz), where |x|-= 1, 
provide numerous applications of the method just described. The idea of linear operators of order 
zero in relation to # was introduced by R. M. Robinson in [26], and the connection of this idea 
to extreme points was presented by the author in [18]. 

Extremal problems, some of which are quite nonlinear, can be solved using ideas about extreme 
points in a less direct way. Here are two such examples. 

Suppose that J and L are continuous linear functionals and L( p) # 0 when p &€ &. Also, let 
FY, denote the set of functions of the form 


_ itu _yjitye ~ ly) = 
p(z) =t7 = +(1 et where 0<t<1 and |x|=|y|=1. 
Then 
(19) max Re( eh = max Re{ 72 |. 
peP \L(p)} pee  \L(p) 


This result is a particular case of a more general theorem due to St. Ruscheweyh [27]. The relation 
(19) provides a reduction of such extremal problems to a computational problem involving the 
parameters ¢, x and y. The proof of (19) depends on showing that J/L has the same range over 
FY, as it has over ¥. To show this, suppose that p@€ PY and X= J(p)/L(p). Then, the 
continuous linear functional N = J — AL equals 0 at p. Since each extremal problem of the form 
max {Ree’’N(p): p © P} (6 real) has a solution in EY, 0 belongs to the closed convex hull of 
the set of values that N achieves at the extreme points of #. The set { N( P(xz)): |x| = 1} can be 
shown to be an analytic curve, and, in particular, it is connected. It follows from the two-dimen- 
sional case of a theorem of Carathéodory [10, p. 35] that the number 0 is the convex average of at 


most two elements of that set. This implies that A € 7 (Po): 


The next example concerns the fact that the form of solutions to the extremal problems 
max; g Re ®[ £(5), £8)... f°), with § fixed and [f|< 1, can be identified for fairly 
general functions ®. As in previous examples, a reduction takes place, and it is an effective step in 
solving such problems at least when n is small. The method is applicable if either # is convex 
(and compact) with a simple set of extreme points, or if F is related to such a family through an 
analytic mapping. The idea is first to apply extreme-point techniques to a convex family in order 
to identify functions corresponding to boundary points of the family’s region of variability, the set 
of points (f(§), f’(),...,f°?(&) with varying f. Then that information is transferred to 
equivalent facts about the boundary points of the region of variability of the related family. This 
is possible since the two regions often are related by a homeomorphism. For example, this 
argument is directly applicable to # and to the family described by (13). The second family, in 
turn, is related to S* by f(z) = zexp[2q(z)], and this map generates the appropriate homeomor- 
phism. The details of these arguments and comments about related results, including coefficient 
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problems, are given in [19]. When ¥ is not convex, the solution to such general extremal 
problems usually gives a number which is larger over HY than over F. 

For families having a large set of extreme points, it is less likely that the methods outlined here 
will be successful. In any case, the general theory is applicable. For example, to solve the problem 
max;«s|a,|, where f(z) = z + LY_,a,2”, it suffices to consider functions in EHS. A few facts 
about EHS are described in [9, pp. 286ff.], including the result that if f € ES, then f(A) is the 
complement of a continuous curve w = w(t), 0 < t < 1, whose modulus strictly increases and 
tends to co as ¢t > 1. Such results, naturally, are of interest with regard to the famous Bieberbach 
conjecture, which states that if fe S and f(z)=z+L%_,a,z", then |a,| < n(n = 2,3,...). 
Note that, for the Koebe functions, |a,| =. (This conjecture has recently been proved by L. 
de Branges.) 


8. Families having ranges in a convex region. For the families discussed next the determination 
of the extreme points is a much more difficult problem, requiring further advanced ideas, 
including results from the theory of H? spaces [8]. The presentation in this section is generally 
more descriptive than those given earlier. 

Let D be a convex region and let w € D. Let ¥ denote the subset of for which f(A) C D 
and f(0) = w. Then ¥ is a compact, convex subset of .. The compactness follows primarily 
from Schwarz’s lemma, applied to F~'° f, where f € F and F is a one-to-one map of A onto D 
with F(0) = wo. The existence of F depends on the Riemann mapping theorem [2, p. 230]. A 
description of EF shall be given for certain specific regions D and for general regions with 
sufficiently smooth boundaries. 

The set H? is defined as the subset of # consisting of functions f such that the integrals 

67 |f(re'®)|? dO are bounded on the interval {r:0 < r < 1}. Generally it is assumed that D # C 
and D is not a half plane. A consideration of the support lines to D implies that f(A) is 
contained in an angular wedge of opening less than 7 or in an open strip. From this geometric 
restriction on the range of f, it can be shown that f © H" (see [1]). This permits the application of 
a number of facts about H'. In particular, f may be represented by the Poisson formula 


(20) f( re) = 55 [P(r 6 ~ 1) f(e) de, 
where 
1-r? 
(21) P(r, 8) = 1 — 2rcos@ + r? 
and 
(22) (eo) = lim fre). 


The limit in (22) exists for almost all 9 on [0,27) and f(e’®) is Lebesgue integrable. 

The results about E¥ are given in terms of the “boundary function” f(e'”). A critical step in 
such arguments is the construction of a function in terms of its boundary values through (20). 
Formula (20) expresses f(z) as a convex average of the numbers f(e”), since P(r,#) > 0 and 


se {27P(r, 9) d@ = 1. This permits the conclusion that f © ¥ essentially from f(e'’) € D for 


almost all 8. The actual construction of suitable boundary functions often depends on the use of 
harmonic measure [31, pp. 111{ff.]. 

One general result asserts that if fe F and f(e”) © AD for almost all 6, then f © EF [12]. 
This result is quite different from the earlier facts in that now EF is a much larger set. If F 
denotes a conformal mapping of A onto D with F(0) = wo, then ¥ is described as the set of 
functions of the form f= Fo¢, where ¢ € &, |$(z)| < 1 for |z| < 1 and $(0) = 0. Because of 
the univalence of F, the condition that f(e’?)e@ dD for almost all @ can be shown to be 
equivalent to |¢(e’’)| = 1 for almost all 6. Such functions ¢ are called inner functions [8, p. 24]. 
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Examples of inner functions are given by finite Blaschke products 


7 Zt a 
o(z) = “1d 1+ @,2z’ 
where |x| = 1 and |a,|< 1. Since the Médbius transformation w = (z + a)/(1 + az) satisfies 
|w| = 1 when |z| = 1 (given |a| < 1), finite Blaschke products satisfy |@(z)| = 1 when |z| = 1. 
Convergent infinite Blaschke products also are inner functions. 

The converse, that if fe EF then f(e'?) © dD for almost all 0, is also correct when D is a 
strip or an angular wedge with opening less than 7. A concise statement of this result is given for a 
strip. Let # denote the subset of of functions f such that —1 < Ref(z) < 1 for |z| < 1 and 
f(0) = 0. A function f in ¥ isin E¥ if and only if Re f(e’”) = +1 for almost all 0. This was 
proved by J. G. Milcetich in [20]. The result for wedges was proved by Y. Abu-Muhanna and the 
author in [1]. It is an open problem whether the same result holds when D is any (bounded or 
unbounded) convex polygonal region. 

In the case D = A and w = 0, E¥ consists of the functions f in ¥ satisfying 


(23) [tog (1 — fe) 40 = —00. 


This condition is equivalent to the same condition known for the extreme points of the family of 
functions satisfying |g(z)| < 1(\z| < 1) [8, p. 125], since the two families are related by the linear 
homeomorphism defined by f(z) = zg(z) (from # into &%). Condition (23) may be regarded as 
an analytic measure of how close |f(e’’)| is to 1. It is much less restrictive than f being an inner 
function. In particular, if | f(e’’)| = 1 on a set of positive measure, then (23) holds. 

As an example in the opposite direction, if f is analytic in A, |f(z)| <1 for |z| <1 and 
|f(z)| = 1 only for a finite number of points (on dA), then f is not an extreme point. (For 
example, f(z) = 5(1 + z) has these properties.) To verify this claim, it shall be shown that f 
satisfies {27 log (1 — | f(e'®)|) d@ > — oo. It suffices to show that if |f(z))|= 1 and z) = e'®, 
then there is a closed interval J containing a, in its interior for which f, log (1 — |f(e'®)}) dé > 
—oo. It may be assumed that z, = 1. The analyticity of f on dA implies that u(@) = Re f(e’’) 
and v(@) = Im f(e’’) are analytic in 6 in a neighborhood of the real axis. The same conclusion 
holds for the function |f(e’®)|* = u?(@) + v?(@). This function is not identically 1, and thus it 
has a power series about 9 = 0 which begins 1 — c#" + ---, where c > 0 and n is an even 
integer. Thus | f(e’?)| has a power series beginning 1 — 50" + -:+,and 

C 


log (1 — |f( e’”))|) > log ( 50") = log 3 4 n log |6| 


for all sufficiently small values of 6. Since fj log@d0 > —oo, it follows that /,log(1 — 
|f(e")|) d®8 > —oo for a sufficiently small closed interval J containing 0 in its interior. 


Let D be a bounded, convex region such that 0D is given by w = w(t), 0 < t < 1, where w 
has a continuous second derivative. When f © F, let A(@) denote the distance between f(e’’) 
and dD. Assume that the curvature of dD is never zero. Then f € EF if and only if fe F and 


(24) [°"togn(8) 40 = ~o. 


This was proved in [1]. In the case D = A,A(0) = 1 — | f(e’’)|. As an example, (24) characterizes 
the extreme points when the convex region is the interior of an ellipse. This and other results in 
this section may be roughly summarized by the statement, “the more extreme points D has, the 
more extreme points F has.” 


9. Meromorphic, univalent functions. Let = denote the set of functions that are analytic and 
univalent in {z:0 < |z| < 1} and have a Laurent series normalized by 
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(25) f(z) = ; + Y a,2"(0 <|z|< 1). 


It is preferable to study the subset of 2, denoted X%,, for which ay = 0 in (25), since the functions 
f(z) - =, where f € %,, can be shown to form a compact subset of . 
The so-called area theorem asserts that if f © 2, then 


(o@) 


(26) de ala,|? <1 


n=1 


[29, p. 320]. Inequality (26) is equivalent to the fact that the area (Lebesgue measure) A of the 
complement of f[{z:0 < |z| < 1}] is nonnegative. For example, the function f(z) = : + Zz 
belongs to 2 and the complement of its range is the interval {z: —2 < x < 2}. For Xo, the 
equality |a,| = 1 holds only for ; + xz where |x| = 1. 


A function f in 2, belongs to EH, if and only if A is zero. One direction of this result is due 
to G. Springer [30]. The argument depends on showing that the set of functions g(z) = L°_,a,2” 
for which (26) holds has as its extreme points those functions for which equality holds in (26). 
This is analogous to the fact that in R” the set of extreme points of the closed unit ball consists of 
the boundary points of that ball. The other direction of the result about EHX, was proved by 
D. H. Hamilton in [15]. The argument depends on a result of Nguyen X. Uy [32], which gives the 
construction of a suitable non-constant analytic function defined in the complement of a set of 
positive measure. 


10. Support points. Suppose that ¥ is a compact subset of w and f € ¥. Then f is called a 
support point of F if there is a continuous linear functional L such that Re L is not constant on 
FA and 


(27) ReJ(f) = max{ReJ(g):geEF}. 


In other words, support points are the solutions of nontrivial linear problems over #. The notion 
of a support point applied to a set in R” means that certain boundary points of the set are 
contained in a supporting plane to the set. A general problem, of considerable research interest in 
the last several years, is that of finding the set of support points of a given subset of &. The set of 
support points of A and the set EHF are related, and a particular connection was indicated in 
the proof of (16). 

Next, some results about the support points of the families discussed earlier are mentioned. A 
summary of facts about the support points of S is contained in [9, pp. 280ff.]. 

The set of support points of # consists of all functions of the form 


(28) p(z) = yy “Tox, 2’ 


where |x,| = 1, t, > 0 and X7_,t, = 1 (see [6, 13]). This result has the implication that the set of 
support points of # is dense in #, a quite different phenomenon than occurs in a finite 
dimensional setting. A similar result holds for families having ranges in a strip and in a convex 
angular wedge. A general setting for results of this type concerns families of analytic functions 
defined by f(A) C D and f(0) = wo, where D is a convex domain. If we let F denote a 
one-to-one analytic mapping of A onto the convex domain D with F(0) = wo, then the support 
points of F are characterized by f = F°¢, where ¢ is a finite Blaschke product and $(0) = 0 
[13]. In particular, the set of support points of the family of functions analytic in A and satisfying 
|f(z)| < 1 are the finite Blaschke products [6]. 

A simple situation occurs for S*, K and the family of functions f in # satisfying f(z) # 0, 


406 T. H. MacGREGOR 


|arg f(z)| < as and f(0)=1, witha >1.Ineach > the set of support points is the same as 
EHF [5], [13]. 
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TOWERS OF HANOI AND ANALYSIS OF ALGORITHMS 


PAUL CULL AND E. F. ECKLUND, JR. 
Department of Computer Science, Oregon State University, Corvallis, OR 97331 


1. Introduction. Mathematics has applications everywhere, but perhaps nowhere more clearly 
than in computer science, and within computer science perhaps nowhere more clearly than in 
analysis of algorithms. It would be difficult to draw a line and say that this part of analysis of 
algorithms is mathematics and that part is computer science. 

In this paper we want to give a feeling for analysis of algorithms by investigating, in some 
detail, algorithms for the Towers of Hanoi problem. But first we should say a few general words 
about the field of analysis of algorithms. As a start one should make a bow to Donald Knuth. His 
“Art of Computer Programming” [5] is the standard work in the field. In the last few years, 
analysis of algorithms courses have become a part of many undergraduate programs in computer 
science. Knuth is still the standard reference, but more widely used textbooks are Aho, Hopcroft 
and Ullman (AHV) [1], Baase [2], and Horowitz and Sahni [4]. 

The task of analysis of algorithms is three-fold: 

(1) To produce provably correct algorithms, that is, algorithms which not only solve the 
problem they are designed to solve, but which also can be demonstrated to solve the problem; 

(2) To compare algorithms for a problem with respect to various measures of resources (e.g., 
time and space), so that we can say when one algorithm is better than another; 

(3) To find, if possible, the best algorithm for a problem with respect to a particular measure 
of resource usage. This involves proving “lower bounds’, that is, showing that every algorithm 
which solves the problem must use at least so much of a particular resource. To establish a best 
algorithm one must have both a proof of a lower bound and an algorithm which uses no more 
than this lower bound. 

Here a distinction should be made between bounds for an algorithm and bounds for a problem. 
If one establishes an upper bound on a particular resource used by an algorithm for a problem, 
then one has an upper bound both for the algorithm and for the problem. If one establishes a 
lower bound for a problem, then one also has a lower bound on all algorithms which solve this 
problem. But demonstrating a lower bound for one algorithm for a problem does not establish a 
lower bound for the problem. 

In this paper we will exemplify the three-fold task of analysis of algorithms using the Towers of 
Hanoi problem. This problem is often used as an example of a problem which can be neatly 
solved by a recursive algorithm, as an example of a problem which requires exponential time for 
its solution [5], and as an example of problem solving strategies [6]. In the Towers of Hanoi 
problem one is given three towers, usually called A, B and C, and a disks of different sizes. 
Initially the disks are stacked on tower A in order of size (disk n, the largest, on the bottom; disk 
1, the smallest, on the top). The problem is to move the stack of disks to tower C, moving the 
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disks one at a time in such a way that a disk is never stacked on top of a smaller disk. An extra 
constraint is that the sequence of moves should be as short as possible. An algorithm solves the 
Towers of Hanoi problem if, when the algorithm is given as input n the number of disks, and the 
names of the towers, then the algorithm produces the shortest sequence of moves which conforms 
to the above rules. 

In this paper we will investigate a variety of algorithms which solve the Towers of Hanoi 
problem. We will prove the correctness of each algorithm, calculate the time and space used by 
each algorithm to allow a comparison among them, and prove the lower bounds on time and 
space required by any algorithm which solves the problem. We will show that the final algorithm 
which attains these bounds is the best possible for these measures. 


2. Comparing algorithms. For any (solvable) problem there will be an infinity of algorithms 
which solve the problem. How do we decide which is the “best” algorithm? There are a number of 
possible ways to compare algorithms. We will concentrate on two measures: time and space. We 
would like to say that one algorithm is faster, uses less time, than another algorithm if when we 
run the two algorithms on a computer the faster one will finish first. Unfortunately, to make this a 
fair test we would have to keep a number of conditions constant. For example, we would have to 
code the two algorithms in the same programming language, compile the two programs using the 
same compiler, and run the two programs under the same operating system on the same computer, 
and have no interference with either program while it is running. Even if we could practically 
satisfy all these conditions, we might be chagrined to find that algorithm A is faster under 
conditions C, but that algorithm B is faster under conditions D. 

To avoid this unhappy situation we will only calculate time to order. We let n be some measure 
of the size of the problem, and give the running time as a function of n. For example, in the 
Towers of Hanoi we will use n for the number of disks. We do not distinguish running times of 
the same order. For our purposes two functions of n, f(n) and g(n), have the same order if for 
some WN there are two positive constants C, and C, so that 


Clg(n)| <|f(a) |< Glg(n)| forall n> N. 


We symbolize this relation by f(n) = @(g(n)), read f(n) is order g(n). Thus we will consider two 
algorithms to take the same time if their running times have the same order. In particular, we do 
not distinguish between algorithms whose running times are constant multiples of one another. 

If we find that algorithm A has a time order which is strictly less than algorithm B, then we can 
be confident that for any large enough problem algorithm A will run faster than algorithm B, 
regardless of the actual conditions. On the other hand if algorithms A and B have the same time 
order, then we will not predict which one will be faster under a given set of actual conditions. 

The space used by an algorithm is the number of bits the algorithm uses to store and 
manipulate data. We expect the space to be an increasing function of n, the size of the problem. 
This space measurement ignores the number of bits used to specify the algorithm, which has a 
fixed constant size independent of the size of the problem. Since we have chosen bits as our unit, 
we can be more exact about space than we can be about time. We can distinguish an algorithm 
which uses 3” bits from an algorithm which uses 27 bits. But we will not distinguish an algorithm 
which uses 3” + 7 bits from an algorithm which uses 3n + 1 bits, because we can hide a constant 
number of bits within the algorithm itself. 

So we will say that we have the “best” algorithm for a problem if we can show that the 
algorithm has minimal time order, and uses minimal space to within an additive constant. 

It is not clear that such a best algorithm must exist. In some problems there is a time-space 
trade-off; a faster algorithm requires more space. We will demonstrate that this sort of trade-off 
does not exist in the Towers of Hanoi problem by eventually presenting an algorithm which 
achieves simultaneously minimal time and minimal space. 


3. A recursive algorithm. The road to a best algorithm starts with some algorithm which one 
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then attempts to improve. One often uses some sort of strategy to create an algorithm. A very 
useful strategy is to look at the problem and see if the solution can be expressed in terms of the 
solutions of several problems of the same kind, but of smaller size. This strategy is usually called 
divide-and-conquer. If the problem yields to the divide-and-conquer approach, one can construct 
a recursive algorithm which solves the problem. This construction also gives almost immediately 
an inductive proof that the algorithm is correct. Time and space analyses of a divide-and-conquer 
algorithm are often straightforward, since the algorithm directly gives difference equations for 
time and space usage. 

While these divide-and-conquer algorithms have many nice properties, they may not use 
minimal time and space. They may, however, serve as a starting point for constructing more 
efficient algorithms. 

Consideration of the Towers of Hanoi problem leads to the key observation that moving the 
largest disk requires that all of the other disks are out of the way. Hence the n — 1 smaller disks 
should be moved to tower B, but this is just another Towers of Hanoi problem with fewer disks. 
After the largest disk has been moved the n — 1 smaller disks can be moved from B to C; again 
this is a smaller Towers of Hanoi problem. These observations lead to the following recursive 
algorithm [6], [7], [9]: 


PROCEDURE HANOI (A,B,C,n) 
IF n=] THEN move the top disk from tower A to tower C 
ELSE HANOI (A,C,B,n-1) 
move the top disk from tower A to tower C 


HANOI (B,A,C,n-1). 


Is this the best algorithm for the problem? We will show that this algorithm has minimum time 
complexity, but does not have minimum space complexity. First, though we prove that the 
algorithm correctly solves the problem, the first task of analysis of algorithms as outlined in the 
Introduction. 


PROPOSITION 1. The recursive algorithm HANOI correctly solves the Towers of Hanoi problem. 


Proof. Clearly the algorithm gives the correct minimal sequence of moves for 1 disk. If there is 
more than one disk, the algorithm moves n — 1 disks to tower B, then moves the largest disk to 
tower C, and then moves the n — 1 disks from tower B to tower C. This is precisely what is 
required in a minimum move algorithm because according to the rules the largest disk can only be 
moved when all the other n — 1 disks are on a single tower. So the n — 1 disks must be moved 
from tower A to some other tower. Clearly at least one move is required to move the largest disk 
from tower A to tower C. When the largest disk is moved to tower C, the other n — 1 disks are on 
a single tower and still have to be moved to tower C. By inductively assuming n — 1 disks are 
moved in the minimum number of moves, we see that the algorithm for n disks makes no more 
than the minimal number of moves and finishes with all the n-disks moved from tower A to tower 
C. a 


Here we should remark that we have not only produced a provably correct algorithm for the 
problem; we have also shown that the minimal sequence of moves is unique. This uniqueness 
makes the proof of correctness easy. The proof would be more complicated if more than one 
minimum sequence were possible. 

We would like to calculate the running time of HANOI, but we don’t know how long various 
operations will take. How long will it take to move a disk? How long will it take to subtract 1 
from n? How long will it take to test if n = 1? How long will it take to issue a procedure call? 
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Because we only wish to calculate time to order we don’t have to answer these questions exactly, 
but we do have to make a distinction between operations which take a constant amount of time, 
independent of n, and operations whose running time depends on n. 

One possibility is to assume that each operation takes constant time independent of n. AHU 
[1] calls this assumption the uniform cost criterion. With this uniform cost assumption and letting 
T(n) be the running time for n disks we have the difference equation 


T(n) =2T(n-1) +c, 


because there are 2 calls to the same procedure with n — 1 disks and c is the sum of the constant 
running times for the various operations. Letting 7(1) be the running time of the algorithm for 1 
disk, we find 


T(n) =(T(1) + ¢)2”"' - , 
which can be verified by direct substitution. This gives 
T(n) = 0(2") 
since 


7A) y <T(n) < [TO +e) 


Another possibility is to assume that some of the operations have running times which are a 
function of n. But which function of n should we use? Each of the numbers in the algorithm is 
between 1 and n, and the disks can also be represented by numbers between 1 and 2. Since such 
numbers can be represented using about logn bits, it seems reasonable to assume that each 
operation which manipulates numbers or disks has a running time which 1s a constant times log n. 
AHU [1] calls this the logarithmic cost criterion and suggests sing it when the numbers used by 
an algorithm do not have fixed bounds. Using the logarithmic cost criterion we have the difference 
equation 


T(n) =2T(n—- 1) 4+ clogn 
for the running time of the algorithm. This difference equation has the solution 


T(n) = 2” mY) 


which can be verified by substitution. Since the summation in this solution converges, as one can 
demonstrate by the ratio test, and assuming that the constants are positive, we have 


T(n) = 6(2"). 
Since both cost criteria give the same running time, we conclude: 


PROPOSITION 2. The algorithm HANOI has running time 0(2"). 


Although we have established the running time for a particular algorithm which solves the 
Towers of Hanoi problem, we have not yet established the time complexity of the problem. We 
need to establish a lower bound so that every algorithm which solves the problem must have 
running time greater than or equal to the lower bound. We establish 6(2”) as the lower bound in 
the proof of the following proposition. 


PROPOSITION 3. The Towers of Hanoi problem has time complexity 6(2"). 


Proof. Following the proof of Proposition 1, a straightforward induction shows that the 
minimal number of moves needed to solve the Towers of Hanoi problem is 2” — 1. Since each 
move requires at least constant time, we have established the lower bound on time complexity. 

An upper bound for the time complexity of the problem comes from Proposition 2. Since the 
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upper bound and lower bound are equal to order, we have established the 6(2”) time complexity 
of the problem. & 


Now that we know HANOI’s time complexity we would like to consider its space complexity. 
First we will establish a lower bound on space which follows from the lower bound on time. 


PROPOSITION 4. Any algorithm which solves the Towers of Hanoi problem must use at least 
n + constant bits of storage. 


Proof. Since the algorithm must produce 2” — 1 moves to solve the problem, the algorithm 
must be able to distinguish 2” different situations. If the algorithm did not distinguish this many 
situations, then the algorithm would halt in the same number of moves after each of the two 
nondistinguished situations, which would result in an error in at least one of the cases. 

The number of situations distinguished by an algorithm is equal to the number of storage 
situations times the number of internal situations within the algorithm. Since the algorithm has a 
fixed finite size, it can have only a constant number of different internal situations. The number of 
storage situations (states) is 2 to the number of storage bits. Thus C- 22175 > 2”, and so 
BITS > n — logC =n + constant. H 


In order to discuss the space complexity of the recursive algorithm, let us now consider the data 
structure used. Two possible data structures are the array and the stack. An array is a set of 
locations indexed by a set of consecutive integers so that the information stored at a location in 
the array can be referenced by indicating the integer which indexes the location. For example, the 
information at location I in the array ARRAY would be referenced by ARRAY[]]. A stack is a 
linearly ordered set of locations in which information can be inserted or deleted only at the 
beginning of the stack. 

The towers could each be represented by an array with n locations, and each location would 
need at most log n bits. So an array data structure with 6(n log n) bits would suffice. Alternately, 
each tower could be represented by a stack. Each stack location would need log n bits, so again 
this is a 6(n log n) bit structure. Actually a savings would be made. Since only n disks have to be 
represented, the stack structure needs only n locations versus the 3n locations used by the array 
structure. Another possible structure is an array in which the ith element holds the name of the 
tower on which the ith disk is located. This structure uses only @(n) bits. Yet another possibility 
is to not represent the towers, but to output the moves in the form FROM __TO__. Thus we 
could use no storage for the towers. 

The recursive algorithm still requires space for its recursive stack. When a recursive algorithm 
calls itself, the parameters for this new call will take the places of the previous parameters, so these 
previous parameters are placed on a stack from which they can be recalled when the new call is 
completed. Also placed on the stack is the return address, the position in the algorithm at which 
execution of the old call should be resumed. All of this information, the parameters and the return 
address, for a single call are referred to as a stack frame. At most n stack frames will be active at 
any time and each frame will use a constant number of bits for the names of the towers and log n 
bits for the number of disks. So the recursive algorithm will use 9(n log n) bits whether or not the 
towers are actually represented. We summarize these considerations by the following proposition. 


PROPOSITION 5. The recursive algorithm HANOI correctly solves the Towers of Hanoi problem 
and uses 6(2”) time and 9(nlogn) space. 


The recursive algorithm uses more than minimal space. We are faced with several possibilities: 


(1) Minimal space is only a lower bound and is not attainable by any algorithm. 

(2) Minimal space can only be achieved by an algorithm which uses more than minimal time. 

(3) Some other algorithm attains both minimal time and minimal space. By developing a series 
of iterative algorithms, we will arrive at an algorithm which uses both minimal time and minimal 
space. 
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4. Some iterative algorithms. As a first step in obtaining a better algorithm, we will con- 
sider an iterative algorithm which simulates the recursive algorithm for n > 2. This algorithm 
RECURSIVE SIM is similar to an algorithm given by Tenenbaum and Augenstein [7], but we 
have chosen to explicitly keep track of the stack counter because this will aid us in finding an 
algorithm using even less space. 


PROCEDURE RECURSIVE SIM (A,B,C,n) 
I:= 1 
LI1L1J:= A; L2L1]:= Cc; L3L1J:= B 
NUML1]:= n-1 ; PAR[1J:= 1 ; PARLOJ]:= 1 
WHILE I > 1 DO 
IF NUMLI] > 1 


LIC1] 


THEN L1LI+1]: 
L2[I+1]:= L3CI] 
L3LI+1J:= L2ci] 
NUMLI+1]:= NUMLIJ - 1 
PARLI+1]:= 1 
I:= [+l 
ELSE MOVE FROM LILI] To L3[T1J 
WHILE PARLI] = 2 DO 
I:= [-l 
IF I > 1 THEN MOVE FROM L1[I] To L2[T] 
PARLI]:= 2 
TEMP:= LI[T] 
LILIJ]:= L3lt] 
L3CI]:= L2tr] 


L2L1I]:= TEMP 


The names of the towers are stored in the three arrays L1,L2,L3; the number of disks in a 
recursive call is stored in NUM; and the value of PAR indicates whether a call is the first or 
second of a pair of recursive calls. 

RECURSIVE SIM sets up the parameters for the call HANOI (A, C, B, n — 1). When the last 
move for this call is made, the arrays will contain the parameters for calls with 1 through n — 2 
disks, where each of these calls will have PAR = 2. The arrays will still contain the parameters for 
the (A, C,B, n — 1) call with PAR = 1. The inner WHILE loop will pop each of the calls with 
PAR = 2, leaving the array counter pointing at the (A, C,B,n — 1) call. Since I will be 1 at this 
point the IF condition is satisfied and the MOVE FROM L1{I] TO L2[I] accomplishes the MOVE 
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FROM A TO C of the recursive algorithm HANOI. The following assignment statements set up 
the call (B, A, C, — 1) with PAR = 2. So when the moves for this call are completed all of the 
calls in the array will have PAR = 2, and the inner WHILE loop will pop all of these calls setting 
I to 0. Then the IF condition will be false, so no operations are carried out, and the outer WHILE 
condition will be false so the algorithm will terminate. 


PROPOSITION 6. The RECURSIVE SIM algorithm correctly solves the Towers of Hanoi problem, 
and uses 9(2”) time and 6(nlogn) space. 


Proof. Correctness follows since this algorithm simulates the recursive algorithm which we 
have proved correct. The major space usage is in the arrays. Since each time I is incremented the 
corresponding NUM([I] is decremented, and since NUM{[I] never falls below 1, there are at most 
n — 1 locations ever used in an array. The four arrays L1, L2, L3, and PAR use only a constant 
amount of space for each element, but NUM must store a number as large as n — 1 so it uses 
6 (log n) bits for an element. Thus the arrays use 6(n log n) bits. 

Now we have to argue about time usage. Most of the operations deal with constant-sized 
operands so these operations will take constant time. The exceptional operations are incrementing, 
decrementing, assigning, and comparing numbers which may have @(logn) bits. A difference 
equation for the time is 


T(n) = 2T(n— 1) + Clogn, 


where 7(n) is the time to solve a problem with n disks and C log n is the time for manipulating 
the numbers with 6(log n) bits. As in the proof of Proposition 1 we have T(n) = 0(2”). & 


Notice that this algorithm does not improve on the recursive algorithm, but study of this form 
can lead to a saving of space. Storing the array NUM causes the use of 0(n log n) space. If we did 
not have to store NUM, the algorithm would use only @(7) space. Do we need to save NUM? 
NUM is used as a control variable, so it seems necessary. But if we look at NUM[1] + 1, we get n. 
When NUM{[I + 1] is set, it is set equal to NUM{[I]] — 1, but then 


NUM[I + 1] +1+1=NUM[I]-1+141 
= NUM[I] +1=n. 


Thus the information we need about NUM is stored in I and n. So if we replace the test on 
NUMI[]I] = 1 with a test on I = n — 1, we can dispense with storing NUM and improve the space 
complexity from #(nlogn) to 8(n). This replacement does not increase the time complexity of 
any step in the algorithm, so the time complexity remains 4(2”). 


Our new procedure is 


PROCEDURE NEW SIM (A,B,C,n) 
I: = 1 
LI{1ij:= A ; L2L1J:= Cc ; L3liJd:= B 
PARL1J:= 1 ; PARLOJ:= 1 
WHILE I > 1 DO 
IF I # n-1 
THEN LILI+1]:= L1l1] 
L2CIt1]:= L3lT] 


L3CIt¢1J:= L2eT] 
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PAK, ij:= 1 
T:= [+] 
2 MOVE FROM LILI] To L3[T1] 
WHILE PAR[IJ = 2 DO 
I:= I-] 
IF I > 1 THEN MOVE FROM LILI] To L2[T] 
PARLI]:= 2 
TEMP:= L1[1] 
Li{Tl:= L3CTJ 


L3[IJ:= L2€1] 


L2{C1J:= TEMP 


From the above observation we have: 


PROPOSITION 7. NEW SIM correctly solves the Towers of Hanoi problem and uses 6(2") time 
and @(n) space. 


Although we have reached @(n) space, we would like to decrease the space even further, 
hopefully to n + constant bits. If we look at the array PAR, we find that the algorithm scans PAR 
to find the first element not equal to 2, replaces that element by 2 and then replaces all the 
previous 2’s by 1’s. This is analogous to the familiar operation of adding 1 to a binary number, in 
which we find the first 0, replace it by a 1, and replace all the previous 1’s by 0’s. So it seems that 
we can replace the array PAR by a simple counter. The number of bits in the counter will, of 
course, depend on n. 

So far this has not resulted in any saving of space. Will there be enough information in the 
counter to determine from which tower we should move a disk? The affirmative answer will 
enable us to achieve a minimal space algorithm. To motivate the design of our minimal space 
algorithm, we will examine the sequence of 31 moves needed to solve the problem with 5 disks. 
This sequence is shown in Table 1. 

Every other move in the solution involves moving disk 1. So if we know which tower contains 
disk 1, we would know from which tower to move, in alternate moves, but we might not know 
which tower to move to. When we consider the three towers to be arranged in a circle, we see from 
Table 1 that disk 1 always moves in a counterclockwise direction when we have an odd number of 
disks. Similarly disk 1 always moves in a clockwise direction when we have an even number of 
disks. Thus, by keeping track of the tower which contains disk 1, and whether n is odd or even, we 
would know how to make every other move. 

For the moves which do not involve disk 1, we know that the move involves the two towers 
which do not contain disk 1. Looking again at Table 1, we see that the odd numbered disks always 
move in the same direction as disk 1 and the even numbered disks always move in the opposite 
direction. So knowing the towers involved and whether the disk to be moved is odd or even would 
allow us to decide which way to move. 

Can we determine from a counter whether the disk being moved is odd or even? If we look at 
the COUNT column of Table 1, we see that the position of the rightmost 0 tells us the number of 
the disk to be moved. Thus a single counter with n bits is sufficient to solve the Towers of Hanoi 
problem. 
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DECIMAL 
TOWER 0 TOWER 1 TOWER 2 COUNT COUNT DISK FROM TO 
12345 - - 0 00000 1 0 2 
2345 - 1 1 00001 2 0 1 
345 1 2 00010 1 2 1 
345 12 - 3 00011 3 0 2 
45 12 3 4 00100 1 1 0 
145 2 3 5 00101 2 1 2 
145 - 23 6 00110 1 0 2 
45 - 123 7 00111 4 0 1 
be) 4 123 8 01000 1 2 1 
be) 14 23 9 01001 2 2 0 
25 14 3 10 01010 1 1 0 
125 4 3 11 01011 3 2 1 
125 34 - 12 01100 1 0 2 
25 34 1 13 01101 2 0 1 
be) 234 1 14 01110 1 2 1 
be) 1234 - 15 01111 5 0 2 
- 1234 be) 16 10000 1 1 0 
1 234 be) 17 10001 2 1 2 
1 34 25 18 10010 1 0 2 
- 34 125 19 10011 3 1 0 
3 4 125 20 10100 1 2 1 
3 14 25 21 10101 2 2 0 
23 14 be) 22 10110 1 1 0 
123 4 be) 23 10111 4 1 2 
123 - 45 24 11000 1 0 2 
23 - 145 25 11001 2 0 1 
3 2 145 26 11010 1 2 1 
3 12 45 27 11011 3 0 2 
- 12 345 28 11100 1 1 0 
1 2 345 29 11101 2 1 2 
1 2345 50 11110 1 0 2 
- - 12345 31 11111 


TABLE 1. Towers of Hanoi Solution for 5 disks. 


We use these facts to construct the algorithm which follows. 


PROCEDURE TOWERS (n) 
T:= 0 (*TOWER NUMBER COMPUTED MODULO 3*) 
COUNT:= 0 (*COUNT HAS n BITS*) 
1 if n is even 
1 if n is odd 
WHILE TRUE DO 


MOVE DISK 1 FROM T TO T+P 
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T:= T+P 
COUNT:= COUNT + 1 
IF COUNT = ALL 1's THEN RETURN 
IF RIGHTMOST 0 IN COUNT IS IN EVEN POSITION 
THEN MOVE DISK FROM T-P TO T+P 
ELSE MOVE DISK FROM T+P TO T-P 
COUNT:= COUNT + 1 


ENDWHILE 


n -.. 2 1 
COUNT | 0 ... O 0 | 
A picture of the storage used for 
COUNT. 


Notice that it has n bits, and that 
we have called the rightmost bit 
position 1. The positions from right 
to left are then odd, even, odd, 
even.... 


REMARKS. We can still improve this algorithm by removing the first COUNT: = COUNT + 1 
statement and deleting the mghtmost bit of COUNT. This would also require changing the 
numbering of the bits in COUNT so that the rightmost bit is bit 0. An algorithm similar to our 
TOWERS has recently been published by T. R. Walsh [8]. 


We have to show that TOWERS correctly solves the Towers of Hanoi problem. We do this by 
proving that a certain sequence of moves has been accomplished when COUNT contains a 
number of the form 2* — 1, so that when k = n, the sequence of moves for HANOI (A, B, C, 7) 
has been completed and the procedure will terminate since COUNT contains all 1’s. 


PROPOSITION 8. When COUNT = 2* — 1, that is COUNT = | 00...01...1]| with k 1’s, then 


ifk # n (MOD 2), the correct moves for HANOI (A, C,B,k) have been completed 
and T contains 1 ( which represents B), 
if k =n (MOD 2), the correct moves for HANOI (A,B,C, k) have been completed 


and T contains 2 ( which represents C). 


Proof. If k = 1, the single move T to T + P has been completed, which is A to C if n is odd, 
and is A to B if n is even, and T contains T + P, which is 2 if n is odd, and is 1 if n is even. This 
agrees with our claim. 

Notice that COUNT can only take on the value 2“ — 1 immediately before the IF... RE- 
TURN statement. Assume the moves for either HANOI (A, B, C, k), or HANOI (A, C, B, k) have 
been completed. If k # n(MOD 2), the next move will be A to C, since by assumption T now 
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contains 1; if k is odd, the move is T — P to T + P, which is 1 — (1) to1 + 1, which represents 
A to C, and if k is even, the move is T + P to T — P, which is 1 + (—1) to 1 — (—1), which 
represents A to C. If k = n(MOD 2), the next move will be A to B, since by assumption T now 
contains 2; if k is odd, the move is JT — P to T+ P, which is 2 — (—1) to 2 + (—1), which 
represents A to B, and if k is even, the move is T + P to T — P, which is 2 + 1 to 2 — 1, which 
represents A to B. 


Next COUNT will be incremented to | 0...010...0], 1e., & trailing 0’s. When COUNT = 


2**? — 1, the algorithm will have repeated the same sequence of moves as before, since it only 


“sees” the rightmost information in COUNT, with the difference that T will have started with a 
different value. The different starting value of T will result in a cyclic permutation of the labels. If 
k # n(MOD 2), then the completed moves will be 


HANOI (A,C,B, x) 
AtoC 
HANOI (B,A,C, k) 


giving HANOI (A,B,C, & + 1) with k + 1 =n(MOD 2), and T will contain 2 (ie., 1 + 1). If 
= n (MOD 2), then the completed moves will be 


HANOI(A,B,C, k) 
AtoB 
HANOI (C,A,B, k) 


giving HANOI (A, C,B,& + 1) with k + 1 # n(MOD 2), and T will contain 1 (i.e., 2 + 2). @ 
PROPOSITION 9. The algorithm TOWERS uses 0(2”) time and n + constant bits of space. 


Proof. For space usage, there are n bits in COUNT, and a constant number of bits are used 
for T and P. 

For time, the initialization takes 9(n) and the WHILE loop is iterated 2” — 1 times. If each 
iteration took a constant amount of time, we would have 6(2”), but the test and increment 
instruction on count could take time @(7) giving 9(n2”). So we have to show that only 0(2”) time 
is used. 

If the value in COUNT is even, then incrementing and testing will only require looking at one 
bit. If the value in COUNT is odd, and (COUNT — 1)/2 is even, then the algorithm only looks at 
2 bits. In fact, the algorithm will look at k bits in COUNT in 2”~* cases. Thus the time used will 
be 6(07_,k - 2"-*) = 6(2”), since L%_,k - 27“ converges. Hl 

We summarize these results in the following theorem. 


THEOREM. Any algorithm which solves the Towers of Hanoi problem for n disks must use at least 
6(2"”) time and n+ constant bits of storage. The algorithm TOWERS solves the problem and 
simultaneously uses minimum time and minimum space. 


5. Summary and conclusion. The goal of best algorithm has been attained. To attain the goal 
we Started by analyzing the problem in a divide-and-conquer fashion and deriving from this 
analysis a recursive algorithm which we could prove solved the problem. Next we analyzed the 
time used by this recursive algorithm and argued that to order this time was best possible since 
any solution of the problem requires 2” — 1 moves. 

From the lower bound on time we derived a lower bound of n bits on the space used by any 
algorithm which solves the problem. A space analysis of the recursive algorithm showed that it 
used more space than our lower bound, and that the space usage was required for the recursive 
stack. 

To decrease the space usage we built an iterative algorithm which directly simulated the 
recursive algorithm. Since this was a direct simulation, it used the same amount of space, but we 
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now could investigate whether all the information being stored was necessary. We found that 
storing the number of disks on each simulated recursive call was unnecessary. This led to a new 
iterative algorithm which used only 9(n) space. . 

We then noticed that one of the arrays was functioning as a counter, but replacing it by a 
counter did not decrease the space usage. Next we investigated whether there was sufficient 
information in the counter to tell us which disk to move and where to move it. We found that we 
could tell which disk to move, but where to move the disk depended on whether the total number 
of disks was odd or even. 

When we added a variable to keep track of the parity of the number of disks and another 
variable to keep track of the tower which contained the smallest disk, we found that we had 
sufficient information to solve the problem, and that we could dispense with the arrays which kept 
track of the tower names for each simulated recursive call. 

At this point we had an algorithm which used n + constant number of bits, which was equal to 
our lower bound on space. We then showed that the algorithm still used minimal time order, and 
hence we had obtained a best algorithm. 

We may remark that there could be other quite different looking algorithms which solve the 
problem and use minimal time and space. What we have tried to exemplify is a design 
methodology which is frequently used in deriving good algorithms. We have chosen the Towers of 
Hanoi problem because for this problem we could arrive at the goal of best algorithm. For other 
problems the process may get stuck. We might find a provable algorithm and lower bounds for the 
problem, but find that there is a gap between the running time of the algorithm and the lower 
bound, or find a gap between the space usage of the algorithm and the lower bound. Often the 
next crucial insight, like a disk always moves clockwise or counterclockwise, might not be 
discovered for many years after an algorithm is created. Alternately a discovered algorithm may 
be a best algorithm, but an insight is needed to raise the lower bounds for the problem. 

In any case, we hope that we have given the reader some feel for analysis of algorithms. 


6. Exercises. To see that you have understood a technique, it is useful to try to use the 
technique on similar problems. We give here two more algorithms for the Towers of Hanoi 
problem. Your task, if you decide to accept it, is to show that these algorithms do in fact solve the 
problem (1.e., prove that they are correct) and to determine the time and space usage of these 
algorithms. 


Exercise l: 


PROCEDURE HANOI ITERATIVE (A,B,C,n) 


A TO B 


IF n mod 2 = 0 THEN MOVE[1]: 


ELSE MOVE[1]: 


A TOC 
K:= 1 


WHILE n > 1 DO 


n:= n=-1; K:= 2*K 
IF n mod 2 = 0 THEN MOVE[K J:= A TO B 
Ll:= C3; L2:= A; L3:= B 
ELSE MOVE[K]:= A TO C 


Ll:= B; L2:= C; L3: 


th 
> 
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CASE MOVE[I] OF 


A TO B : MOVE[K+I]:= L1 TO L2 
A TO C : MOVE[K+I]:= Ll TO L3 
B TO A : MOVE[K+I]:= L2 TO Ll 
B TO C : MOVE[K+I]:= L2 TO L3 
C TO A: MOVE[K+IJ:= L3 TO Ll 
C TO B : MOVE[K+I]:= L3 TO L2 


Hints 1. For correctness you may want to introduce a new variable and prove a statement 
which says that on each iteration of the WHILE loop the new variable increases (or if you want 
decreases) and that at the end of each iteration a Hanoi problem whose size depends on the new 
variable has been solved. You will need to give the tower names for the problem which has been 
solved. You will also need to specify the value of the new variable. 

For space, you should know that the algorithm is storing each move in the array MOVE. 

For time, you may want to consider both the uniform and the logarithmic cost measures. 


Exercise 2: (Buneman and Levy [3]) 
MOVE SMALLEST DISK ONE TOWER CLOCKWISE 
WHILE A DISK (OTHER THAN THE SMALLEST) CAN BE MOVED DO 
MOVE THAT DISK 
MOVE THE SMALLEST DISK ONE TOWER CLOCKWISE 


ENDWHI LE 


Hints 2. For correctness, you should be careful since this algorithm only solves the original 
Towers of Hanoi problem when the number of disks is even. You will probably want to introduce 
a new variable and prove a statement about the configuration of the disks when the number of 
moves completed is a specific function of your new variable. 

For time and space, the algorithm above is incomplete since it doesn’t specify the data 
structure used to determine if a disk can be moved. You might consider representing each tower 
by a stack of integers with the integers representing the disks on the tower. Alternately you might 
consider representing the information by an array DISK, so that DISK[I] contains the name of 
the tower which contains the Ith largest disk. You may also find it useful to show that the ith disk 
is moved 2”~‘ times. 
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A SHORT PROOF OF STEINHAUS’ THEOREM ON SUMMABILITY 


JEFF CONNOR 
Department of Mathematical Sciences, Kent State University, Kent, OH 44242 


In this note, we offer a soft (and short!) proof of a result of Steinhaus: a regular matrix 
summability method cannot sum all sequences of 0’s and 1’s. 

Let x and y be sequences. By a matrix summability method we mean a sequence to sequence 
transformation defined by an infinite matrix A = (a,, ,)7,—,, an array of complex numbers, and 
y = Ax when 


o@) 
y= a, .X, forallne N. 
k=1 


A is called regular if it maps convergent sequences into convergent sequences and preserves limits, 
i.e., lim, (Ax), = lim, x, if x is convergent. Regular matrices are characterized by the following: 


THEOREM (Silverman, Toeplitz) [PS]. Let A = (a, ,)? ,=1 be a matrix summability method. A 
is regular if and only if 


(1) lim » an k = 1, 

n ok=1 
(2) lima, ,=0 forallk © N, and 
(3) there is an M > 0 such that 


oe) 
> la, .1< M forallne N. 
k=1 


A sums x or x is A-summable if A maps x into a convergent sequence. 
We also use the Baire Classification Theorem, a direct ancestor of the Baire Category Theorem: 


THEOREM (Baire) [O, p. 32]. Let X be a complete metric space and (f,,)°_, a sequence of 
continuous complex-valued functions. If f(x) = lim, f,,(x) exists for all x © X, then f is continuous 
except at a set of points of first category. 


Let Q denote the sequences of 0’s and 1’s and define a metric d on Q by 


CO 
d(x,y) = L |x, —y,| 27". 
n=] 


CEHTER SECTION 
(Vol. 92, No. 6, June-July 1985) 


Telegraphic Reviews 


Edited by Lynn Arthur Steen, with the assistance of the Mathematics Departments 
of Carleton, Macalester, and St. Olaf Colleges. Books submitted for review 
should be sent to Book Review Editor, American Mathematical Monthly, St. Olaf 
College, Northfield, Minnesota 55057. 


Telegraphic reviews are designed to give prompt notice of all new books in the mathematical 
sciences. Certain of these books will be selected for more extensive review in the Reviews section 


of the Monthly. 


Special Codes: 


T: Textbook 13-18: Grade Level 

S: Supplementary Reading 1-4 : Time in Semesters 
P: Professional Reading ** : Special Emphasis 
L: Undergraduate Library 2? : Questionable 


General, S, L. The Sacred Beetle and Other Great Essays in Science. Ed: Martin Gardner. Prometheus 
Books, 1984, xv + 427 pp, $22.95. [ISBN: 0-87975-257-2] A revised and expanded hardcover reprinting 
of Gardner’s 1957 paperback collection Great Essays in Science, including four new essays by Stephen 
Jay Gould, Lewis Thomas, Carl Sagan, and Isaac Asimov--four distinguished science writers whose 


writing careers began after the original edition was published. LAS 


General, T(13: 1). The Ascent of Mathematics. Raymond Coughlin, David E. Zitarelli. McGraw-Hill, 
1984, xiii + 609 pp, $27.95. [ISBN: 0-07-013215-1] Based on the premise that "every mathematical 
technique was created by someone to solve a human problem." Text presents some traditional 
mathematics (elementary number theory, elementary probability) introduced by discussion of their 
cultural and/or historical settings. Flexible format. Many exercises. JRG 


Education, S(15-16). Student Merit Awards. Ed: Leroy Sachs. NCTM, 1984. Middle School, v'+ 100 
pp, $6 (P) [ISBN: 0-87353~-215-5]; High School, iv + 137 pp, $9 (P). [ISBN: 0-87353-214-7] Series of 
self-contained units, each requiring 10 to 30 hours of independent work by students. Each includes 
explanations, references, requirements and a teachers” guide, and is reproducible. Mathematical 
level and requirements vary to suit varying student abilities and interests. Topics range from 
purely practical to traditional enrichment. Quality somewhat uneven but every unit workable and 
interesting. MW 


Education, S(15-16), P, L. Computers in Mathematics Education, 1984 Yearbook. Viggo P. Hansen, 
Marityn J. Zweng. NCIM, 1984, x + 244 pp, $14.50. [ISBN: 0-87353-210-4] Issues, ideas and sample 
programs for using computers in teaching mathematics K-12. Twenty-six articles discuss philosophi- 
cal issues, the computer as a teaching aid, using programming to teach mathematics, and computer 


diaguosis of student errors. MW 


Education, $(15-16). Multicultural Mathematics Posters and Activities. NCTM, 1984, $7 (P). Set of 
18 posters for junior high classrooms emphasizing interdisciplinary applications of mathematics. 
Each poster accompanied by teaching aids including reproducible worksheets. Topics include ancient 
numeration and calculation systems, strategy games, mathematics in art, Pascal’s triangle and more. 


Broad cultural and historical range. MW 


Education, T(16), S, P, L. Computers in Teaching Mathematics. Peter Kelman, et al. Addison-Wesley, 
1983, xi + 308 pp, $14.95 (P). [ISBN: 0-201-10565-9] Good basic book on the subject; should be in 
every mathematics education library. Emphasizes problem solving approach using programming and com- 
puter graphics but neglects simulation in secondary mathematics. Concrete suggestions for incor- 
porating computers into the classroom at all levels. Basic explanation of difference between com- 
puter programming and computer science. Resource list of software distributors. MW 


nr a a i ml a eee 


nd 


Relics and of Books, Instruments, and Devices for Facilitating Calculation. Napier’s life, tables, 
calculating machines, ruled papers, and mathematical models. A guide to the exhibit which cele- 
brated Napier’s contributions as well as a survey of calculating instruments invented before WWI. 
MA 


History, L**. The Genesis of the Abstract Group Concept. Hans Wussing. Transl: Abe Shenitzer. MIT 
Pr, 1984, 331 pp, $30. [ISBN: 0~-262-23109-3] Long awaited translation of a pioneering work traces 
the evolution of the abstract group concept from implicit group theoretic modes of thought in 
geometry and number theory through the concept of a group as a permutation group to the axiomatiza- 
tion of the concept. JRG 


C54 Center Section June-July 


Foundations, T(14-15: 1), S. Introduction to Elementary Mathematical Logic. Abram Aronovich Sto- 
lyar. Transl: Elliot Mendelson. Dover, 1983, vii + 209 pp, $5 (P). [ISBN: 0-486-64561-4] Emphasis 
on the propositional and predicate calculi; ome section treats the application of logic to the 
analysis and synthesis of digital systems. Exercises throughout (no solutions provided). (1970 MIT 
Press edition, TR, December 1971.) JRG 


Graph Theory, T*(17), S*, P*, L*. Graphs and Algorithms. Michel Gondran, Michel Minoux. Transl: 
Cteven Vajda. Ser. in Disc. Math. Wiley, 1984, xix + 650 pp, $64.95. [ISBN: 0-471-10374-8] A 
translation of the 1979 French text. An excellent overview of graph theory and the combinatorial 
algorithms which link the abstract results and their practical implications. Has a superb collec- 
tion of exercises. An outstanding bibliographic resource. CEC 


Discrete Mathematics, T(13-14: 2). Discrete Mathematics. Kenneth A. Ross, Charles R.B. Wright. 
Prentice-Hall, 1985, xiv + 610 pp. [ISBN: 0-13-215286-xX] A substantial introduction to finite, alge- 
braic, algorithmic, and discrete topics, from recursion and semigroups through graphs and trees to 
homomorphisms and symmetry groups. Designed for freshmen as a course comparable to calculus, it is 
gently sloped in sophistication and full of thoughtful aids (chapter highlights, notation summaries, 
Greek alphabet table, even a dictionary of mathematical jargon like "for every," "suppose," "inspec- 
tion"). Emphasizes abstract algebra more than most similar books, thereby providing a secure base 
for developing mathematical maturity. LAS 


Number Theory, T*(17: 1), S, P, L. Number Theory. W. Narkiewicz. Transl: S. Kanemitsu. World 
Scientific (US Dist: Heyden & Son), 1983, xii + 371 pp, $21 (P). [ISBN: 9971-950-26-X] A sophisti- 
cated introduction to the subject. Methods from algebra, topology, analysis and probability are 
used liberally. Includes the prime number theorem, Dirichlet”s theorem, sieve methods, geometry of 
numbers, additive number theory, probabilistic number theory and algebraic numbers. Has an ample 
supply of exercises. CEC 


Number Theory, 8(18), P. Diophantine Equations Over Function Fields. R.C. Mason. London Math. Soc. 
Lect. Note Ser., No. 96. Cambridge U Pr, 1984, x + 125 pp, $15.95 (P). [ISBN: 0-521-26983-0)] This 
volume gives actual solutions for certain Diophantine equations over function fields. In particu- 
lar, algorithms are provided for the complete resolution of equations of Thue, hyperelliptic and 
genus one type. Includes a list of references. CEC 


Number Theory, 1T(18), S, P. Lectures on Sieve Methods and Prime Number Theory. Y. Motohashi. 
Springer-Verlag, 1983, 205 pp, $7.90 (P). [ISBN: 0-387-12281-8] Lectures delivered by the author at 
the Tata Institute in 1981. Part 1 introduces the Selberg sieve, the combinatorial sieve and the 
linear sieve. Part 2 applies them to fundamental problems in number theory, namely, zero free 
regions for %$ (s) and L(s,X), zero-density theorems, and primes in intervals and arithmetic pro- 


gressions. Includes a list of references. CEC 


Linear Algebra, T**(14), S, L. Elementary Linear Algebra, Fourth Edition. Howard Anton. Wiley, 
1984, xiii + 449 pp, $25.95. [ISBN: 0-471-09890-6] The Fourth Edition of this widely-used text has 
added a new chapter on complex vector spaces, new material on projections, distances in two~ and 
three~dimensional space and a more conventional notation for transition matrices. (First Edition, 
TR, March 1973; Extended Review, March 1974; Second Edition, TR, May 1977; Third Edition, TR, 
October 1981.) CEC 


Functional Analysis, T(18: 1), S, P. Non~Archimedean Analysis: A Systematic Approach to Rigid Ana- 
lytic Geometry. S. Bosch, U. Guntzer, R. Remmert. Grund. der math. Wissenschaften, B. 261. 
Springer-Verlag, 1984, xii + 436 pp, $59. [ISBN: 0-387-12546-9] A systematic development of affinoid 
and rigid analytic geometry from first principles. Includes an introduction to ultrametric analysis 
and non~Archimedean valuations. No exercises. Extensive bibliography. CEC 


Optimization, T(16-17: 1, 2), L. Introductory Optimization Dynamics: Optimal Control with Economics 
and Management Science Applications. Pierre N.V. Tu. Springer-Verlag, 1984, xiii + 387 pp, $14.50 
(P). (ISBN: 0-387-13305-4] Deterministic control theory approached primarily from the point of view 
of the calculus of variations; assumes linear algebra and calculus. Many examples and applications. 
Extensive bibliography. No exercises. JRG 


Probability, T(18: 1), P. Asymptotic Methods in Queuing Theory. A.A. Borovkov. Transi: D. Newton. 
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Minnesota 

Southern California, John W. McGhee, Jr., Cali- 
fornia State University, Northridge 

Texas, James C. Bradford, Abilene Christian 
University 


Sectional Governors (July 1, 1984-June 30, 1987): 


Allegheny Mountain, Franz X. Hiergeist, West Vir- 
ginia University 

Indiana, Rodney T. Hood, Franklin College 

Kentucky, Joe K. Smith, Northern Kentucky Univer- 
sity 

Metropolitan New York, John A. Thorpe, 950 25th 
St. NW, Apt. 112N, Washington, DC 

Nebraska, David L. Skoug, University of Nebraska, 
Lincoln 

Northern California, Roy W. Ryden, Humboldt State 
University 

Oklahoma~Arkansas, Robert C. Eslinger, Hendrix 
College 

Rocky Mountain, Gary W. Grefsrud, Fort Lewis Col- 
lege 

Wisconsin, Paul J. Campbell, Beloit College 


Sectional Governors (July 1, 1985-June 30, 1988): 


Kansas, Frank S. Brenneman, Tabor College 

Missouri, Harold W. Hager, Southeast Missouri 
State University 

New Jersey, Susan G. Marchand, Kean College of 
New Jersey 

Northeastern, James E. Ward III, Bowdoin College 

Ohio, Andrew Sterrett, Jr., Denison University 

Pacific Northwest, D. George McRae, University of 
Montana, Missoula 

Seaway, Jack E. Graver, Syracuse University 

Southeastern, John W. Kenelly, Clemson University 

Southwestern, David R. Arterburn, New Mexico 
Institute of Mining & Tech 


MAA Section Officers 
Allegheny Mountain Section: 


Chairman: Barbara T. Faires, Westminster College 


1985 


First Vice-Chairman: W. E. Deskins, University of 
Pittsburgh 

Second Vice-Chairman: James J. Reynolds, Penn St 
U-Beaver Campus 

Coord. of Student Programs: Ronald E. Harrell, 
Allegheny College 

AHSME Reg. Coord. W PA: Nicholas Ford, Penn St 
U-Fayette Campus 

AHSME Reg. Coord. WV: I. D. Peters, Univ. of West 
Virginia 

Newsletter Editor: Richard Reese, Edinboro Univ. 
of PA 

Public Info. Officer: Kathleen Taylor, Duquesne 
University 

Secretary/Treasurer: David T. Brown, Bethany Col- 
lege 


Eastern Pennsylvania & Delaware Section: 


Chairman: David E. Flesner, Gettysburg College 

Vice-Chairman: Bernard Kolman, Drexel University 

AHSME Reg. Coord. E PA: Albert E. Filano, West 
Chester U. of PA 

AHSME Reg. Coord. DE: Willard E. Baxter, Univer- 
sity of Delaware 

Newsletter Editor: Paul Cochrane, Bloomsburg 
University 

Public Info. Officer: Paul Cochrane, Bloomsburg 
University 

Secretary/Treasurer: James 0. Brooks, Villanova 
University 


Florida Section: 


President: David L. Sherry, University of West 
Florida 

President-Elect: Elizabeth A. Magarian, Stetson 
University 

Past-President: Don R. Lichtenberg, University of 
South Florida 

Vice-President for Programs: Fredric J. Zerla, 
University of South Florida 

Vice-Pres.-Elect. Programs: Kathleen M. Timmer, 
Jacksonville University 

Vice-Pres. for Site Select.: 
inole Community College 

Vice-Pres.-Elect Site Sel.: 
Manatee Junior College 

AHSME Reg. Coord. FL: Roy Bolduc, University of 
Florida 

AHSME Reg. Coord. CZ: Walter E. Mientka, Univer- 
sity of Nebraska 

AHSME Reg. Coord. PR: Eugene A. Francis, Univer- 
sity of Puerto Rico 

Newsletter Editor: Robert Kalin, Florida State 
University 

Public Info. Officer: Robert Kalin, Florida State 
University 

Secretary/Treasurer: Ralph D. McWilliams, Florida 
State University 


Bill E. Jordan, Sem- 


Michael J. Mears, 


Illinois Section: 


Chairman: John W. Hooker, Southern Illinois 


University 

Chairman-Elect: Linda R. Sons, Northern Illinois 
University 

Past-Chairman: Ronald M. Shelton, Milliken 
University 


First Vice-Chairman: Richard J. Wilders, North 
Central College 

Second Vice-Chairman: Neale R. Fadden, Belleville 
Area College 

AHSME Reg. Coord, IL: Gary Tippett, Bradley 
University 
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Secretary/Treasurer: Howard C. Saar, Plainfield 
Public Schools 


Indiana Section: 


Chairman: Daniel P. Maki, Indiana University 

Vice-Chairman: Catherine M. Murphy, Purdue 
Univ .-Calumet 

AHSME Reg. Coord. IN: Louis J. Cote, Purdue 
University 

Newsletter Editor: G. E. Graves, Rose-Hulman 
Inst. of Tech 

Public Info. Officer: Roger Lautzenheiser, Rose- 
Hulman Inst. of Tech 

Secretary/Treasurer: Duane E. Deal, Ball State 
University 


Intermountain Section: 


Chairman: Linda C. Hill, Idaho State University 

Past Chairman: Donald W. Robinson, Brigham Young 
University 

First Vice-Chairman: James W. Cotts, So. Utah 
State College 

Second Vice-Chairman: 
State College 

AHSME Reg. Coord. UT: 
Young University 

AHSME Reg. Coord. ID: 
University 

Public Info. Officer: 
University 

Secretary/Treasurer: Lawrence 0. Cannon, Utah 
State University 


Patricia P. Henry, Weber 
Ralph V. Skarda, Brigham 
Tom Williams, Idaho State 


Linda C. Hill, Idaho State 


Towa Section: 


Chairman: Wayne L. Woodworth, Drake University 

Chairman-Elect: George N. Trytten, Luther College 

AHSME Reg. Coord. IA: Robert M. Buckingham, Luth- 
eran Mutual Life Ins. Co. 

Public Info. Officer: George N. Trytten, Luther 
College 

Secretary/Treasurer: Alan J. Heckenbach, Iowa 
State University 


Kansas Section: 


Chairperson: Richard S. Rempel, Bethel College 

Chairperson~Elect: Kenneth S. Carman, Kansas 
Wesleyan University 

Assoc. Chpsn. Comm. Coll.: Jean Moran, Donnelly 
College 

AHSME Reg. Coord. KS: Robert H. Thompson, Wash- 
burn Univ. of Topeka 

Public Info. Officer: Elaine Tatham, ETC Insti- 
tute 

Secretary/Treasurer: Robert H. Thompson, Washburn 
Univ. of Topeka 


Kentucky Section: 


Chairman: Peter E. Moore, Northern Kentucky Univ. 

Chairman-Elect: Rodger Hammons, Morehead State 
University 

Vice-Chairman: Paul E. Bland, Eastern Kentucky 
Univ. 

AHSME Reg. Coord. KY: Carrie C. Parker, Jefferson 
Comm. Coll. S.W. 

Newsletter Editor: Glenn E. Johnston, Morehead 
State University 

Public Info. Officer: Glenn E. Johnston, Morehead 
State University 

Secretary/Treasurer: Sandra A. Spears, Jefferson 
Community College 
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Louisiana & Mississippi Section: 


Chairman: Stephen A. Doblin, Univ. of Southern 
Mississippi 

Past Chairman: Duane D. Blumberg, U. of 
Southwestern Louisiana 

LA Vice-Chairman: W. T. Thompson, Northeast 
Louisiana Univ. 

MS Vice-Chairman: Rose E. Strahan, Delta State 
University 

AHSME Reg. Coord. LA: James E. Keisler, Louisiana 
State University 

AHSME Reg. Coord. MS: David E. Cook, University 
of Mississippi 

Newsletter Editor: John L. Tilley, Mississippi 
State University 

Public Info. Officer LA: W. T. Thompson, 
Northeast Louisiana Univ. 

Public Info. Officer MS: Rose E. Strahan, Delta 
State University 

Secretary/Treasurer: John L. Tilley, Mississippi 
State University 


Maryland - D.C. - Virginia Section: 


Chairman: Howard L. Penn, U.S. Naval Academy 

Chairman-Elect: Robert E. Lewand, Goucher College 

Vice-Chairman for Programs: Elizabeth J. Teles, 
Montgomery College 

Vice-Chairman for Membership: William M. Sanders, 
James Madison University 

AHSME Reg. Coord. MD-DC~VA: Edward D. Bender, J. 
Sargeant Reynolds Com Col 

Newsletter Editor: William M. Sanders, James 
Madison University 

Public Info. Officer: Caren L. Diefenderfer, Hol- 
lins College 

Secretary: Caren L. Diefenderfer, Hollins College 

Treasurer: V. R. Hancock, Emory and Henry College 


Metro. New York Section: 


Chairman: Warren Page, N Y City Technical College 

Chairman-Elect: Alan Chutsky, Queensborough Comm 
Coll 

Vice-Chairman 4-Year College: F. V. Pohle, Adel- 
phi University 

Vice-Chairman 2-Year College: Geoffrey R. Akst, 
Bor. of Manhattan Comm Coll 

Vice-Chairman High School: Sheila Krilov, Hunter 
College High School 

AHSME Reg. Coord. Met NY: Sandra M. Pulver, Pace 
University 

Newsletter Editor: Alan Hoenig, John Jay Col of 
Crim. Justice 

Public Info. Officer: Bernard Sohmer, The City 
College 

Secretary: Lily E. Christ, John Jay Col of Crim. 
Justice 

Treasurer: Robert G. Payton, Adelphi University 


Michigan Section: 


Chairman: Donald R. Lick, Western Michigan 
University 

Vice-Chairman: Michael J. Gilpin, Michigan Tech 
University 

Vice-Chairman 2-Year College: Barbara K. Near, 
Henry Ford Comm. College 

AHSME Reg. Coord. MI: James Bidwell, Central 
Michigan University 

Newsletter Editor: John 0. Kiltinen, Northern 
Michigan University 

Public Info. Officer: Thomas Elsner, General 
Motors Institute 


Center Section 


June-July 


Secretary/Treasurer: Douglas W. Nance, Central 
Michigan University 


Missouri Section: 


President: Larry A. Cammack, Central Missouri 
University 

Past President: Victor Gummersheimer, Southeast 
Missouri St Univ. 

Vice-President: Shirley Huffman, Southwest Mis- 
souri St Univ. 

Chairman of H.S. Lecturers: Robert E. Kennedy, 
Central Missouri St Univ. 

AHSME Reg. Coord. MO: Alvin R. Tinsley, Central 
Missouri St Univ. 

Newsletter Editor: William A. Ettling, Southeast 
Missouri St Univ. 

Secretary/Treasurer: David W. Bahnemann, 
Northwest Missouri St Univ. 


Nebraska Section: 


Chairman: Dale W. Behrens, Hastings College 

Chairman-Elect: Alexander Mehaffey Jr., Univer- 
sity of South Dakota 

AHSME Reg. Coord. NE: Richard L. Barlow, Kearney 
State College 

AHSME Reg. Coord. SD: Jan Vandever, South Dakota 
St. University 

Newsletter Editor: Henry M, Cox, University of 
Nebraska 

Newsletter Editor: Thomas S. Shores, University 
of Nebraska 

Secretary/Treasurer: Henry M. Cox, University of 
Nebraska 


New Jersey Section: 


Chairman: S. A. Foote, Rutgers University 

Chairman-Elect: Stephanie Sloyan, Georgian Court 
College 

Vice-Chairman Innovation: Thomas J. Marlowe, 
Seton Hall University 

Vice-Chairman Speakers: Patricia K. Kenschaft, 
Montclair State College 

Vice-Chairman 2-Year College: Ruth D. 0’ Dell, 
County College of Morris 

Vice-Chairman High School: John Sooy, Glassboro 
State College 

AHSME Reg. Coord. NJ: Robert Mitchell, Glassboro 
State College 

Newsletter Editor: F. E. Clark, Rutgers Univer- 
sity 

Public Info. Officer: F. E. Clark, Rutgers 
University 

Secretary: F. E. Clark, Rutgers University 

Treasurer: Catherine Folio, Brookdale Comm. Col- 
lege 


North Central Section: 


President: Loren C. Larson, St. Olaf College 

President-Elect: Lisl N. Gaal, University of Min- 
nesota 

Past President: Murray Braden, Macalester College 

Member-At-Large: Mary B. Johnson, Inver Hills 
Comm. College 

Member-At-Large: Ronald M. Mathsen, North Dakota 
St. Univ. 

AHSME Reg. Coord. MN: Wayne Roberts, Macalester 
College 

AHSME Reg. Coord. ND: John Whitcomb, University 
of North Dakota 

AHSME Reg. Coord. MB: John Barsby, St. John’s 
Ravenscourt 


1985 


AHSME Reg. Coord. SK: Don Kapoor, University of 
Regina 

AHSME Reg. Coord. SD: Jan Vandever, South Dakota 
St. University 

Newsletter Editor: Gerald E. Lenz, St. John’s 
University 

Public Info. Officer: Joe Gallian, University of 
Minnesota 

Secretary/Treasurer: Stephen R. Hilding, Gustavus 
Adolphus College 


Northeastern Section: 


Chairman: Thurmon Whitley, University of New 
Haven 

Vice-Chairman: Steven K. Ingram, Norwich Univer- 
sity 

2-Year Coll. Representative: Jean B. Smith, 
Middlesex Community College 

AHSME Reg. Coord. CT: James E. Trimble, Hartford 
Insurance Group 

AHSME Reg. Coord. ME: 
Life Ins. Co. 

AHSME Reg. Coord. MA: 
ter State College 
AHSME Reg. Coord. NH: 

High School 
AHSME Reg. Coord. RI: 
Kingston H.S. 
AHSME Reg. Coord. VT: 
sity of Vermont 
AHSME Reg. Coord. NB NF NS PE: William Crawford, 
Mt. Allison University 

Newsletter Editor: Philip H. Mahler, Middlesex 
Community College 

Public Info. Officer: Kenneth D. Lane, Colby Col- 
lege 

Secretary/Treasurer: Gordon D. Prichett, Babson 
College 


Larry Miller, Union Mutual 
Murray Abramson, Bridgewa- 
Owen R. Koppang, Concord 

Thomasine Manickas, South 


J.£.H. Johansson, Univer- 


Northern California Section: 


Chairman: Howard J. Weiner, University of Cali- 
fornia 

Vice-Chairman: Roseanna F. Torretto, State of 
California 

Program Chairman: Michael Thibodeaux, Menlo Col- 
lege 

AHSME Reg. Coord. No CA & NV: Robert McFarland, 
San Pablo, CA 

AHSME Reg. Coord. HI PI: Jack Johnson, Brigham 
Young U.-Hawaii 

AHSME Reg. Coord. GU: Gail S. Mullen, University 
of Guam 

Newsletter Editor: William G. Chinn, City Coll of 
San Francisco 

Public Info. Officer: William G. Chinn, City Coll 
of San Francisco 

Secretary/Treasurer: Leonard Klosinski, Univer- 
sity of Santa Clara 


Ohio Section: 


Chairman: James R. C. Leitzel, Ohio State Univer- 
sity 

Chairman-Elect: Alan G. Poorman, Ashland College 

Past Chairman: Richard A. Little, Baldwin-Wallace 

Program Chairman: Robert M. Dieffenbach, Miami 
University 

2-Year College Representative: Cynthia P. Yang, 
Miami University 

AHSME Reg. Coord. OH: Leo Schneider, John Carroll 
University 

AHSME Reg. Coord. WV: I. D. Peters, University of 
West Virginia 

Newsletter Editor: David E. Kullman, Miami 
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University 

Public Info. Officer: Susan G. Schneider, John 
Carroll University 

Secretary/Treasurer: Andrew Sterrett Jr., Denison 
University 


Oklahoma-Arkansas Section: 


Chairman: John W. Watson, Arkansas Tech Univer- 
sity 

Past Chairman: Morris L. Marx, University of 
Oklahoma 

First Vice~Chairman: William A. Coberly, Univer- 
sity of Tulsa 

Second Vice-Chairman: Edward Masley, Arkansas 
College 

AHSME Reg. Coord. OK: Thomas Cairns, University 
of Tulsa 

AHSME Reg. Coord. AR: John Hodges, University of 
Arkansas 

Newsletter Editor: Allan C. Cochran, University 
of Arkansas 

Public Info. Officer: James E. Scroggs, Univer- 
sity of Arkansas 

Secretary/Treasurer: John M. Jobe, Oklahoma State 
University 


Pacific Northwest Section: 


Chairman: Marjorie M. Enneking, Portland State 
University 

Vice-Chairman 2-Year College: 
Portland Community College 

Vice-Chairman 4-Year College: 
Jr., Lewis & Clark College 

Program Chairman: Alan Hoffer, University of Ore- 
gon 

AHSME Reg. Coord. AK: Donval R. Simpson, Fair- 
banks, AK 

AHSME Reg. Coord. ID: 
University 

AHSME Reg. Coord. OR: 
of Oregon 

AHSME Reg. Coord. WA: 
State University 

AHSME Reg. Coord. AB: 
Calgary 

AHSME Reg. Coord. BC YK NW: Leon Bowden, Univer- 
sity of Victoria 

AHSME Reg. Coord. MI: William Myers, University 
of Montana 

Newsletter Editor: Sue Kaplan, Western Washington 
University 

Public Info. Officer: Sue Kaplan, Western Wash- 
ington University 

Secretary/Treasurer: Sam L. Dunn, Seattle Pacific 
University 


Vernon R. Hood, 


Harvey J. Schmidt 


Tom Williams, Idaho State 
Marion Walter, University 
Calvin Long, Washington 


G. W. Sands, University of 


Rocky Mountain Section: 


Chairman: Ruth R. Struik, University of Colorado 

Chairman-Elect: Stephen D. Bronn, Univ. of South- 
ern Colorado 

Vice-Chairman: Marie A. Ritten, National College 

Program Chairman: Allan G. Skillman, Casper Col- 
lege 

AHSME Reg. Coord. CO WY: Bob Vunovich, Univ. of 
Southern Colorado 

AHSME Reg. Coord. MI: William Myers, University 
of Montana 

Newsletter Editor: David W. Ballew, S.D. School 
of Mines & Tech 

Public Info. Officer: David W. Ballew, S.D. 
School of Mines & Tech 

Secretary/Treasurer: David W. Ballew, S.D. School 
of Mines & Tech 
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Seaway Section: 


Chairperson: Charles L. Smith, SUNY College at 
Potsdam 

Chairperson-Elect: Kenneth Wooster, SUNY College 
at Cortland 

Past Chairperson: Kenneth D. Magill, CUNY Center 
at Buffalo 

First Vice-Chairperson: Rebecca E. Hill, Roches- 
ter Institute of Tech 

Second Vice-Chairperson: Morton Goldberg, Broome 
Community College 

AHSME Reg. Coord. No NY: Stephen West, SUNY Col- 
lege at Geneseo 

AHSME Reg. Coord. W NY: Louis F. Scholl, Trocaire 
College 

AHSME Reg. Coord. ON: F. A. Sherk, University of 
Toronto 

AHSME Reg. Coord. PQ English: Saverio Mirarchi, 
Laval Catholic High School 

AHSME Reg. Coord. PQ French: Paul Charlebois, 
CEGEP 

Newsletter Editor: Donald Trasher, SUNY College 
at Geneseo 

Public Info. Officer: Donald Trasher, SUNY Col- 
lege at Geneseo 

Secretary/Treasurer: Donald Trasher, SUNY College 
at Geneseo 


Southeastern Section: 


Chairman: F, L. Cook, University of Alabama 

Chairman-Elect: Edmond D. Dixon, Tennessee Tech 
University 

Vice-Chairman: Sharon C. Ross, DeKalb Comm. Col- 
lege 

Lecturer: John W. Kennelly, Clemson University 

AHSME Reg. Coord. AL: James 0°Neal, Troy State 
University 

AHSME Reg. Coord. GA: Jan L. Boal, Georgia State 
University 

AHSME Reg. Coord. NC: William Herbst, Jefferson 
Std. Life Insurance 

AHSME Reg. Coord. SC: Stanley Lukawecki, Clemson 
University 

AHSME Reg. Coord. TN: David Collings, Nat”1l Life 
& Acci’t Ins. Co. 

Newsletter Editor: David R. Stone, Georgia South- 
ern College 

Public Info. Officer: David R. Stone, Georgia 
Southern College 

Secretary/Treasurer: James G. Ware, U of Tennes- 
see at Chattanooga 


Southern California Section: 


Chairman: Donald Y. Goldberg, Occidental College 

Past Chairman: Robert F. Brown, UCLA 

First Vice-Chairman: David D. Shochat, Logicon 

Second Vice-Chairman: Adil M. Yaqub, University 
of California 

Program Chairman: Melvin Henrikson, Harvey Mudd 
College 

Program Vice-Chairman: Herbert A. Gindler, San 
Diego State University 

AHSME Reg. Coord. So CA: Alice M. King, Cal State 
Polytechnic Univ. 

Newsletter Editor: William Margulies, Cal State 
Long Beach 

Public Info. Officer: William Margulies, Cal 
State Long Beach 

Secretary/Treasurer: William Margulies, Cal State 
Long Beach 


Southwestern Section: 


Center Section 


June-July 


Chairman: William Y. Velez, University of Arizona 

Vice-Chairman: Richard M. Grassi, University of 
New Mexico 

AHSME Reg. Coord. AZ-NM: Elias Toubassi, Univer- 
sity of Arizona 

AHSME Reg. Coord. TX: James R. Boone, Texas A&M 
University 

Public Info. Officer: Alvin Swimmer, Arizona 
State University 

Secretary/Treasurer: Edward D. Gaughan, New Mex- 
ico State University 


Texas Section: 


Chair: Margaret R. Hutchinson, University of St. 
Thomas 

Chair~Elect: R. V. Morgan, Sul Ross State Univer- 
sity 

Immediate Past Chair: Robert H. Cranford, Univer- 
sity of Texas 

Level I Director: George Brewer, Odessa College 

Level II Director: Darwin Peake, Trinity Univer- 
sity 

Level III Director: James N. Younglove, U. of 
Houston-Central Campus 

Director~At-Large: Rose M. Smith, Texas Woman’ s 
University 

Arrangements Chair: Joseph Szucs, Texas A&M 
University-Box 1576 

Arrangements Chair: Anthony Blozinski, Texas A&M 
University-Box 1576 

Arrangements Chair-Elect: Vivian A. Dennis, East- 
field College 

AHSME Reg. Coord. TX: James R. Boone, Texas A&M 
University 

Newsletter Editor: Glen E. Mattingly, Sam Houston 
State University 

Public Info. Officer: James C. Bradford, Abilene 
Christian University 

Secretary/Treasurer: Glen E. Mattingly, Sam Hous- 
ton State University 


Wisconsin Section: 


Chairman: Gary L. Britton, U. of Wisconsin Center 

Chairman-Elect: Richard J. Mihalek, University of 
Wisconsin 

Past Chairman: Lois B. Brualdi, Edgewood College 

AHSME Reg. Coord. WI: James Muhich, University of 
Wisconsin 

Newsletter Editor: Nicholas C. Bystrom, Northland 
College 

Public Info. Officer: Karl E. Byleen, Marquette 
University 

Secretary/Treasurer: James M. Sobota, University 
of Wisconsin 


Committees of the Association 


Terms of members expire, except where other- 
wise noted, at the end of the Annual Meeting in 
January following the last year of service listed 
below. For temporary committees, no terms are 
listed since they are automatically discharged at 
the expiration of the President’s term of office, 
which is the Annual Meeting in January, 1986. 


Executive Committee: 


Lynn A. Steen, Chairman (1985-87); Gerald L. 
Alexanderson (1984-85); Ronald M. Davis (1985- 
86); Leonard Gillman (1983-87); Paul R. Halmos 
(1982-86); Ivan Niven (1985); Kenneth A. Ross 
(1984-1989). . 


1985 


Finance Committee: 


Lynn A. Steen, Chairman (1985-87), ex-officio; 
Lida K. Barrett (1984-87); Leonard Gillman 
(1983-87), ex-officio; Donald L. Kreider 
(1978-85); Ivan Niven (1983-86), ex-officio; 
Kenneth A. Ross (1984-1989) ex-officio. 


Audit and Budget Committee: Donald L. Kreider, 
Chairman (1978-85); Lida K. Barrett (1984-87). 


Investment Committee: Leonard Gillman, Chairman 
(1983-87), ex-officio; Henry L. Alder (1984- 
86); Harley Flanders (1984-86); Gerald J. 
Porter (1983-85); Kenneth A. Ross (1984-1989) 
ex-officio; James M. Vaughn, Jr. (1984-86). 


Staff and Services Committee: Leonard Gillman, 
Chairman (1983-87); Lynn A. Steen (1986), both 
ex-officio. 
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Mathematics: 


Judith V. Grabiner, Chairman (1979-85); Churchill 
Eisenhart (1984-86); Leonard Gillman (1979-85). 


Committee on Archival Holdings in Mathematics: 


Sanford L. Segal, Chairman (1984-85); Jeanne 
LaDuke (1984-85); Albert C. Lewis (1984-85); 
Uta C. Merzbach (1984-85). 


Nominating Committee: 


William G. Chinn, Chairman (1985); Richard D. 
Anderson (1985); Jerome A. Goldstein (1985-86); 
Warren Page (1985-86); Alan C. Tucker (1985- 
86). 
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Monthly: 


Alan C. Tucker, Chairman; Donald J. Albers; 
Stephen B. Maurer; Barbara L. Osofsky. 


Committee on Committees: 


Lynn A. Steen, Chairman (1985-87), ex-officio; 
Ivan Niven (1982-85), ex-officio; Kenneth A. 
Ross (1985-89), ex-officio; Alfred B. Willcox, 
ex-officio. 


Committee on Corporate Members: 


Richard A. Moynihan, Chairman (1981-85); Jeremiah 
J. Lyons (1981-86); Henry 0. Pollak (1983~-85); 
Peter L. Renz (1984-86); Alfred B. Willcox 
(1980-85), ex-officio. 


Committee on Earle Raymond Hedrick Lectures: 


James W. Cannon, Chairman (1985-86); Neil J.A. 
Sloane (1985~88); Elias M. Stein (1984-87). 


Committee on American Mathematics Competition 
(CAMC): 


Stephen B. Maurer, Chairman (1981-86); Richard D. 
Anderson (1985-86); Joyce Becker (1984-85); 
George Berzsenyi (1983-85); James D. Bristol 
(1983-85); Bruce Brombacher (1987); Thomas R. 
Butts (1982-86); Michael Ecker (1985-87); Elgin 
Johnston (1985-87); Irwin W. Kaufman (1982~85); 
Thomas R. Knapp (1987); Joseph Konhauser 
(1982-85); Gerald E. Martau (1984-86); Steven 
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Meiring (1984-86); Walter E. Mientka (1979-85); 
Robert Musen (1987); Harold Reiter (1984-86); 
Ian Richards (1987); Stephen Rodi (1987); Leo 
J. Schneider (1980-85); Margaret W. Tiller 
(1982-85); Dorothy Wendt (1984-86); Melvin R. 
Woodard (1981-86). 


Subcommittee on the American High School 
Mathematics Examination: Stephen B. Maurer, 
Chairman (1981-86); George Berzsenyi (1982-85), 
ex-officio; Thomas R. Butts (1980-86), ex- 
officio; Thomas R. Knapp (1987); Gerald E. Mar- 
tau (1984-86); Walter E. Mientka (1980-85), 
ex-officio; Harold Reiter (1984-86); Ian 
Richards (1987); Leo J. Schneider (1980-85); 
Dorothy Wendt (1984-86). 
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Mathematics Examination: Thomas R. Butts, 
Chairman (1982-86); Richard D. Anderson (1985- 
86); Michael Ecker (1985-87); Elgin Johnston 
(1985-87); Irwin Kaufman (1984-85); Stephen B. 
Maurer (1981-86), ex-officio; Steven Meiring 
(1984~86); Walter E. Mientka (1981-85), ex- 
officio; Stephen Rodi (1987); Melvin R. Woodard 
(1986). 


Subcommittee on the American Invitational 
Mathematics Examination: George Berzsenyi, 
Chairman (1982-85); James D. Bristol (1982-85); 
Michael W. Ecker (1987); Elgin Johnston (1987); 
Stephen B. Maurer (1981-86), ex-officio; Walter 
E. Mientka (1980-85), ex-officio; Ian Richards 
(1987), 
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Contests: David Bernstein (1985); W.J. Blundon 
(1985); Kenneth A. Brown, Jr. (1985); Pamela 
Drummond (1985); Richard A. Gibbs (1985); 
Samuel L. Greitzer (1985); Douglas A. Hensley 
(1985); Sheila Krilov (1985); Eric Lander 
(1985); Eugene McGovern (1985); Richard Parris 
(1985); Stanley Rabinowitz (1985); Richard 
Rhoad (1985); Mark Saul (1985); Akehiko 
Takahashi (1985); Claire Tuckman (1985). 


Subcommittee on the USA Mathematical Olympiad: 
J. Ian Richards, Chairman (1985-87); George 
Berzsenyi (1980-85), ex-officio; Irwin W. Kauf- 
man (1982-85); Murray S. Klamkin, Problems 
Chairman (1982-85); Joseph Konhauser (1982-85); 
Andrew Liu (1981-86); Stephen B. Maurer (1986), 
ex-officio; Walter E. Mientka (1985), ex- 
officio; Cecil Rousseau (1985-87). 


IMO Training Session: Cecil Rousseau, Director 
(1985); Gregg Patruno, Assistant Director 
(1985). 


Committee on Placement Examinations: 


John W. Kenelly, Chairman (1985); Linda H. Boyd 
(1984-86); Judith N. Cederberg (1985-87); John 
G. Harvey (1984-86); Mary McCammon (1985-87); 
Billy E. Rhoades (1982-85). 


Committee on Publications: 


Alan C. Tucker, Chairman (1982-85); Donald J. 
Albers (1985); Duane W. Bailey (1984-86); 
Charles A. Cable (1982-85); James P. Crawford 
(1984-86); Ross L. Finney (1980-85); Leonard 
Gillman (1983-87), ex-officio; Paul R. Halmos 
(1982-86), ex-officio; Pete Hayslett (1985-87); 
John Karlof (1982-85); William F. Lucas (1982- 
87); Donald J. McCarthy (1984-86); Warren Page 
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(1985-88), ex-officio; Doris Schattschneider 
(1981-85), ex-officio. 


Subcommittee on Carl B. Allendoerfer Awards: 
JoAnne Growney, Chairman (1985); Underwood Dud- 
ley (1985-87); Doris Schattschneider (1982-85), 
ex-officio; Marjorie Senechal (1986). 


Subcommittee on Basic Library Lists: Deborah T. 
Haimo (1983-86); William F. Lucas (1982-85). 


Subcommittee on Carus Monographs: Duane Bailey, 
Chairman (1985-86); Pete Hayslett (1985-87); 
John Karlof (1982-85). 


Subcommittee on Dolciani Mathematical Exposi- 
tions: Joseph Malkevitch, Chairman (1981-87); 
Donald J. McCarthy (1983-86); Donald B. Small 
(1982-85). 


Subcommittee on Lester R. Ford Awards: Richard 
S. Millman, Chairman (1985); Paul R. Halmos 
(1983-86), ex-officio; Robert Osserman (1984- 
86); Stephen E. Puckette (1985-87). 


Subcommittee on MAA Notes: Alan C. Tucker, 
Chairman (1982-85); Charles A. Cable (1982-86); 
William F. Lucas (1982-85). 


Subcommittee on MAA Studies in Mathematics: 


Alan C. Tucker, Chairman (1980-85); James P. 
Crawford (1984-86); William F. Lucas (1985-87). 


Subcommittee on New Mathematical Library: Wil- 
liam G. Chinn, Chairman (1982-85); Donald J. 
Albers (1985); Basil Gordon (1980-85); Anneli 
Lax, ex-officio; Max M. Schiffer (1979-85). 


Subcommittee on George Polya Awards: Larry A. 
Curnutt, Chairman (1982-85); Donald J. Albers 
(1982-87); Patrick Boyle (1984-86); Warren Page 
(1985-88), ex-officio. 


Committee on Articulation: 


Billy E. Rhoades, Chairman (1982-85); Philip C. 
Curtis, Jr. (1982-85); Ann D, Holley (1982-85); 
Chancey O. Jones (1982-85); Joan P. Leitzel 
(1982-85); Vincent O”’Connor (1982-85); Barbara 
Wickless (1982-85). 


Committee on Minicourses: 

John 0. Riedl, Chairman (1982-87); Sol Garfunkel 
(1984-86); Frederick Hoffman (1982-87); Warren 
Page (1982-85); Doris Schattschneider (1984- 
85); David A. Smith (1984-86). 


Committee on Secondary School Lectures: 


Henry Gore, Chairman (1982-85); Jane M. Day 
(1984-86); Allen H. Holmes (1985-87); John M. 
Jobe (1981-86); Amy C. King (1982-85); Gary J. 
Sherman (1982-85); David L. Skoug (1982-85); 
Donald B. Small, Advisor (1979-85). 


Committee on Sections: 


David W. Ballew, Chairman (1981-87); Yousef Alavi 
(1984-86), ex-officio; Sylvan Burgstahler 
(1984-86); Barbara T. Faires (1982-85); Glen E. 
Johnston (1985); Norman F. Lindquist (1985-86); 
John D. Neff (1985-87); John M. Smith (1985- 
86); Alfred B. Willcox, ex-officio. 


Public Information Panel: Yousef Alavi, Chair- 


man (1984-86); James C. Bradford (1984-86); 
Peter Hilton (1984-85); James R. Leitzel 
(1984-87); Jean J. Pedersen (1984-85); Marcia 
P, Sward, ex-officio. 


Committee on Special Funds of the Association: 


Lida K. Barrett, Chairman (1981-86); Leonard 


Gillman (1980-87), ex-officio; John M. Jobe 
(1982-87); Eileen L. Poiani (1982-87); G.B. 
Price (1985-86); David P. Roselle (1985-88); 
Richard M. Schori (1982-87). 


Leonard Gillman, ex officio, Chairman; Ronald 


Davis, ex officio; Lida Barrett, ex officio; 
Ivan Niven, ex officio. 


to Mathematics: 


Peter J. Hilton, Chairman (1985); Henry L. Alder 


(1984-86); Anneli Lax (1985-87). 
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Peter J. Hilton, Chairman (1985); Theodore W. 


Gamelin (1985-87); Lawrence A. Zalcman (1984- 
86). 


Committee on the MAA Book Prize: 


Doris Schattschneider, Chairman (1985); J.A. 


Seebach (1984-86); Gary J. Sherman (1985-87). 
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Leonard F. Klosinski, Director (1980-86); Bruce 


Reznick, Chairman (1985); Gerald L. Alexander- 
son (1983-86); Loren C. Larson, Advisor (1984- 
86); Richard P. Stanley (1984-86); Harold M. 
Stark (1985-87). 


Committee on the Teaching of Undergraduate 


Mathematics (CTUM): 


Alan H. Schoenfeld, Chairman (1982-85); Donald W. 


Bushaw (1980-85); Gloria F. Gilmer (1981-86); 
Bonnie Gold (1985-87); Christopher Nevison 
(1984-86); Jean J. Pedersen (1984-86); Peter 
Ross (1985-87); Martha J. Siegel (1981-86). 


CIUM-CCIME Panel on Computers and Technology in 
Mathematics Instruction: Ronald H. Wenger, 
Chairman; Dorothy Derringer; Gloria Gilmer; 
Christopher Nevison; Gerald Porter; David I. 
Schneider; J.A. Seebach; Martha J. Siegel; 
David Smith. 


ee ad 


Donald W. Bushaw, Chairman; Robert B. Ash; 
Susanna S, Epp; Robert H. McDowell; Jean J. 
Pedersen; Arnold K. Pizer; James W. Vick. 


CTUM Panel on Teaching Assistants and Part-Time 
Instructors: Bettye Anne Case, Chairman; Felix 
E. Browder; Wendell Fleming; John Philip 
Huneke; David Kraines; Alan H. Schoenfeld, ex- 
officio. 


Committee on Two-Year Colleges (CTYC): 


Ann E. Watkins, Chairman (1985-86); Donald J. 


Albers (1984-86); James J. Baldwin (1984-86); 
Bettye Anne Case (1985-88); Wade Ellis, Jr. 


1985 Center 
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Program: 


John D. Neff, Chairman (1985-86); Lida K. Barrett 
(1985-86); John Bradburn (1985-86); Theodore S. 
Chihara (1985-86); William F. Lucas (1985-86). 


Committee on Election Procedures: 


Richard D. Anderson, Chairman (1984-86); Henry L. 
Alder (1984-86); Henry 0. Pollak (1984-86). 
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Kenneth A. Ross, Chairman; Donald J. Albers; Lynn 
A. Steen; Marcia Sward; Alfred B. Willcox. 


MAA Program Committee for Laramie Meeting: 


Kenneth I. Gross, Chairman; David W. Ballew; 
Robert E. Gaines; A. Duane Porter. 
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George E. Andrews, Chairman (MAA); Jeanne L. 
Agnew (MAA); Paul F. Baum (AMS); William P. 
Ziemer (AMS). 


MAA Program Committee for New Orleans Meeting: 


Jerome A. Goldstein, Chairman; Garrett J. Etgen; 
O. Carruth McGehee; John L. Tilley. 
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William C. Waterhouse, Chairman (AMS); George E. 
Andrews (MAA); Judith V. Grabiner (MAA); W. 
Gilbert Strang (AMS). 


Joint Committees 


Joint AMS-MAA-SIAM Committee on Employment Oppor- 


rn ection «ETE, ene pmctetsenereeteD — eagaatmahee 


tunities: (Terms expire October 31) 


Patrick Hagan, Chairman (1985, SIAM); Wilfred E. 
Barnes (1985-87, MAA); Richard E. Ewing (1985- 
86, SIAM); Calvin T. Long (1985-86, MAA); John 
W. Petro (1985, AMS); Donald C. Rung (1985-87, 
AMS). 
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puter Science: 


Donald L. Kreider, Chairman; Susan J. Devlin 
(MAA); Patrick Goldberg (ACM); Zaven Karian 
(MAA); Anthony Ralston (ACM); Olaf Stakelberg 
(ACM). 
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ences: 


Carole B. LaCampagne, Chairman (1985-86); Nancy 
Angle (1985, NCTM); Raymond G. Ayoub (1985-86, 
AMS); Susan J. Devlin (ASA); Marjorie G. Hahn 
(IMS); Gloria C. Hewitt (1985-86; MAA); Julie 
Knight (1985-86, AMS); Betty K. Lichtenberg 
(1985-87, NCTM); Dianne’ P. O’ Leary (1985, 
SIAM); Ingram Olkin (IMS); Katherine L. Peder- 
sen (1985, NCTM); Linda Petzold (1985-86, 
SIAM); Judith Roitman (1985-86, AMS); Alice T. 
Schafer (1985-86, MAA); Elisabeth L. Scott 
(IMS). 


AMS-MAA Joint Meetings Committee: 


William J. LeVeque, AMS; Everett Pitcher, AMS; 
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(1985-88); Ann D. Holley (1982-87); Charles 
David Miller (1985-87); Warren Page (1982-86), 
ex-officio; Stephen B. Rodi (1985-86); Jean B. 
Smith (1982~85); Larry Trivieri (1985-87). 


Committee on the Undergraduate Program in 
Mathematics (CUPM): 


Jerome A. Goldstein, Chairman (1983-87); Donald 
J. Albers (1984-86); Gerald L. Alexanderson 
(1980-85); Donald W. Bushaw (1982-85); Bettye 
Anne Case (1982-87); Philip C. Curtis, Jr. 
(1984-86); Kenneth I. Gross (1985-87); Stephen 
B. Maurer (1985-86); Mary A. Norton (1982-85); 
Billy E. Rhoades (1981-86); Alan H. Schoenfeld 
(1982-85); Lynn A. Steen (1985-86), ex-officio; 
W. Gilbert Strang (1985-87). 


Joint CUPM-CCIME Panel on the Curricular 
Aspects of Computers in the Classroom: Carl 
Leinbach, Chairman; Sheldon P. Gordon; John M. 
Hosack; F. David Lesley; Stephen B. Maurer; 


Gerald Porter; J.A. Seebach,. 


CUPM Panel on the Continuing Mathematical Edu- 
cation of Teachers (COMET): Calvin T. Long, 
Co-Chairman; Bruce E. Meserve, Co-Chairman; 
Henry L. Alder; John A. Dossey; Marjorie M. 
Enneking; Richard K. Guy; Katherine P. Layton; 
Mary Ann Roberts Norton; Alan R. Osborne. 


Joint CUPM-CTUM Panel on Service Courses: 
Donald W. Bushaw, Chairman; David W. Ballew; 
Jerome A. Goldstein; Martha J. Siegel; Maynard 


D. Thompson; James W. Vick. 
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at Two Year Colleges: Ronald Davis, Chairman; 
Geoffrey Akst; Donald J. Albers; Richard D. 
Anderson; John Bradburn; Bettye Anne Case; 
Larry Curnutt; Dale Ewen; Sheldon Gordon; Karl 
Smith. 
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Gloria F. Gilmer, Chairman; Donald J. Albers; 
Gerald L. Alexanderson; Carole Bauer; William 
Drezdzon; Eleanor Green Jones; John G. Harvey; 
Miriam R. Hecht; Peter Hilton; Anneli Lax; Jean 
J. Pedersen. 


CUPM Panel on Calculus Articulation: Donald 
Small, Chairman; Gordon Bushaw; John H. Hodges; 
Donald Nutter; Ronald Schnackenberg; Donald 


Sherbert; Barbara Stott. 
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Chairman; Donald Bushaw; Jerome Goldstein; 
Anthony Ralston; Alfs T. Berztiss (ACM); Gerald 
L. Isaacs (ACM); John Schmeelk (ASEE); Stephen 
B. Maurer. 


Committee on Computers in Mathematics Education 


(CCIME): 


Gerald J. Porter, Chairman (1985); Susan Devlin 
(1985-86); Donald L. Kreider (1985-86); L. Carl 
Leinbach (1985-86); F. David Lesley (1985); 
Alan Schoenfeld (1985-87); J.A. Seebach (1985- 
87); Ronald Wenger (1985). 


Committee on Visiting Lecturers and Consultants: 


Theodore $. Chihara, Chairman (1981-86); Vivian 
Y. Kraines (1981-86); Jon M. Laible (1980-85); 
Eugene R. Seelbach (1979-87). 
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Kenneth A. Ross (1984-89), MAA; Alfred B. 
Willcox, MAA. 
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John W. Kenelly, Co-Chairman (1985, CEEB); Ken- 
neth A. Ross, Co-Chairman (1984-89, MAA); Henry 
L. Alder (1985, MAA); Chancey 0. Jones (CEEB); 
Donald L. Kreider (CEEB); Lynn A. Steen (1985- 
86, MAA), ex-officio. 


Joint Policy Board for Mathematics (JPBM): 


1. Edward Block (SIAM), ex-officio; Hirsh G. 
Cohen (SIAM), ex-officio; Irving Kaplansky 
(AMS), ex-officio; William J. LeVeque (AMS), 
ex-officio; Kenneth A. Ross (MAA), ex-officio; 
Lynn A. Steen (MAA), ex-officio; Alfred B. 
Willcox (MAA), ex-officio. 
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Robert McKelvey, Chairman (SIAM); Lida K. Barrett 
(MAA); Wendell H. Fleming (SIAM); Murray Ger- 
stenhaber (AMS); Louise Hay (AMS); Irwin Kra 
(member~at~large); Bernard L. Madison (MAA). 


JPBM Committee for Mathematics Department Heads: 


David Ballew, Chairman (1985-86); Donald Albers 
(1985); John Feroe (1985-87); John D. Fulton 
(1985-87); Paul D. Humke (1985-86); David J. 
Lutzer (1985-87); Bernard Madison (1985); 
Stephen B. Rodi (1985-86); Richard Schori 
(1985). 
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Everett Pitcher, Chairman (AMS); I. Edward Block 
(SIAM); Paul W. Davis (SIAM); Leonard Gillman 
(MAA); William J. LeVeque (AMS); Franklin P. 
Peterson (AMS); Kenneth A. Ross (MAA); Alfred 
B. Willcox (MAA). 


JPBM Committee on Public Understanding of 
Mathematics: 


Ronald L. Graham (AMS); Joseph B. Keller (SIAM); 
Lynn A. Steen (MAA). 
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at Traditionally Black Institutions: 


C.B. Bell (AMS); Lipman Bers (AMS); R. Creighton 
Buck (MAA); I.N. Herstein (MAA); Frank A. James 
(NAM); Ted Sykes (NAM). 


Editorial Boards of the Association 


American Mathematical Monthly (All terms expire 
December 31, 1986): 

Editor: Paul R. Halmos. 

Associate Editors: Gerald L. Alexanderson, Sabra 
S. Anderson, Sheldon Axler, Thomas F. Banchoff, 
Ralph P. Boas, David Borwein, W.W. Comfort, 
Hugh M.W. Edgar, Allan L. Edmonds, John H. 
Ewing, Richard K. Guy, Raoul Hailpern, Richard 
S$. Millman, Dale H. Mugler, Seymour Schuster, 
J.A. Seebach, Lynn A. Steen, Mary R. Wardrop, 
Robert F. Wardrop, Herbert S. Wilf. 


Mathematics Magazine (All terms expire December 
31, 1985): 
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Editor: Doris Schattschneider. 

Associate Editors: Edward J. Barbeau, John A. 
Beidler, Paul J. Campbell, Underwood Dudley, 
Gerald A. Edgar, Joseph A. Gallian, Judith V. 
Grabiner, Raoul Hailpern, Joseph Malkevitch, 
Pierre J. Malraison, William A. McWorter, Jr., 
Leroy F. Meyers, Jean J. Pedersen, Gordon Rais- 
beck, Ian Richards, Eric S. Rosenthal, Dan 
Shapiro, David A. Smith. 


The College Mathematics Journal (All terms expire 
December 31, 1988): 

Editor: Warren Page. 

Associate Editors: Jeanne Agnew, Gerald L. Alex- 
anderson, Patrick J. Boyle, Donald W. Bushaw, 
Michael W. Chamberlain, Leo Chosid, Josephine 
Edwards, Gino T. Fala, Stanley Friedlander, 
Robert M. Geist, Samuel A. Greenspan, Erwin 
Just, James J. Kaput, Alan Mackay, Peter L. 
Renz, Billie A. Rice, Ann E. Watkins, David H. 
Wheeler, Milton E. Winger. 


Newsletter Editorial Committee: 


Ronald M. Davis, Chairman (1981-85); Yousef Alavi 
(1981-86), ex-officio; Donald J. Albers, Asso- 
ciate Editor (1985); Philip M. Cheifetz (1981- 
86); William Chinn, Associate Editor (1981-85); 
Joseph A. Gallian (1981-86); Katherine P. Lay- 
ton (1981-86); Stephen B. Maurer, Associate 
Editor (1983-86); John D. Neff (1985-87); 
Eileen L. Poiani (1981-85); Kenneth A. Ross 
(1984-89), ex-officio; Marcia Sward, ex- 
officio; Alan C. Tucker (1985); Paul Zorn, 
(1985-87). 


Representatives of the Association 


To Sections of AAAS: Section A: A.B. Willcox 
(1984-86); Section Q: A.B. Willcox (1984-86); 
Section U: A.B. Willcox (1984-86); Section X: 
A.B. Willcox (1984-86). 
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(1985-87); Courtney S$. Coleman (1985-87); Peter 
A. Lindstrom (1985-87); Michael Olinick (1985- 
87). 
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Lynn A. Steen (1985-86), ex-officio. 


On the Discipline Committee for Mathematics of 
CEEB: Donald L. Kreider. 


On the Governing Council of Mu Alpha Theta: Kath- 
erine P. Layton (1985-87). 


MAA Representative on AMS Science Policy Commit- 
tee: Lynn A. Steen (1985-86). 


(1981-86). 


On the April 1986 NCTM Program Committee: Ronald 
M. Davis. 
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7. A. Tenenbaum and M. Augenstein, Data Structures Using PASCAL, Prentice-Hall, Englewood Cliffs, NJ, 
1981, pp. 149-154. 

8. T. R. Walsh, The Towers of Hanoi Revisited: Moving the Rings by Counting the Moves, Information 
Processing Letters 15, 1982, pp. 64—67. 

9. D. Wood, The Towers of Brahma and Hanoi Revisited, Computer Science Technical Report No. 80-CS-23, 
McMaster University, 1980. 


NOTES 


EDITED BY SABRA S. ANDERSON, SHELDON AXLER, AND J. ARTHUR SEEBACH, JR. 
For instructions about submitting Notes for publication in this department see the inside front cover. 


A SHORT PROOF OF STEINHAUS’ THEOREM ON SUMMABILITY 


JEFF CONNOR 
Department of Mathematical Sciences, Kent State University, Kent, OH 44242 


In this note, we offer a soft (and short!) proof of a result of Steinhaus: a regular matrix 
summability method cannot sum all sequences of 0’s and 1’s. 

Let x and y be sequences. By a matrix summability method we mean a sequence to sequence 
transformation defined by an infinite matrix A = (a, ,)?%,~,, an array of complex numbers, and 
y = Ax when 


o@) 
y= do a, -x, foralneN. 
k=1 


A is called regular if it maps convergent sequences into convergent sequences and preserves limits, 
1e., lim, (Ax), = im,.x, if x is convergent. Regular matrices are characterized by the following: 


THEOREM (Silverman, Toeplitz) [PS]. Let A = (a, ,)? ,=, be a matrix summability method. A 
is regular if and only if 


(1) lim )) a, , = 1, 

" k=1 
(2) lima, ,=0 forallk © N, and 
(3) there is an M > 0 such that 


ioe) 
y la, .]< M forallne N. 
k=1 


A sums x or x is A-summable if A maps x into a convergent sequence. 
We also use the Baire Classification Theorem, a direct ancestor of the Baire Category Theorem: 


THEOREM (Baire) [O, p. 32]. Let X be a complete metric space and (f,)°_, a sequence of 


n=1 
continuous complex-valued functions. If f(x) = lim, f,(x) exists for all x © X, then f is continuous 
except at a set of points of first category. 


Let Q denote the sequences of 0’s and 1’s and define a metric d on Q by 


d(x,y) = » |x, — Vp 720", 
n=] 
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Observe that (Q,d) is a complete metric space homeomorphic to the Cantor set, the metric 
topology being the topology of coordinatewise convergence in (2. We are now ready to give the 
promised proof. 


THEOREM (Steinhaus) [PS], [S]. 4 regular matrix summability method cannot sum all sequences 
of 0’s and 1’s. 


Proof. Let A = (a, ,) be regular. By (3), L#_,|a,,| is convergent for each j € N, hence 
f(x) = L¥_14, x, is defined for all x = (x,) €Q, and f is a continuous complex-valued 
function on Q because |f(x)—f(y)| < LP,4,114,,4| << € whenever d(x,y)<27' and i is 
sufficiently large. 

Now, suppose A sums all sequences of 0’s and 1’s, or in other words, f(x) = lim, f(x) is 
defined for all x € Q. Notice that any open ball about any point in Q contains two points z and 
w where z is eventually all 1’s and w is eventually all 0’s. Since A is regular, we have that 
|f(z) — f(w)| = 1. Hence, f is discontinuous at every point of Q. 

f, however, is the pointwise limit of a sequence of continuous functions, and consequently must 
be continuous at at least one point of 2. This is a contradiction. @ 


References 


[PS] R. E. Powell and S. M. Shah, Summability Theory and Its Applications, Van Nostrand Reinhold, London, 
1972. 

[O] J. C. Oxtoby, Measure and Category, Springer-Verlag, New York, 1971. 
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A COMBINATORIAL CONSTRUCTION OF A NONMEASURABLE SET 


ROBIN THOMAS 
Department of Mathematical Analysis, Charles University, Sokolovska 83, 186 00 Prague, Czechoslovakia 


We give a slightly unusual construction of a Lebesgue nonmeasurable set on the real line using 
a fundamental graph-theoretical assertion (Proposition 1). 

First recall some graph-theoretical notions. A graph is a pair (V, E), where V, the set of 
vertices, is an arbitrary set (possibly infinite) and E, the set of edges, is contained in ( ) (= the 
collection of all two-element subsets of V). A path in a graph (V, E) is a sequence Up,...,u, of 
distinct elements of V such that {v,,v,,,} © E for i = 0,1,...,2 — 1. The number of edges, 7, is 
the length of the path. A cycle is a closed path, that is, vu) = v,. A graph (V, £) 1s called bipartite 


if there are disjoint sets V, and V, such that V, U V, = V and 


neither V, nor V, contains both endpoints of an edge. 
The following two propositions are well known, but for the reader’s convenience their proofs 
are given here. 


are disjoint from E, that is, 


PROPOSITION 1. Any graph (V, E), which has no cycles of odd length, is bipartite. 


Proof. It suffices to prove the proposition for connected graphs only. Choose any x © V and 
define 
V, = {y © V: any path connecting x and y is of odd length}, 
V, = {y © V: any path connecting x and y is of even length}. 


Since there are no odd cycles in (V, E), any two paths connecting x and y are of the same parity. 
Thus V, and V, yield the desired partition of V. 


PROPOSITION 2. Let M be a Lebesgue measurable set of real numbers with positive measure. Then 
there exists a 8 > 0 such that MX (x + M)# © for any x © R with |x| < 6. 
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Proof. Find a closed set F and an open set G with F C M and FC G such that 3AG < 4A F 
(where A is the Lebesgue measure). Since G is a countable union of disjoint open intervals, there 
is one among them, say J, such that 3AJ < 4A(FO J). Let 6 =AJ/2 and suppose that |x| < 6. 
Then J U(x +) is an interval of length less than 3AJ which contains both FJ and 
x + (FO I). The last two sets cannot be disjoint, since otherwise 


SAI = ZAI + ZAI < A((F AT) U(x +(FOT))) < A(T U(x + J)) < SAI, 


which is a contradiction. Hence, 
®@#A(FOAT)A(x+(FNT))CMA(x+M), 
which completes the proof. 


We now proceed to the construction. Define a graph (R, £) as follows: {x, y} © E when 
|x — y| = 3* for some integer k. We claim that (IR, E) contains no cycles of odd length. Indeed, 
suppose that x),...,x, is acycle of length n in (IR, E). Then for i = 1,2,...,n,x, = x,;_, + §3", 
where , = +1 and each k;, is an integer. Then 


E34 + 0+ +€ + 3hn = 0, 
For sufficiently large N, 
g, ~3KtN a... +€&- Zen tN 


is a sum of odd integers, so in order for the sum to be 0, there must be an even number of them. 
This shows that the cycle is even. By Proposition 1 there are disjoint sets A and B with 
R = A UB such that neither A nor B contains both endpoints of an edge. Since x and x + 3* 
are joined by an edge, we have A + 3* C B and B+ 3* CA for any integer k. Now by 
Proposition 2 the sets A and B are nonmeasurable. 


REMARKS. (i) In fact we see that A + 3% = B and A — 3* = B for any integer k. This follows 
from the inclusions 


A=A+3*—3%'CB-3* CA, 
B=B+4+3*—-3*'cA-—3% CB. 


(11) The same idea has recently been used by NeSetfil and Rédl [1] to obtain a short proof of 
the existence of non-Ramsey sets. 


EXERCISE. It is known that any construction of a nonmeasurable set must involve the Axiom of 
Choice. At which step of our construction is the Axiom of Choice necessary? 


Reference 


1. J. NeSetril and V. Rédl, Two Remarks on Ramsey Theorem, to appear in Discrete Math. 


WHEN IS AN ORDINARY DIFFERENTIAL EQUATION SEPARABLE? 


DAVID SCOTT 
Department of Mathematics and Computer Science, University of Puget Sound, Tacoma, WA 98416 


Elementary differential equations texts generally present the method of separation of variables 
by stating that if one can write f(x, y) = g(x)/A(y), then the equation y’ = f(x, y) can be 
solved by integrating each side of h(y) dy = g(x) dx. They don’t raise the question of whether 
f(x, y) factors into a product of a function of x times a function of y. Most of the exercises give 
f(x, y) in factored form, and many students cannot solve y’ = 1 + x + y? + xy* unless they are 
told it is separable. 

Most texts mention that a separable equation gives rise to the exact differential g(x) dx — 
h(y) dy, while treating separation of variables and exact differentials as distinct methods. The 
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sections on exact equations contain the standard test for exactness and often include a discussion 
of integrating factors, but I have seen none that develops a test for separability. 

This paper gives a simply stated test for separability. In what follows D, the domain of f, is an 
open convex set in the plane, f is real valued, and partial derivatives of f are indicated by 
subscripts. 


PROPOSITION 1. Jf there are differentiable functions @(x) and (y) such that f(x,y)= 
$(x)¥(y) for all (x, y) © D, then f(x, y) fry(%s¥) = K(X VIA, (% Y) for all (x, y) © D. 


The proof follows directly from f, (x, y) = $’(x)W(y) and the corresponding formulas for the 
other partial derivatives. 


PROPOSITION 2. If f, f., f, and f., are continuous in D, f(x,y) is never 0 in D, and 
f(x, Vf y¥) =A. YA, (%, y) for all (x, y) © D, then there are continuously differentiable 
functions @(x) and W(y) such that f(x, y) = $(x)W(y) for all (x, y) © D. 


Proof. f has the same sign throughout D and factoring f is equivalent to factoring —f, so we 
may assume that f is positive on D. Now 


8 flxsy) — FOV) hy Coy) ~fhOOWAOCEY) _ 
dy f(x,y) f(x,y) 


Since In( f(x,y) = f.(x, y)/f(x, y), we conclude that Sin f(x, 9) is a function of x 


0. 


alone and can write in f(x, y)) = a(x). The function « is continuous because f and f, are. 


Let B(x) = fa(x) dx. Then there is a function y such that In( f(x, y)) = B(x) + y(y). y is 
continuously differentiable because f and f, are continuous. 


Let $(x) = exp(B(x)) and ¥(y) = exp(y(y)). Then f(x, y) = 6(x)¥(y), completing the 
proof. 


As the following example shows, the condition that f is nonzero on D cannot be dropped. 
Define f(x, y) by 


x*exp(y), ifx > 
f(x,y) =: 2 . 
x‘exp(2y), ifx < 


Direct computation shows that on R? we have ff, y =f. f,. However f does not factor since if 
it did f(1, y), which is exp(y), and f(—1, y), which is exp(2y), would be proportional as 
functions of y. 

One can replace the condition that f is nonzero on D by requiring that f(x, y) be an analytic 
function of x and y. Then, using Proposition 2 and the fact that if two analytic functions agree in 
a neighborhood of a point in D they agree on the connected set D, one can show that ff,, = f.f, 
implies that f factors. 


TOPOLOGICAL PARTITIONS OF EUCLIDEAN SPACE BY SPHERES 


PAUL BANKSTON 
Department of Mathematics, Statistics and Computer Science, Marquette University, Milwaukee, WI 53233 


RALPH Fox 
Department of Mathematics, George Mason University, Fairfax, VA 22030 


A space X partitions a space Y if there is a family of topological embeddings of X into Y 
whose images form a cover of Y by pairwise disjoint sets. In [1] (whose notation and terminology 
we follow here) it is shown that the n-sphere S(n) cannot partition euclidean (n + 1)-space R”*! 
(an easy result) and that if X is any nonempty subspace of S(n), then X partitions R*”*'. An 
obvious question then arises as to whether S(n) partitions R” for n+1<m< 2n+ 1 (Ques- 
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tion 3.1 (iv) of [1]). In this note we give an affirmative answer based on an unpublished 
construction due to S. Kakutani and communicated to us by P. R. Halmos [4] (who learned of the 
construction forty years ago). This construction, used to give a partition of R° by S(1), is 
extremely simple, essentially different from those given in [1], and easily generalizable. Because 
several people have looked at the problem since the completion of [1], we thought the solution 
should be publicized as a short postscript to that paper. 


THEOREM. S(n) partitions R"** in such a way that each sphere is tamely embedded, unknotted, 
and not linked with any other member of the partition. 


Proof. We partition R"*? into n-spheres as follows: First let C, be an open unbounded 
cylinder in R”**. To be specific, we let X, be a coordinate axis in R”*! and let C, be the 
product of X, with the open unit ball in the (7 + 1)-dimensional vector subspace perpendicular 
to X,. Now R”**\ C,, being homeomorphic with R’ x [1, 00) X S(n), is easily partitioned into 
spheres; and C, is homeomorphic with R"*?. 

Let C, be an open unbounded cylinder in C, formed in a U-shape—i.e., let R,, R, be parallel 
translates of a closed ray in X,, let L be the straight line segment joining the endpoints of R,, R5, 
and let C, be the interior of a regular thickening of R, U L U R,. We then choose a branch of C, 
and form another open unbounded U-shape C, in exactly the same fashion, making sure that the 
base of C, is at least a unit distance from the base of C,. Proceed inductively. For each n > 1, C, 
is homeomorphic with R”*?, and C, \C,,, is homeomorphic with R! x [1, 00) X S(n). Finally, 
because the U-shapes are pushed unboundedly far from the origin, it is easy to see that 
_,C, = ©. This completes the construction. (It is obvious that there is no linking or knotting, 
and each sphere is tamely embedded.) 0 

The following question now becomes inevitable. 


Revised Question 3.1 (iv) [1]: Is there a nonempty subspace of $(2) which does not partition 
R*? Of course by the above we now need only look in R? ¢€ S(2) for candidates, but this brings 
us to the question of whether in general there are nonempty subsets of R” which do not partition 
R"*!. This is basically Question 3.1 (ii) in [1], and we are little closer to a solution now than we 
were when the issue first arose. 


REMARK. In [5], A. Szulkin offers a new solution to the problem of whether S(1) partitions R°, 
in response to a re-posing of the question by H. S. Shapiro. (Problems like this have a way of 
surfacing time and again. See also [2] and [3] for similar problems.) So far, we know of four 
distinct solutions, each with its own particular merits. To comment: 


(i) Of the two solutions offered in [1], one of them uses the Axiom of Choice. While the 
constructive solution is more visual, the Jordan curves in the partition are quite general. However, 
in the nonconstructive solution, all of the Jordan curves can be taken to be standard circles (of 
equal radius). Neither solution generalizes to higher dimensions. 

(ii) Szulkin’s solution combines the attractive features of both of the solutions in [1]; namely it 
constructively partitions R? into circles (whose radii include all positive real numbers). However, 
it too fails to generalize to higher dimensions. 

(iii) While the Kakutani solution (as well as the above generalization) is constructive, it involves 
highly noncircular Jordan curves. This, of course, prompts the question of whether R”** can 
always be partitioned into standard n-spheres. 


References 


1. P. Bankston and R. McGovern, Topological partitions, General Topology and Appl., 10 (1979) 215-229, 
MR # 801:54007. 
2. , On partitions of plane sets into simple closed curves, Proc. Amer. Math. Soc., 88 (1983) 691-697. 
, On partitions of plane sets into simple closed curves, II, Proc. Amer. Math. Soc., 89 (1983) 498-502. 
R. Halmos (private communication). 
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THE TEACHING OF MATHEMATICS 
EDITED BY MARY R. WARDROP AND ROBERT F. WARDROP 


Material for this department should be sent to Professor Robert F. Wardrop, Department of Mathematics, Central 
Michigan University, Mount Pleasant, MI 48859. 


A NON-SIMPSONIAN USE OF PARABOLAS IN NUMERICAL INTEGRATION 


ARTHUR RICHERT 
Department of Mathematics, Southern College, Collegedale, TN 37315 


This paper presents a numerical integration technique which uses parabolas differently than 
does Simpson’s rule. The degree of precision of the technique is three, as is the case for Simpson’s 
rule, while the error bound is an improvement over Simpson’s rule by a factor of almost eleven. 
The technique provides an example of the usefulness of Taylor polynomials in numerical analysis 
which may be presented to beginning calculus students as soon as Taylor’s theorem has been 
covered. 

In Simpson’s rule, the interval of integration, [ a, b], is divided into an even number, n, of equal 
subintervals each of length h = (b — a)/n. The approximation is then given by 


(1) f° F(x) de = ZL PC) + Af) + 2f (22) + 4) H/C), 


with an error bound of 
(2) | Error] < h*M,(b — a) /180, 


where M, is an upper bound on the magnitude of f over the interval [a, b]. 

We begin in the same manner with the exception that n is not required to be even. As an 
approximation to f over the ith subinterval, [x,_,,x;], we use the second degree Taylor 
polynomial for f expanded about X,, the midpoint of the ith subinterval. This polynomial is given 
by 


T(x) =f(X%) +f (X)(x — X%) + fO(X) (x — X))°72. 


Direct integration yields 


(3) [0 Td) dx = f(X) +55 | £(X). 


X;—1 


The approximation analogous to (1) is obtained by summing (3) over the 1 subintervals to get 
b n h? n 
[P(x) de =h YD (%) + 5g DL /%). 
a k=1 24 


This formula is of interest in its lack of dependence on f“) and also in that it is merely the 
midpoint rule plus a “correction” term involving the second derivative. 
To determine an error bound we assume f € C*[a, b] and express f over the ith subinterval in 


the form 
(4) 
k f ( Cy ) 
ay 


— X;) (x- X,)", 


I 


f(x) = yoy LON, 


where for each x © [x,_,,x,] an appropriate c, between x and X, is guaranteed by Taylor’s 
theorem. Direct integration utilizing the weighted mean value theorem for integrals yields 


[1 de = he fK) 454 F(X) + he f(a ‘)s 
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where d, € (X,_,, X;). Thus the absolute error over the ith subinterval is bounded by h?M,/1920 
and the error expression analogous to (2) is 


|Error| < nh?M,/1920 = h*M,(b — a) /1920. 


The degree of precision remains three while the error bound is reduced by a factor of nearly 
eleven. The cost of this improvement is the availability of (© and an additional n— 1 
evaluations. 


REBUSES AGAIN 
According to the American Heritage Dictionary, a rebus is a riddle composed of words or 


syllables depicted by symbols or pictures that suggest the words or syllables they represent. Can 
you decipher the ones below? 


aan 


* XIATAM A 


—Robert J. Beeber 


—W. Edwin Clark 


3. 3 J (ice)? d(ice) 
— Wayne M. Dymacek 


4a. 


4b. 


Dale T. Hoffman 


5. 
)) det — Dale T. Hoffman 
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— Daniel J. Sprows 


Answers on p. 440. 
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E 3093. Proposed by William C. Waterhouse, Pennsylvania State University. 


Let f(x,,...,X,) be a real symmetric polynomial, and suppose that f has an absolute 
minimum. Bunyakovskii showed long ago that the minimum need not occur at a point where all 
x, are equal (cf. this MONTHLY, 90 (1983) 378-387). Prove that Vf = 0 at some point where all x, 
are equal. Show on the other hand that this statement need not be true for an arbitrary C® 
symmetric function having an absolute minimum. 


E 3094. Proposed by Louis Funar, University of Craiova, Romania. 


Let D be a convex body in R°, and let \(D) = sup, area (a M D) where 7 is a variable plane. 
Prove that D can be divided into four parts D,, D,, D,;, D, so that A(D,) < A(D), i = 1, 2,3, 4. Is 
it possible to divide D into three parts with the same property? 
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he obtains Presburger’s decision procedure for arithmetic without multiplication and, in addition, 
he shows that the problem of deciding whether a given computational procedure will ever halt is 
algorithmically undecidable. Many details are left as exercises, and fairly detailed outline solutions 
are presented in an additional 50 pages. The book is intended for quite unsophisticated readers, is 
very clearly written, and should provide a very effective teaching tool. 

In order to accomplish his task, Fisher required not only a logical formalism for elementary 
number theory, but also a model of computation for giving a precise definition of computable 
function. From the many mathematically equivalent formalisms available for the purpose, Fisher 
chose “register” machines. This simple and elegant formalism is highly accessible, especially to 
readers with a modicum of programming experience. However, because Fisher is working with a 
formalization of number theory in which + and - are the only functions explicitly available, the 
task of proving that all computable functions are representable (in a suitable sense) in his 
formalization is rather formidable. Fisher uses the now classical device, Gédel’s B function: 


B(c,d,i) =rm(c,1 +(i+ 1a), 


where rm(x, y) is the least nonnegative remainder when x is divided by y. Using the Chinese 
remainder theorem, it is not too difficult to show that for every finite sequence dp, a,,...,4,, 
there are integers c,d such that B(c,d,i)=a, for i=0,1,...,n. The successive steps of a 
computation by a register machine program can then be coded using this 8 function. The catch is 
that it is not much fun to formalize this in Peano arithmetic. 

An alternative path, which avoids much of the tedious complications, is first to prove as a 
theorem of ordinary mathematics that computability by register machines is equivalent to a more 
tractable notion. For example, by a theorem of Kleene (later improved by Julia Robinson), easily 
proved by an informal use of Gédel’s B function, the computable functions are just those 
obtainable from a few initial functions using the two operations of function composition, and 
taking the least zero of a function. Another attractive possibility only recently available is to give 
the new Jones-Matijasevi¢ proof of the Davis-Putnam-Robinson theorem that the graph of a 
computable function consists of the tuples which satisfy an exponential Diophantine equation. 
This new proof is very simple and is carried out using register machines. If this path is taken, the 
exponential function should probably be added to + and - as primitive functions for which the 
recursion equations are taken as axioms. The slight weakening of the Gédel-Rosser theorem that 
would result hardly matters at this level, and as a dividend one would obtain the unsolvability of 
the decision problem for exponential Diophantine equations. (Alternatively, one could digress to 
obtain Matijasevic’s theorem that the exponential function is Diophantine, thereby not only 
avoiding the weakening of the Gédel-Rosser theorem, but also obtaining the unsolvability of 
Hilbert’s tenth problem.) 

Two minor errata: On p. 34 it is stated that Russell and Whitehead’s Principia Mathematica 
was written independently of Frege’s work, which is certainly not the case. On p. 37 it is stated 
that there is no mechanical procedure for “deciding whether, for arbitrary n, a sequence of n 
successive 7s occurs in the decimal expansion of ,... .” What is presumably meant is that no 
such procedure is known. 


ANSWERS TO REBUSES ON PAGES 426-427 


1. Triangle inequality. 
2. The inverse of a matrix. 
3. Iceberg (= ice cube + sea = (ice)*? + C),. 
a. Power series. 
4b. Formal power series. 
5. Telescoping series. 
6. Subset. 
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— Daniel J. Sprows 


Answers on p. 440. 
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E 3093. Proposed by William C. Waterhouse, Pennsylvania State University. 


Let f(x,,...,x,) be a real symmetric polynomial, and suppose that f has an absolute 
minimum. Bunyakovskii showed long ago that the minimum need not occur at a point where all 
x, are equal (cf. this MONTHLY, 90 (1983) 378-387). Prove that Vf = 0 at some point where all x, 
are equal. Show on the other hand that this statement need not be true for an arbitrary C~ 
symmetric function having an absolute minimum. 


E 3094. Proposed by Louis Funar, University of Craiova, Romania. 


Let D be a convex body in R®, and let \(D) = sup, area (a M D) where 7 is a variable plane. 
Prove that D can be divided into four parts D,, D,, D,, D, so that A(D,) < A(D), i = 1,2,3,4. Is 
it possible to divide D into three parts with the same property? 
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E 3095. Proposed by Alexandru Lupas, Facultatea de Mecanica, Sibiu, Romania. 
Let p be a fixed natural number, (a,,)*°_,) a sequence of real numbers, and 
1 


Pp n 
-k{ P 
A’a, = » (-1)’ ‘( Ja, Ro An = » ay. 
Loo k + n+1 /~ 


If there exist constants m,,M, such that m, < A?’a, < M,, n= 0,1,2,..., prove that 


1 
p+l’? 


pa 


<A’A < M. n=0,1,.... 


E 3096. Proposed by the editors (cf. E2903 [1980, 619; 1983, 483; this issue]). 


Consider paths in the northeast quadrant which start at (0,0), such that each step is either one 
unit east or one unit north, and end at (2”,2n). A path is called a Whitworth path if it does not 
go above the diagonal y= x. A path is called a Shapiro path if it avoids the points 
(1,1), (3,3),...,(2n — 1,2 — 1), 1e., those on the diagonal with odd coordinates. Exhibit, for 
fixed n, a one-to-one correspondence between the set of Whitworth paths and the set of Shapiro 
paths. 


E 3097. Proposed by Florentin Smarandache, Lycée Sidi El Hassan Lyoussi, Sefrou, Morocco. 
Find all real numbers x for which 
(x + 1)* +(x + 2)* =(x 4+ 3)*. 


E 3098. Proposed by Roger Cuculiere, Paris, France. 


Given two circles with diameters JA = a and /B = b, and a set of smaller circles between 
them as in the following figures, find the total area enclosed by the small shaded circles in each of 
the following cases: 

(a) The center of one of the small circles lies on the (common) diameter of the large circles 


(Fig. 1). 
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(b) Two of the small circles are tangent to the diameter of the large circles (Fig. 2). 
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SOLUTIONS OF ELEMENTARY PROBLEMS 


Random Walks and Catalan Numbers 


E 2903 [1981, 619; 1983, 483]. Proposed by Louis W. Shapiro, Howard University. 


Consider walks in the northeast quadrant which start at (0,0) and such that each step is one 
unit east or north and the points (1, 1), (3, 3),...,(2k +1,2k + 1), ... are forbidden. How many 
paths are there to (2n,2n)? 


Solution by D. M. Bloom, Brooklyn College, and the proposer. By a “good” path we mean one 
in which each step is one unit north or east. For n > 0, let c,, be the number of good paths from 
(0,0) to (n, n) which never go above the diagonal x = y; for n > 1, let d, be the number of good 
paths from (0,0) to (n,n) which lie entirely below that diagonal (except of course at the 
endpoints); and let b,,, be the number which the problem asks for. By considering the least k > 0 
for which a given path meets the point (k, k), we easily see that 


o 
I 


,=» d,Cy—5 (with cy = 1), 
k=1 


by, = 2 » An Dyn 25 (with by = 1). 
k=1 

Since evidently d, = c,_,, the above becomes 

n-1 n-1 
(1) Cn » CiCn-1—;9 bs, =2 » Cy 4197-1; 

j=0 j=O0 
A straightforward induction using (1) shows that b,, = c>, for all n>0. But c, is the nth 
Catalan number (2" /(n + 1).* Hence 


b,, = (4) Jans 1). 


(*The solution to E 2253 (this MONTHLY, 1971, p. 797) gives Feller, An Introduction to Probability 
Theory and Its Applications, Wiley, New York, Second Edition, p. 71, as a reference for this result. 
The result is also obtainable as a special case of the ballot problem.) 


Comment by the editors: This solution is published in response to a letter by Professor Ira 
Gessel of MIT who writes 


“.. The published solution to AMM problem E 2903, ‘Random Walks and Catalan 
Numbers’, which appeared in the August-September 1983 MONTHLY ... seems to claim 
that the number of paths from (0,0) to (”,”) which avoid certain points depends only on 
the number of ‘options excluded.’ This is clearly false. For example, the number of paths 
from (0,0) to (n,n) which avoid the point (a, b) is (7") —(4+°?)\ er 4— >), which is not 


independent of a and b. However, if 0 < a, b <n there are always exactly two ‘options 
excluded’.”’ 


We ask in Problem E 3096, this issue, for problem solvers to reconsider this problem. 


Two Similar Triangles with a Common Circumcenter 


E 2974 (corrected) [1982, 756; 1983, 482]. Proposed by Jordi Dou, Barcelona, Spain. 


This problem, posed originally in the December 1982 issue of this MONTHLY, page 756, 
contained an error introduced in the editorial process. The problem should read: “Let AMB 
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(oriented clockwise) and CMD (counter-clockwise) be similar triangles. Prove that triangles ACX 
(clockwise) and YDB (clockwise), both similar to the first triangles, have the same circumcenter.” 


Solution I by Howard Eves, University of Maine. We shall employ the analytic geometry of the 
complex plane, and follow the custom of designating points and their affixes by corresponding 
upper and lower case letters respectively. For convenience we choose our coordinate system so 
that the circumcircle of triangle ACX is the unit circle centered at the origin. We then have 
aa = cC = xx = 1. Since triangle ACX is directly similar to triangle AMB, inversely similar to 
triangle CMD, and directly similar to triangle YDB, we have, in turn, 


aa ll c a l y ail 
mc 1/}=0,|m c 1!1/=0,|d c¢ 1/=0. 
b x 1 d x 1 b x 1 


From the first equation we obtain 
b —_— — 
—_ (c-—a)+a(x c) 
x—-a 
From the second equation, after eliminating m and simplifying, we obtain 
d= pea VG= 4) 
(c —a)(x — a) 
From the third equation, after eliminating d and simplifying, we obtain 
(c — a)(% — ¢) 
= b-——.——~_. 
(¢ —a)(x — ¢) 
We note that yy = dd = bb, and the origin is the circumcenter of triangle YDB. Triangles ACX 
and YDB thus share the origin as a common circumcenter. 


In the particular case where triangle AMB is right-angled at M, it follows that the line 
segments AX and YB bisect each other. 


Solution I1 by John Oman, University of Wisconsin-Oshkosh. A straightforward proof can be 
given using complex numbers and the following formula for the circumcenter Z, of the triangle 
whose vertices are the complex numbers Z,, Z,, and Z;: 


ZZ, ZZ, Z;Z, 
Ly Z, 2; 


1 1 1 

Z, = ——_— 
LZ, Ly ZL; 
ZZ Z,; 
1 1 1 


If M, A,B,C are represented by the complex numbers 0, 1, I, and B, respectively, then D, X, Y 
are given by BI, (1 — B)I' + B, and (T — BT) = + BY. The circumcenter for both triangle ACX 
and triangle YDB is 
lr — pf +pIT -IT 
r-T 


Solution III by the proposer. Let S, be the similarity of center A, angle «BAM, and ratio 
AM/AB, and Sc the similarity of center C, angle« MCD = «A, and ratio CD/CM (= AB/AM), 
We then have that R= S, - S. is a rotation of angle 2A, R(B)= D, and R(X) =C since 
S,(X) = C. If O is the center of R, we have x BOD = «x XOC = 2A. Because « BYD = « XAC 
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= A, they are inscribed angles in the circles having «BOD and «XOC as central angles, and 
hence O is the circumcenter of the two triangles YDB and ACX. 


Also solved by A. Bondesen (Denmark), W. Janous (Austria), O. P. Lossers (The Netherlands), J. -P. Monier 
(France), H. J. Morse, and W. A. Newcomb. 


A Hereditary Property of Cyclic Linear Mappings 


E 2978 [1982, 757]. Proposed by U. Abel, DKFZ, Heidelberg, Germany. 


A mapping g is called cyclic if P(g) = min{k|g* = identity} exists (g* = go --- og, k 
times). 

Let f: R” — R” be a cyclic linear mapping, R” = V ® W,V,W being subspaces of R”, and 
fV)=V.. ; 

Define f = Pry o fly: W — W (Pry denoting the projection on W along V). Then f is cyclic, 
and P(f) = Icem(P(f), P(flv)). 

Solution by N. Miku, Catholic University, The Netherlands. On a suitable basis the matrix A of 
f has the form (7 2), P representing f|, and R representing f. Let the order P(f) of f be a. 
Then P*’ = J, R* = J. Let b and c be the orders of P and R, respectively. Then bla, c|a, so with 
d = lcm(b, c) we may put a = ¢d. Putting A? =(/ a) we have I = A‘ =(! ‘S|, so S = 0. This 
implies ald, so a = d. Moreover one may conclude that, if P and R have finite orders b and c 
respectively, and d = lcm(b,c), A has finite order if and only if S = £47 P'OR?~‘~' = 0, and 
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the order of A then is d. 


Also solved by F. S. Cater, W. Cordwell, S. L. Davis, E. R. Gentile (Argentina), V. D. Mascioni (student, 
Switzerland), D. M. Wells, P. Y. Wu (Republic of China), and the proposer. Cater generalized the result to the 
context of a torsion-free Abelian group. 


ADVANCED PROBLEMS 


Solutions of these Advanced Problems should be mailed in duplicate to Professor D. H. Mugler, Department of 
Mathematics, University of Santa Clara, Santa Clara, CA 95053, by November 30, 1985. The solver’s full 
post-office address should be on each sheet. 
6498. Proposed by I. J. Schoenberg, University of Wisconsin, Madison. 
Let Po(x) = 1, Pi (x), P,(x),... be an Appell sequence of polynomials, i.e., 
P/(x) = P,_1(x) (n =1,2,...). 
If x,, (@ = 1,2,...,m) are the zeros of P,(x), show that 


lim max|x; ,| = 0, 
no U 


unless P(x) =(x-—a)"/n! (n =0,1,...) for some fixed a. 


SOLUTIONS OF ADVANCED PROBLEMS 
On Equal Integrals 


6440 [1983, 569]. Proposed by M. S. Klamkin, J. McGregor, and A. Meir, University of 
Alberta. 


Let F(x), G(x) be two functions in L,(— 00, 00) which satisfy 
f F(x) dx = [~ G(x) dx =1. 


Show that for any A in (0,1) there is a set E C (— 00, 00) such that 


J F(x) dx = [ G(x) dx = . 


Solution 1 by Gerald A. Edgar, The Ohio State University. By Liapunoff’s theorem [see 
W. Rudin, Functional Analysis, second edition, McGraw-Hill, Theorem 5.5], the set S in R? 
defined by 


S= {ee dx, [ G(x) ax) E measurable] 
E E 
is convex. Since (0,0) € S and (1,1) € S, it follows that (A,A) € S for any A € (0,1). 


Solution 2 by the proposers. (This solution is more elementary than the above.) Let 
A ={x: F(x) > G(x)}, B=R\A, let A, = AN (—0,¢), B = BN (—o,?¢) and let 


a(t) =f (F(x) —G(x)) dx, B(t) = f (G(x) ~ F(x) de. 
A, B, 
Then a(t) and B(t) are monotone non-decreasing, continuous functions with a(— oo) = B(— co) 


and a(co) = B(co). Therefore, for every ¢ there exists y = y(t) such that a(t) = B(y). Now set 
C, = A, U B,. Then clearly C, Cc C, when ¢ < s, and (C,) > n(C,) when s > ¢ + . Hence the 
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function 
[ F(x) dx = { G(x) dx =: H(t) 


is continuous. Since C_,, = © and C, = R, it follows that H(t) attains all values between 0 and 
1. Thus, for A & (0,1), there is a number 7 such that H(r)=A and E = C_ has the required 


property. 


Also solved by A. N. Al-Hussaini (Canada), Preben Alsholm & Michael Ratliff, Miroslav D. A&’ié (Yugoslavia), 
Ethan Bolker, Aage Bondesen (Denmark), Victor Hernandez (Spain), L. E. Mattics, J. G. Mauldon, Stephen Noltie, 
M. Pachter (South Africa), Nicholas Passell, Vincent C. Peck, Rae Michael Shortt, and Alfonso Villani (Italy). 


Functionally Countable Spaces 


6444 [1983, 648]. Proposed by Fred Galvin, University of Kansas. 


A topological space X is functionally countable if every continuous real-valued function 
defined on X has a countable range. Prove or disprove: the product of two functionally countable 
spaces is functionally countable. 


Solution by Salvador Hernandez, Facultad de Ciencias Matemdaticas, Valencia, Spain. Let 
X = W(w,) be the space of all ordinals less than the first uncountable ordinal. Let Y = D U { p} 
where D is an uncountable discrete space, p € D and, if p € V, then V is a neighborhood of p 
when the cardinality of D — V is countable. It is clear that X and Y are functionally countable 
spaces. 

Let ¢: X — D be an injective map, and let H = {(a,¢$(a)): a is an isolated point of X}. 
Then H is an uncountable discrete closed and open subspace of X X Y. Thus X X Y is not 
functionally countable. 


Also solved by Klaas Pieter Hart (The Netherlands) and the proposer. 


The Generalized Totient Function 


6446 [1983, 709-710]. Proposed by Paul S. Bruckman, Carmichael, CA. 


Define the generalized totient function ¢, as follows: 


o(n)= > k’, r=0,1,2,..., 
k=1 
(k,n)=1 


and its (Dirichlet) generating function 
f(s) = Li &(n) fr’, 
n=1 


defined for an appropriate domain of s. Show that 


1 r+1 r+] 
f(s) = -——T » ( )1, 1-4l$(s — &), s>rt 2, 
(r+ 1)f(s—r) {7\ k + 
where the B,’s are Bernoulli numbers and § is the Riemann Zeta function. 
Solution by O. P. Lossers, Eindhoven University of Technology, Eindhoven, The Netherlands. 
The expression for f.(s) as stated is incorrect. It should read 


f(s) =1 *GaDiGlD y ("+ ) Bsr 48(s — k). 


k=1 


To see this, note that since $,(n) = O(n’*") the series defining f.(s) is absolutely convergent 
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when Res > r + 2 as also is the series defining {(s — r). Hence, for Res > r+ 2, 
a 
Ls =O, 


where 


and it is known (see, e.g., I. S. Gradshteyn and I. W. Ryzhik, Tables of Integrals, Series and 
Products, Academic Press, 1980, 0.121, p. 1) that 


n 1 r+l1 rt] 
want LEB ant 
j=l 


It follows that, for Res > r+ 2, 


L(s)(s— A) =o N+ Y (THB gs 8). 


k=1 


Also solved by C. Georghiou (Greece), Jonathan M. Jacobs, H. Jager (The Netherlands), William Newcomb, and 
the proposer. 


Factors of ( ‘) 


6447 [1983, 710]. Proposed by P. Erdos, Hungarian Academy of Sciences, and J. L. Selfridge, 
Mathematical Reviews. 


For fixed k > 2, if n > 2k, show that there is at least one i, 0 < i < k — 1, such that n —i 
does not divide ("). On the other hand, there is an n, > 2k for which n — i divides (") for all 
but one j in the range 0 < i < k — 1. Estimate the smallest such n, as well as you can. 


Partial solution by Jean-Marie Monier, Lyon, France. Let k >2, n>2k. Suppose n, 
n—1,...,.n—k + 1all divide ("). Since n(n — 1)---(n—k + 1) =k1("), we have: 


(n—1)(n — 2)(n — 3)---(n-~kK +1) € KIN 
n(n — 2)(n— 3)---(n-kK+1 E KIN 


nin —1)(n — 2)---(n—k +2) E KIN. 

Subtracting each expression from the one below we get: 
(n — 2)(n—-3)\(n-4:--(n-—k +1 EKIN 
nin — 3)\((n-—4---(n-k+1EKIN 


n(in—1)\(n— 2)---(n—k+3)EKIN. 
Repeating the process A — 1 times we arrive at 
2-3-4---(k-1) EKIN, 
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which is a contradiction since 1/k € N. Hence there is at least one i, 0 < i < k — 1, such that 
n — i does not divide ("). 


For the next part of the problem, we observe that for n, = 4, n, = k! when k > 3, ("| is 
divisible by n, — 1, n, — 2,...,n, —k +1. 


Also partially solved by Kee-Wai Lau (Hong Kong) and the proposers. The question of estimating the smallest 
n, > 2k remains open. 


On Clausen’s Integral 


6448 [1984, 59]. Proposed by Henry E. Fettis, Mountain View, CA. 
Show that 


7 (1 2a 
wsc(3)-¥(3)- "> 
where Cl,(@) = — {{1n(2 sin 41) dt is Clausen’s integral and (z) = (d/dz)InT(z). 
Solution by William A. Newcomb, Lawrence Livermore National Laboratory, Livermore, Cali- 
fornia. Let 


Ss, = ) eT Ss, = ne ae 
n=0 (3n + 1) n=0 (3n + 2) 
~S (y" FS _" 
A, = 2? 277 > ° 
n=0 (3n + 1) n=0 (3n + 2) 
Then 
co 1 io 6) 1 or 1 or Aq 
S+5= 25-2 =—- —-—-—-—.—- = —, 
n= n=0 (3n + 3)” 6 9 6 27 
S;-4,=2)) ———> = 5 2» 
n-0 (6n + 4) 
and 
n=0 (6n + 2) 
Hence 
3 dor? 
Ay + Ay = 5(S, — Sy) = 38, — = 


We also have the well-known series representations 


c1,(0)= yee cb <n, 


n=1 n? 

/ - 1 

W(t)= ) ——>;, ¢#0,-1,-2,.... 
n=0 (n+ t) 


Consequently 


W'(1/3) = 9S, and Cl,(7/3) = Boa, + Ay) = aac 2), 


Also solved by K. F. Andersen (Canada), Thomas Archibald (Canada), Paul F. Byrd, Laszlo Cseh & Imre 
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Merényi (Romania), C. Georghiou (Greece), M. L. Glasser, O. P. Lossers (The Netherlands), Armel Mercier 
(Canada), Roger B. Nelsen, Norbert Ortner (Austria), Stan Philipp, Bertram Ross, Lajos Takacs, A. Tissier (France), 
Michael Vowe (Switzerland), and the proposer. 


A Property of Polynomials 


6449 [1984, 59]. Proposed by Kent D. Boklan (student), Massachusetts Institute of Technology. 


Determine the smallest closed subinterval [a,b] of [0,1] such that if p is a polynomial of 
degree at most 6 that increases on [a, b], then necessarily p(1) > p(0). 


Solution by Miroslav D. ASic, University of Belgrade, Yugoslavia. The smallest such subinterval 
[a, b] is given by 


_i_ vis, _ 1, VIS 
a9 10” ~ 39" 10° 


Suppose first that p is an arbitrary polynomial of degree at most 6 increasing on [a, b]. Then 
1 
p’(a) > 0, (>) >0, p’(b)>0. 


Since deg p’ < 5, we may use the corresponding Legendre- Gauss quadrature formula (adapted to 
the interval [0,1]) to obtain 


p(1) — p(0) = fir») dx = [5p’(a) f+ 8'( 5 f+ sp'(b)) | 18 > 0. 


Assume next that there is a shorter subinterval [c, d] C [0,1] with the required property. Then 
a<c or d< b. Consider the case a < c (the other case can be treated analogously). Let 


p(x) =(2= 02-5) (x= by 


obviously p’(a) < 0 and p’(x) > 0 on [c,d]. Using the same quadrature formula again we have 


p(1) — p(0) = [5p'(a) + 8p'(5| + sp'(b)) [18 = Sp(4) <0 
which is a contradiction. 


Also solved by Stan Philipp, Doug Tyler & Brad Parsons, and partially solved by the proposer. 


REVIEWS 


EDITED BY ALLAN L. EDMONDS AND JOHN H. EWING 
COLLABORATING EDITOR FOR FILMS: SEYMOUR SCHUSTER 


A Problem Seminar. By Donald J. Newman. Problem Books in Mathematics, edited by P. R. 
Halmos. Springer-Verlag, New York, 1982. v + 113 pp. 


IVAN NIVEN 
Department of Mathematics, University of Oregon, Eugene, OR 97403 


Here is a challenging book for problem buffs. It is split into three parts, first a list of one 
hundred and nine problems, some of which have brief introductory statements, then a section 
giving hints on the author’s approaches to problems, and finally the solutions. The problems, at 
the level of advanced undergraduate work and beginning graduate work, are comparable in 
difficulty to the problems on the annual Putnam competitions. 
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What kinds of problems are included in the collection? Some types are identified by the author, 
as follows: estimation theory, generating functions, expected values in probability, problems on 
categories in the real variable sense of the word, and convexity. Of the fifty problems of these 
types, twenty-nine are listed under “estimation theory,’ by which the author means questions 
whose answers can be approximated but not expressed in a closed form. 

Since the other problems in the book are not classified, we offer here a rough description. 
There are several problems in each of these areas: asymptotic relations between functions; 
combinatorial geometry; number theory; real analysis; other probability questions; sequences, 
especially sequences of integers; partitioning the integers. There are a very few problems in each 
of these classes: algebraic operations; logic; abstract algebra; maxima and minima; recurrence 
relations and iterative processes; functional equations; densities of sequences of integers. Many of 
the problems cross over these lines of classification. 

The author’s interests are strongly represented, no doubt, in the selection of problems. Thus we 
find almost no questions in such areas as complex variable theory, geometry apart from 
combinatorial aspects, graph theory, algebra in general, and numerical analysis. An imbalance of 
this kind is to be expected in a problem collection by a single author. 

How original are the problems? Are they widely known? This reviewer, moderately acquainted 
with problem collections, already knew eight of the questions. For example, one of the problems 
was analyzed, with a reference to its source, in an invited lecture at the 1982 summer meetings in 
Toronto. The only really misplaced problem asks for a proof of one form of Stirling’s formula, 
which is surely a textbook matter. But all in all, the author has assembled a rather novel 
collection. Moreover, the problems are instructive; many of them illustrate an important point or 
two. Students can learn a lot of mathematical technique while enjoying the challenges. Further- 
more, in many cases a direct attack on a question leads into a morass, so that the book stands as a 
subtle argument in favor of sophisticated, even abstract, approaches. This is the book’s strongest 
and most attractive feature. 

The author is to be commended for doing more in the Solutions section than simply solving the 
problems. He outlines possible methods of solution, including occasionally approaches that fail. 
This is an attractive instructional device. 

We have three examples to illustrate the author’s approaches to problems. Here is problem 
#18: 


Evaluate V1 + 2V1+ 3¥1+ 4y1+--: . 


The author solves this problem by generalizing to the function 


f(x) = 1 t xfl +(x 4+ Dll +(x 2Vl4F-- 


which turns out to be more manageable than the special case with x = 2. 
Next consider problem #76: 


Prove n —[n/2] +[n/3] — --- ~ nlog?2. 


Newman points out that removal of the square brackets in [n/j] for 1 <j < n introduces an error 
of the same order of magnitude as n log 2, so that this direct approach fails. We must find another 
approach, perhaps by taking advantage of the alternating signs. (We won’t give away the secret 
here.) 

Problem # 80 is of a different type: 


Given that f(x) + f’(x) — 0 as x > 00, prove that both f(x) > 0 and f’(x) — 0. 


The hint suggests that we should use L’HOpital’s rule on f(x)e*/e*, since the derivative of 
f(x)e* is (f(x) + f(a)}e*. 


Turning to the question of who will use the book, the author gives a clear answer when he asks: 
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“Why have the graduate schools dropped the Problem Seminar?’ But it is unlikely that the book 
will be accorded this role in the already crowded graduate curriculum. Graduate faculty may want 
their students to solve lots of problems, but not miscellaneous problems. They usually want 
problems directed toward a specific subject, just as R. L. Moore did with his students of topology. 
Should graduate students spend their time solving such a question as the following, except for 
Spare-time amusement? 


Given any n distinct points in the plane, show that one of the angles determined by them is 
<a/n (the zero angle counts). 


It is often the case that a problem is easy enough for a reader familiar with the topic, but 
troublesome for others. Consider for example problem #48: 


Call an integer square-full if each of its prime factors occurs to the second power (at least). 
Prove that there are infinitely many pairs of consecutive square-fulls. 


Any student familiar with the elementary theory of the Pell equation knows that for any positive 
integer k not a perfect square the equation x* — ky* = 1 has infinitely many solutions in integers 
x and y. From x* — 8y? = 1 we get the consecutive square-fulls required. (The author gives a 
different solution, using only college algebra and a dash of Euler’s kind of ingenuity.) 

Graduate work aside, the book will find a place in advanced undergraduate seminars, such as 
those preparing students for the Putnam exams. More generally, anyone who likes the challenge of 
novel problems will enjoy this collection immensely. 


Formal Number Theory and Computability: A Workbook. By Alec Fisher. Oxford Logic Series 
Number 7. Oxford University Press, New York, 1982. xii + 190 pp. 


MARTIN D. DAVIS 
Courant Institute, New York University, 251 Mercer Street, New York, NY 10012 


How much should be left to the reader? A tradition which goes back to Euclid calls for a 
mathematical expositor to include all details. Perhaps the purest example in this tradition in this 
century is the “Satz-Beweis” style of Edmund Landau. But there are very successful textbooks 
written with an opposite point of view. The example that comes to mind is Polya and Szegé’s 
famous two-volume collection of problems in which readers are encouraged to obtain their own 
proofs, before consulting the carefully arranged outlines of solutions at the back. 

The question of which details may be “left to the reader” is particularly troublesome for 
writers of textbooks in mathematical logic. The methods of mathematical logic require that 
various processes of ordinary mathematics be “formalized.” This means that it needs to be 
demonstrated that certain proofs and algorithms can be developed within formal systems which, 
at least superficially, seem severely limited in scope and power. This presents no problem at all for 
working logicians who have a perfectly clear idea of what can be done using a particular 
formalism. But it does present a real dilemma for logicians attempting to teach their subject. To 
follow the Euclid-Landau model and simply present all details is to risk drowning the novice in a 
sea of tedious details while helping to perpetuate the myth (all too commonly believed by 
mathematicians a generation ago) that mathematical logic consists of nothing but these tedious 
details. On the other hand, to airily dismiss such details as ““obvious” is to risk having the subject 
lose its cogency, and to leave students with little intuition for what can be formalized in a given 
system. Alec Fisher’s little “workbook” presents an approach which is much closer to Polya-Szegé 
than to Euclid-Landau. In only 135 pages of lucid exposition he obtains Gédel’s incompleteness 
theorem (actually the somewhat stronger Gédel-Rosser theorem) for Peano arithmetic. En route 
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he obtains Presburger’s decision procedure for arithmetic without multiplication and, in addition, 
he shows that the problem of deciding whether a given computational procedure will ever halt is 
algorithmically undecidable. Many details are left as exercises, and fairly detailed outline solutions 
are presented in an additional 50 pages. The book is intended for quite unsophisticated readers, is 
very clearly written, and should provide a very effective teaching tool. 

In order to accomplish his task, Fisher required not only a logical formalism for elementary 
number theory, but also a model of computation for giving a precise definition of computable 
function. From the many mathematically equivalent formalisms available for the purpose, Fisher 
chose “register” machines. This simple and elegant formalism is highly accessible, especially to 
readers with a modicum of programming experience. However, because Fisher is working with a 
formalization of number theory in which + and - are the only functions explicitly available, the 
task of proving that all computable functions are representable (in a suitable sense) in his 
formalization is rather formidable. Fisher uses the now classical device, Gédel’s B function: 


B(c,d,i) =rm(c,1 +(i+ 1a), 


where rm(x, y) is the least nonnegative remainder when x is divided by y. Using the Chinese 
remainder theorem, it is not too difficult to show that for every finite sequence a),a,,...,4,, 
there are integers c,d such that B(c,d,i)=a, for i=0,1,...,n. The successive steps of a 
computation by a register machine program can then be coded using this 8 function. The catch is 
that it is not much fun to formalize this in Peano arithmetic. 

An alternative path, which avoids much of the tedious complications, is first to prove as a 
theorem of ordinary mathematics that computability by register machines is equivalent to a more 
tractable notion. For example, by a theorem of Kleene (later improved by Julia Robinson), easily 
proved by an informal use of Gédel’s B function, the computable functions are just those 
obtainable from a few initial functions using the two operations of function composition, and 
taking the least zero of a function. Another attractive possibility only recently available is to give 
the new Jones-Matijasevi¢ proof of the Davis-Putnam-Robinson theorem that the graph of a 
computable function consists of the tuples which satisfy an exponential Diophantine equation. 
This new proof is very simple and is carried out using register machines. If this path is taken, the 
exponential function should probably be added to + and - as primitive functions for which the 
recursion equations are taken as axioms. The slight weakening of the Gédel-Rosser theorem that 
would result hardly matters at this level, and as a dividend one would obtain the unsolvability of 
the decision problem for exponential Diophantine equations. (Alternatively, one could digress to 
obtain Matijasevic’s theorem that the exponential function is Diophantine, thereby not only 
avoiding the weakening of the Gédel-Rosser theorem, but also obtaining the unsolvability of 
Hilbert’s tenth problem.) 

Two minor errata: On p. 34 it is stated that Russell and Whitehead’s Principia Mathematica 
was written independently of Frege’s work, which is certainly not the case. On p. 37 it is stated 
that there is no mechanical procedure for “deciding whether, for arbitrary n, a sequence of n 
successive 7s occurs in the decimal expansion of ,... .” What is presumably meant is that no 
such procedure is known. 


ANSWERS TO REBUSES ON PAGES 426-427 


1. Triangle inequality. 
2. The inverse of a matrix. 
3. Iceberg (= ice cube + sea = (ice)* + C). 
a. Power series. 
4b. Formal power series. 
5. Telescoping series. 
6. Subset. 
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Elementary Applied Partial Differential Equations with Fourier Series and Boundary Value Prob- 
lems. By Richard Haberman. Prentice-Hall, Englewood Cliffs, NJ, 1983. x + 533 pp. $34.95. 


GREGORY B. PASSTY 
Department of Mathematics and Computer Science, Southwest Texas State University, 
San Marcos, TX 78666 


Whenever I have taught Fourier series and their applications, I have found two topics that 
merit special care—the modeling process and the validity of series solutions. While inclusion of 
both topics is necessary for a complete course in applications of Fourier series with mathematical 
rigor, either could be considerably deemphasized without seriously affecting the rest of the 
material. Testimony to this is provided by the varying treatments given in the different textbooks 
used to teach the subject today. 

Two basic types of students take such a course. One group is made up of mathematics majors 
looking for an introduction to partial differential equations. The other group comprises science 
majors, most notably from physics and engineering, who are looking for mathematical under- 
standing of problems and problem-solving techniques that they may have already encountered in 
their own majors. 

The mathematics majors should have more of an interest in the difference between formal and 
actual solutions and associated ideas—convergence, term-by-term differentiation, etc. They seem 
to have less interest in the modeling process because they haven’t really practiced it very much, 
and also because, with several recently completed abstract courses behind them, they have yet to 
perceive how thoroughly involved with modeling some mathematical careers can be. For the 
science majors, the steps and decisions leading to partial differential equations that describe 
natural phenomena and are, at the same time, amenable to analysis (if not complete solution) 
seem far more important than a discussion of validity. They know that the methods work in 
situations of interest to them, anyway. 

With Elementary Applied Partial Differential Equations, Richard Haberman presents a very 
thorough text for a first course in partial differential equations while finding excellent approaches 
to the two topics I have mentioned. The first half of the book deals with the method of separation 
of variables and all that entails. Following a chapter on nonhomogeneous problems, the second 
half comprises several other topics: Green’s functions, the method of characteristics, Fourier and 
Laplace transforms, and finite difference numerical methods. 

This book will appeal to both of the audiences mentioned above. Haberman begins with a 
chapter-long derivation of the heat equation and its various boundary conditions in one and then 
two or three dimensions. All physical ideas are very patiently and clearly explained. Specific heat, 
for example, is introduced after we are brought to the realization that we may need to consider 
some distinction between thermal energy and temperature. The development leads first to an 
integral conservation law and then to the heat equation as we generally think of it. Each step of 
the process is made clear, and many variations are either explained or suggested. Note that the 
derivation (via the divergence theorem) of the heat equation in several dimensions is given here in 
Chapter 1 rather than later, as in most other treatments of the subject I have read. 

An important feature of this first chapter which persists throughout the book is the repeated 
appeal to physical reasoning to suggest or interpret the mathematics. In the exposition and also in 
the exercises, the student is often asked to explain physically the meaning of a certain mathemati- 
cal result. Haberman does not »:se physical reasoning in place of proofs, however. He communi- 
cates to the student that mathematics and physics can work closely together, each illuminating the 
other. In Chapter 1, the modeling concepts are stressed as much as the mathematics, and ideas 
developed here are repeatedly referred to subsequently, for example to explain in advance why the 
separation constant is most naturally written with a minus sign. 

Chapters 2, 3, and 4 consist of series solutions of the heat equation, wave equation, and 
potential equation (with various boundary conditions), as well as the required discussion of 
Fourier series, half-range expansions, and so on. In spite of this array of different problems, there 
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has appeared through the first four chapters only one differential equation for the spatial 
eigenfunctions: ¢” = —Ad@, with various boundary conditions. This displays an admirable peda- 
gogical quality: resistance to the temptation to throw in too many new things all at once, 
especially when they’re not needed for the discussion at hand. (Other differential equations do 
appear in the exercises.) The same quality is manifested in Chapter 7, in a section on forced 
vibrations of a membrane of arbitrary shape. The shape is unspecified, so the corresponding 
eigenfunctions must go uncomputed. While the problem thus remains fairly general, Haberman is 
able to carry on a very informative discussion of forcing, periodic forcing, and resonance without 
cluttering the landscape with a pedagogically unnecessary digression. 

The author presents quite a few proofs throughout the book. Most of the time, they are 
included as needed in the exposition. In some cases (e.g., the Riemann-Lebesgue lemmas), they are 
left for optional sections or appendices. 

We now summarize briefly Haberman’s discussion of the validity of the series solution to the 
heat equation. Let f(x) be a periodic function. The keys are the following two theorems: 


(1) [p. 99]. If f’(x) is piecewise smooth, then a continuous Fourier series corresponding to f(x) 
can be differentiated term by term. 

(2) [p. 93]. For a piecewise smooth f(x), the Fourier series corresponding to f(x) is continuous if 
and only if f(x) is continuous. 


Now consider the validity of 


— . AMX na? 
u(x,t) = Xu B, sin— exp Th ka 

as a solution to the heat equation u, = ku,,,, with boundary conditions u(0, t) = u(L, t) = 0, and 
initial condition u(x,0) = f(x). The B,’s are of course the Fourier sine series coefficients for 
f(x). Under the assumption that u(x,t) and du/dx(x, t) are continuous functions of x (required 
by the physical formulation) and that d7u/dx* is at least piecewise smooth, we are justified in 
computing the second derivative 0*u/dx* term by term. On the other hand, term-by-term 
differentiation with respect to ¢ is justified if u(x,t) is continuous and du/dt is piecewise 
smooth. (The proof, depending on Leibniz’s rule, is left as an exercise.) Comparison of the 
term-by-term derivatives shows the heat equation is satisfied by u(x, £). 

Relying upon the continuity assumption, Haberman never mentions the notion of uniform 
convergence, which can be used to derive the regularity properties hypothesized for u and its 
derivatives from regularity properties of f(x). But this omission is certainly acceptable in a course 
where most of the students have not encountered uniform convergence in their previous studies. 

Looking at other texts, I find that most do not discuss the validity question at all, but do say 
that the solution obtained is a formal solution. Kreyszig [3, p. 423ff.] demonstrates a validity 
argument for the solution of the wave equation (with zero initial velocity for simplicity), and notes 
that validity ultimately follows from continuity of the initial displacement and piecewise continu- 
ity of its derivatives. There is again no mention of uniform convergence in this context. 

The first book from which I learned differential equations, by Kreider et al. [2, p. 380ff., p. 
522ff.], does discuss uniform convergence and its consequences for Fourier series. It is pointed out 
that the initial condition really says lim, _, 9+u(x, ¢t) = f(x) for all x in (0, L), and that uniform 
convergence supplies the continuity necessary to verify that this really happens. Churchill [1] also 
shows how the necessary continuity properties follow from regularity of the initial and/or 
boundary conditions by way of Abel’s Test for Uniform Convergence. I feel that both of these 
very good treatments would be difficult to follow for a class without earlier experience with 
uniform convergence. Haberman’s compromise is a good one. 

Problems treated in the text are almost all variants of the heat, wave, or potential equation, 
with various nonhomogeneities. Quite a number of other variations appear in the exercises. There 
is some discussion of non-physical problems as well. These are used to shed additional light on 
Sturm-Liouville eigenvalue problems, the subject of Chapter 5. 
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A sufficient prerequisite for the student is ordinary differential equations, although some later 
sections suggest that prior brief exposures to complex variables and to Laplace transforms would 
be helpful. 

Haberman’s book is a fine treatment of elementary partial differential equations. I recommend 
it for anyone who wants to illustrate to math and science students the ways in which their 
disciplines interact and the insights each subject contributes in elucidating the other. 
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LETTERS TO THE EDITOR 


Material for this department should be prepared exactly the same way as submitted manuscripts (see the inside 
front cover) and sent to Professor P. R. Halmos, Department of Mathematics, University of Santa Clara, Santa 
Clara, CA 95053. 


Editor: 


Miscellaneum 129 (“Triangles are square,” June-July 1984 MONTHLY) may have misled many 
readers. Here is some background on the item. 

That n’ points fall naturally into a triangular array is a not-quite-obvious fact which may have 
applications (e.g., to symmetries of Latin-square “k-nets’’) and seems worth stating more formally. 
To this end, call a convex polytope P an n-replica if P consists of m mutually congruent 
polytopes similar to P packed together. Thus, for n € N, 


(A) An equilateral triangle is an n-replica if and only if n is a square. 


Does this generalize to tetrahedra, or to other triangles? A regular tetrahedron is not a 
(2°)-replica, but a tetrahedron ABCD with edges AB, BC, and CD equal and mutually 
orthogonal is an n-replica if and only if n is a cube. Every triangle satisfies the “if’’ in (A), so, 
letting T be the set of triangles, one might surmise that 


(B) Vt & T (¢ is an n-replica if and only if 1 is a square). 


This, however, is false. A. J. Schwenk has pointed out that for any m € N, the 30°-60°-90° 
triangle is a (3m7)-replica, and that a right triangle with legs of integer lengths a and b is an 
((a? + b?)m7)-replica. As Schwenk notes, it does not seem obvious which other values of n can 
occur in counterexamples to (B). Shifting parentheses to fix (B), we get a “square-triangle” 
lemma: 


(C) (Vt © T, ¢ is an n-replica) if and only if n is a square. 

Miscellaneum 129 was a less formal statement of (C), with quotation marks instead of 
parentheses; this may have led many readers to think (B) was intended. To these readers, my 
apology. 

Steven H. Cullinane 


501 Follett Run Road 
Warren, PA 16365 
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Editor: 


Professor Neal Koblitz has pointed out to me that there is an error in my paper “Factorization 
and primality tests” (this MONTHLY, 91 (1984) 333-352) in the criterion for irreducibility stated in 
Exercise 9. The criterion should read: 


A monic polynomial f(X) of degree d over Z, is irreducible if and only if 
(i) X?° = X(mod f(X)); but 
(ii) GCD(X?"”" — X, f(X)) = 1 for each prime g|d. 


Dr. Gustavus J. Simmons has sent me a report (“Status Report on Factoring at the Sandia 
National Laboratory”) about the work done on factorization by himself and colleagues J. A. 
Davis and Diane B. Holdridge at Sandia National Laboratories, Albuquerque, New Mexico, 
during the past few years. The report describes a method used for fairly routine factorization of 
numbers up to about 70 digits (as compared to the figure of 50 digits given in my paper.) This 
recent advance is credited to three things: use of Carl Pomerance’s quadratic sieve (an improve- 
ment on the continued fraction method) with a technical modification by J. A. Davis which makes 
it run particularly fast, the use of a very fast Cray computer, and programming by Tony Warnock 
and Diane Holdridge which takes full advantage of the parallel architecture of the Cray. The Cray 
computers form one of several families of machines which are capable of doing thousands of 
computations simultaneously (in “ parallel’), but special methods of programming are required in 
order to take full advantage of this property. Undoubtedly further advances in hardware will 
allow faster factoring programs even if no better algorithms are used. 


John D. Dixon 

Department of Mathematics 
Carleton University 

Ottawa, Ontario K1S 5B6 
Canada 


150. MISCELLANEA 


Is Mathematics a Mystery? 


As evidence goes, I favor what is usually called “circumstantial evidence” as against the 
testimony of witnesses... . Neither have I much faith in experts who claim infallibility in any field 
except, perhaps, abstract mathematics. 


— Dashiell Hammett, by 
Diane Johnson, Random House, 
New York, 1983, p. 18. 


It would be impossible now for me to remember all that he wanted to learn, but I remember a 
long year of study on the retina of the eyes; how to play chess in your head; the Icelandic sagas; 
the history of the snapping turtle; Hegel; would a hearing aid—he bought a very good one—help 
in detecting bird sounds; then from Hegel, of course, to Marx and Engels straight through; the 
shore life of the Atlantic; and finally, and for the rest of his life, mathematics. He was more 
interested in mathematics than in any other subject except baseball. 


—Lillian Hellman, from Dashiell Hammett, by Diane Johnson, 
Random House, New York, 1983, p. 169. 
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INTERESTING SERIES INVOLVING THE CENTRAL BINOMIAL 
COEFFICIENT 


D. H. LEHMER 
Department of Mathematics, University of California, Berkeley, CA 94720 


The adjective “interesting” is used here in a technical sense explained as follows. A series is 
called interesting in case there is a simple explicit formula for its nth term and at the same time its 
sum can be expressed in terms of known constants. Thus 


1 1 1 
I+stqto + on Tt = 2, 
1 1 1 (-1)""? 
l-s4+a-gtot + = log 2, 
16 81 256 n4 90’ 


are familiar examples of interesting series. 
The series we plan to discuss are of two types: 


I. y a, 2m) and sid. y 
n=0 


d 


where the a,, are very simple functions of n. We begin with series of Type I. 
By the binomial theorem we have 


() Lt) =e 


This converges if |x| < 1/4. If we put x = 1/8, for example, we get the interesting series 


2n 
1 3 5 (2 _ 
(2) l+7+a5 t+ jogt- + 3" + 2 
If x = 1/10, we get 
(22) 5 
(3) 1+ 2+ .06+ .02+ 007+ +--+ 07 toa ya. 
For x = —1/8 
—1)" 2n) 
1 3 °5 ( ( 2 
—_—_ — —S=> eee +. = —_— 
(4) 1-44 397 Tog 0 Fe 3 


Averaging (2) and (4) gives us 


_ 3v2 + V6 
ae 


(on) 
— ln 
arr 


Another step along this path is to bring in complex variables. Thus using ix instead of x leads to 


D. H. Lehmer: This paper marks the sixtieth anniversary of the publication of the author’s first paper in this 
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nn 0 gan 3 V5 ? 
or 
8n 
a (3 _ (1572 + 5¥6 + 6/5 ¥2 + V5 ) 
Lan 60 
n=0 8 
In general one can obtain a value for the sum 
[> an + b) 
— an + b 
L canto 


by setting x = e’/c (vy =0,1,2,...,a— 1), where e = exp(27i/a), in (1) and forming the 
appropriate linear combination of these results, a process known as multisection (see Riordan [4]). 
Another path out of (1) is to apply operators. If we integrate (1) from 0 to x we obtain 


*¥ 2m) 1" dt = a 
I, xl " i (1 — 42)'7 
Dividing both sides by x we get 


(5) Y (2") grag T= a), 


7 nyijn+1 
n= 


The coefficients C, of the series are integers known as Catalan numbers. Their first ten values are 
1,1,2,5, 14, 42, 132, 429, 1430, 4862,... 


and they occur frequently in combinatorial analysis (see Gould [2]). By using Stirling’s formula for 
n! we see that 


“Fane tt) 


From this we see that for x = 1/4 the series in (5) converges slowly like (1 /n?/7). Thus (5) gives 
us the interesting series 


We can treat (1) a little differently by transposing its first term to the right side, dividing both 
sides by x and then integrating. This gives us the improper integral 


“y 2m) dt = {+ __ +) a 
l, XI ” I, Fr —4r)'/* 1 
That is, 
= 1 2n\ n 1—yl— 4x 
(6) » -( mee = 2og{ = * ). 


Putting x = 4 we get the series 


1 
a7 567487 Sat Te 
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If we alternate the signs of this series, we get 


(-1""1'(2") 
13,5 35, 8 Ae) yg | ¥2 41 


2 16 48 £512 n4” 2 
Adding or subtracting these two series we can derive interesting series for the two sums 
1 9) 1 
yy (2”)— or = [ n) 
n odd nin4 n even nina 
n> 


We can integrate (6) and obtain 


(7) cv eo avtggh VE nak 


—~—— + x(log4 -— 1) -= 
yo, n(n +t 1) *(log ) 
If we substitute x = 1/4, we obtain the interesting series 
11s 7 2) 
n 
S44 yp ey phd eB] —1. 
4° 16° 19° 312° “a(n 4" 4) 


Any number of examples of this sort can be obtained by repeated integrations of (1). 
Another operator of this kind is differentiation. Let # be the operator 


xd 
v= dx 
If we apply @ to (1), we get 
— 1 | 3/2 
8 , n(2")x" = o( | = 2x(1 — 4x ; 
(8) xu am —- ( ) 
If we set x = 4%, we get 
2n 
1 3 15 . 35 n{ 2”) v2 
471671957 512° °° 7 ge > 


Operating again by # we obtain 
3 Cn 2( 2m) x" = 9 (1- ax) 07? = 222% + YD 
n=1 (i _— 4x) 
Setting x = 4, we find 
2(2n 
1 3 45 35 1575 no( 20) 5¥2 


4° 3° 1287 128 gi90 7 ge 


If in (1) we replace x by x* and then integrate both sides, we get 


et (or) 


= =~ (aresin 2x) 


A 


and for x 


(2n+1)16"” 3° 


oo 2n 
y (37) T 


We turn now to series of type II which involve the central binomial coefficient in the 
denominator. These series are more mysterious and less well understood. In an informal 
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discussion of Apery’s proof of the irrationality of §(3), A. J. van der Poorten [5] refers to four 
series of type II. The following theorem was quoted and its proof by Z. A. Melzak [3] was 
described as “not quite appropriate”. We give an entirely differently proof. 


THEOREM. If |x| < 1, 
Q2xaresinx & (2x)°” 
(9) a ee 
l1-—x m=1 m( 2m) 
m 
Proof. We use the familiar Gregory series 


= (ay ten 


(10) tarctanr= > tm a] 
m=1 
and set t= x/ V1 — x’, so that arctant = arcsin x. Then (10) becomes 
00 —] m-l_ am 
arcsinx = )> (1) x" _ 
1 — x’ m=1 (2m — 1)(1 — x*) 
m-1 CO 


-§ EO" Eyam 


| 
M 
No 
3 
pa 


m—-1 . 
y (=) 3 [” FI ‘sue 
m=1 - j=0 J 


’ 


ee (yr 
Le 2 Ca Dro myWam 1) 


It suffices to show that the coefficient of x*” is half of that on the right side of (9). That is, 


(11) (27 > —|y-)! — 92r-1 


ry} v= vl(r—v— 1)'(2v + 1) 


To prove this we use Wallis’ integral 
[" (sind yr ly _ 2: 4-6: er — 2) 
1-3-5. 


as follows. The left-hand side of (11) can be written 


CLO ar MLL re 


by substituting y = cos 8. Using Wallis’ integral we find 


r(27) 2 a - 1)! —92r-1. 


This proves (11) and hence the theorem is established. 


If we substitute x = 4, we get the result (van der Poorten [5], p. 202) 
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(12) x 72) = 


as our first interesting series of Type II. If we wish to alternate the signs in this series, we merely 
put x = i/2 in (9). This gives us 


> Oo tog 158 5. 


mol m( 2") 2 
m 
If we divide both members of (9) by 2x and then integrate, we obtain 
00 2m 
2 
(13) 2(arcsin x)” = x) 
m=1 | 2m) 
m 
from which we get, for x = 3, 
— 1 a 
(14) y 4-2 =i, 
mel m?( 2m) 18 
m 


where {(s) is Riemann’s zeta function. Also, 


BO fl 424)) 


mel m?( 2m) 2 
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If we divide both sides of (13) by 2x and integrate, we obtain 


> (2x)°” _ gf Caresin») dy. 
el m( 2m) 0 y 
m 
This integral is a “higher transcendent”. It is closely connected with Spence’s transcendent, 
Clausen’s integral, and the trigamma function. For x = + we obtain 


a /3 . Xx 
= -2f x log(2 sin} dx 


- 9-9 (4(3)-o13) 


where (x) is the trigamma function. Van der Poorten [6] rejects this evaluation as being 
non-instructive. However, he does give the interesting series 


1 1 1 1 (-1)""* 2 

—_— — Ss ——_— ——— ee eee ————$——$—$—$—$ + ose }— eed 

+ 481 540 4480 °°° 7 me (27) 5 (3) 
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as well as Comtet’s [1] remarkable 
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There are no known interesting series of the form 


for k > 4. 

If we apply the operator #* to both sides of (9), we get an almost unlimited number of 
interesting series. 

To begin with # we obtain 


0 2x)” x? x arcsin x 
(15) » TCs 
m=1 (2”) 1-x (1 — x“) 
m 


From this we obtain, for example, 


Peretti ty Lot 2mV/3 
2° 6 20° 70 (2) 27 
n 


If we replace x by ix in (15) and set x = a/b, we get another form of (15) 


> (-1)""*(2a/b)"” - = Dios a+ h) a), 


on (2) h b B 
m 
where h = Va? + b?. For example, if a = 1, b = 2, h = V5, we get 
1 1 1 61 (-1)"~ . —1. 4/5. 541 
2~6'2 wT (2) ne a) 
n 


For a = 23660, b = 23661, we have h = 33461 and we obtain 


( 23660)" ; 
= n—-1\ 23661 23660 (23661)°. 57121 
y (-1)" = = 23660-3660 + * =! Jog — 
n=1 ( 2m) (33461)° 33461 23661 
n 
= 811587506... . 


This series is of course convergent. If one examines the ratio of consecutive terms one finds that 
the terms increase in absolute value until » = 11830 when the terms are as large as 117. After that 
they decrease to zero. It is well that we have our formula for the sum. 

If we operate on (9) by higher powers of #, the series we obtain are of the form 


ore) k—2 9) 2m 
(16) mex) 
m=1 (27) 

m 

and the value of the sum depends, as a function of k, on two sequences of polynomials V,(t) and 
W,,(t) defined recursively as follows: 

V,(¢) = 1, W(t) = 0, 
(17) Viai(t) = {(2k — 2)t + IV, (t) + 201 — 1) OV, (2), 
(18) W,44(t) = ((2k — 4t4+ 2}W,(t) + 201 — 1) OW, (t) + V,(2). 


The first few polynomials are 


(k > 0) 


1985] INTERESTING SERIES INVOLVING THE CENTRAL BINOMIAL COEFFICIENT 455 


V,; =14 21, W, = 3, 

V,=1+4 10¢ + 42°, W,= 7+ 8t, 

V,=1+ 36t+ 6007 + 82°, Wo =15 + 70t + 2027. 
The value of the sum (16) can be given explicitly in terms of V, and W, by 


| arcsin xV;, (x?) + xv1 — x? W,.( x?)]. 


a 
2*-*1 _ x2) ei? 


If we operate on this by @ and collect the coefficients of arcsin x and x¥1 — x*, we obtain the 
formulas (17) and (18). A trigonometric form of this result is, with x = sin@, 


fore) k—-2am(. 2m . 
x man (sin) _sn2e [20V,(sin20) + sin20W, (sir?6)]. 
m=1 (27) (2 cos*@) 


If we replace x by ix we get for the alternating series 


ore —] m-1 k—2gm nh 2m . 
> (<1) mt *4"(sinhz)" = Sn? __ [9 Jog { sinh z + cosh z} 
m=1 (27) (2 cosh’z) 


x V,(—sink?z) + sinh2zW,(—sink’z)]. 


These two formulas with their two parameters @ and k yield a wide variety of interesting series as 
examples. We list only the following. The sums extend from 1 to oo. 
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In general, 
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(2m) (am) 8 
m m 
45m 
y ee = 1130 + 355 
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where b/a is a close approximation to 7. 
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y D2" 473, where p = V3 log(2 + 3) = 2.281037989 
m 
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Dualities and duality principles are prized wherever they occur in mathematics. Some optimists 
see them as mechanically doubling the number of results in a theory. Others (also optimists) see 
them as halving the number of results, but packing more substance into each. 

There are limitations which keep duality from being as powerful in graph theory as it is in 
many other areas, but it is still a major unifying theme. Robin Wilson’s introductory text [24] is 
organized around duality, and it is also central to many area of applications. As evidence of the 
latter, consider the emphasis on duality in books such as Johnson & Johnson’s Graph Theory with 
Engineering Applications [8], Price’s operations research monograph Graphs and Networks [19], 
and Nakanishi’s Graph Theory and the Feynman Integral (16). 

Classical graph-theoretic duality centers on the relationship between circuits and cutsets, with 
spanning trees also playing a basic role. The first section of this paper surveys this duality and its 
formulation as a syntactical principle. A key feature of this duality is the role played by a 
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distinguished edge. When this localization is removed, something rather surprising occurs: A 
second duality emerges, unexpectedly centered on the relationship between spanning trees and 
cutsets, yet having the same structural features as the traditional circuit/cutset duality. The 
second section below introduces this revisionist global duality. The third section applies this new 
duality to a portion of the theory of spanning trees. 


1. The Classical Circuit / Cutset Duality. Although our graph-theoretic notation and terminol- 
ogy follow Wilson’s text [24], it is useful to illustrate the basic concepts using examples. Review of 
the classical duality is especially important since part of the significance of the duality of Section 2 
is its relationship to classical duality. 


Fic. 1 


Graph G, in Fig. 1 has six vertices and ten edges and is a plane graph in that it has been 
drawn without edges crossing (except, of course, at the endpoints of the edges). Graph G, has five 
vertices and ten edges and is nonplanar in that it can be shown that there is no way to embed five 
points in the plane and connect them with ten curves in this way without at least one crossing. 
Two edges are called adjacent whenever they share a common endpoint, and a chain of edges is 
any sequence of edges (such as a, b, j, e in G,) with each interior edge of the sequence adjacent to 
its predecessor and successor at its two endpoints. For simplicity, we consider only graphs which 
are finite and connected (i.e., every two vertices are joined by a chain of edges). 

It is fairly common to treat vertices as the fundamental objects of graph theory, with edges 
merely being unordered pairs of vertices. But for studying duality (and in many applications), 
edges are taken as the fundamental objects. We are primarily interested in special sets of edges, 
calling the cardinality of such a set its size. A natural example is a circuit (i.e., a set of edges 
forming a chain beginning and ending at the same vertex without that vertex appearing again and 
without any other vertex being repeated). For instance, G, has twenty-two circuits of sizes 3, 4, 5 
and 6, one of which is {a,c,i,g}; G, has thirty-seven of sizes 3, 4 and 5. An equally basic but 
less natural example is a cutset (i.e., a minimal set of edges whose removal would leave a 
disconnected graph). For instance, G, has twenty-two cutsets of sizes 3, 4, 5 and 6, one of which is 
{a,b,i, 7}; G, has fifteen of sizes 4 and 6. (Check that the size-six circuits of G, are identical 
with its size-six cutsets.) While cutsets are fairly easily identified in plane graphs, they are harder 
to find in nonplanar graphs. For instance, { a, b,c, e,g,h} is a cutset of G). 

In studying duality, we allow some special kinds of edges which are frequently excluded from 
other parts of graph theory. In particular, we allow Joops (i.e., edges joining vertices to 
themselves). Loops are thus circuits of size one and are precisely the edges which can never be in 
cutsets. We also allow multiple edges (i.e., edges joining the same pair of vertices); in other words, 
we allow circuits of size two. 

The relationship between circuits and cutsets is the heart of graph-theoretic duality, but it is 
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very vague what duality means beyond that. (The same duality occurs in electrical network theory 
between current and voltage-drops, but it is equally vague there; see [15].) The best description 
may be that certain parts of graph theory resemble other parts when the roles of circuits and 
cutsets are reversed. For instance, the condition (commonly called nonseparability or two-con- 
nectedness) that every two edges are in a common circuit is equivalent to every two edges being in 
a common cutset. Similarly, the theory of eulerian graphs (in which every cutset has even size) 
somewhat resembles the theory of bipartite graphs (in which every circuit has even size). 
“High-level” discussion of duality are contained in [21] and [7]. 

The duality of circuits and cutsets suggests other dual pairs of concepts. For instance, an 
isthmus is an edge which constitutes a cutset of size one and is the only sort of edge which can 
never be in a circuit. So isthmuses act as duals of loops. Spanning trees are maximal sets of edges 
which contain no circuit (e.g., {a,b,h,i, 7} in G,) and are very important in both the theory and 
applications of graphs. Surprisingly, the complements of spanning trees (within the total edge set) 
act as their duals. These complements of spanning trees are called cotrees. 

As vague as duality is in general, it becomes very concrete for plane graphs; see [22]. For 
instance, from our example G, we can define a new graph G}* whose vertices correspond to points 
from each of the six “regions” of the geometric drawing of G,. Edges are drawn between vertices 
of Gj‘ precisely when the corresponding regions of G, are bordered by a common edge. In this 
geometric setting, circuits of G, correspond to cutsets of Gj‘ and vice versa, loops to isthmuses, 
and spanning trees to cotrees. But none of this carries over to nonplanar graphs such as G,. 

The opposite strategy is also useful. Instead of narrowing the family of graphs to plane graphs, 
we can enlarge it to matroids, as was done by Whitney [23]. (An excellent introductory survey 
occurs in [25].) Within matroid theory, duality is very precise and powerful. Yet graphs are more 
general than plane graphs and less general than matroids. Graph-theoretic duality is very different 
from matroid duality and is inherently limited. | , 

Despite its limitations—and because of its vagueness—it is profitable to view graph-theoretic 
duality syntactically, that is, in terms of how properties can be expressed in formal languages and 
the effect of the symbolic interchange of circuits with cutsets. Although there is no elegant duality 
principle as in projective geometry or boolean algebra (or matroid theory), there are limited 
duality principles which are interesting. They are true duality principles in that they are 
equivalences of statements in which circuits and cutsets have been interchanged. But their 
application is restricted to very specific types of statements, and a change is also required in the 
logical structure of the statements. 

The simplest such graph-theoretic duality principle originated with George Minty and electrical 
network applications in the early 1960’s; see [14] or [6, page 41]. Suppose we have a graph with a 
distinguished edge e and suppose some of the edges (possibly including e) are somehow 
“marked”. Abbreviating X — {e} as X — e, Minty’s principle states that either 


(J circuit X containing e)(Wx © X — e)(x is marked) 
or 
(J cutset X containing e)(Wx © X — e)(x is not marked), 


but not both. By letting M(x) abbreviate “x is marked” and doing some elementary logical 
manipulation, this can be rephrased as the following equivalence, holding for every graph and 
every distinguished edge e: 


(1.1) (J circuit X containing e)(Wx © X — e) M(x) 

«> (V cutset X containing e)(4x © X — e) M(x). 
In this form, M(x) can either be thought of as identifying an arbitrary set of edges or as being an 
edge predicate and so any symbolic expression involving x (but not X). 


There are many ways to modify this principle and many ramifications of the modifications; 
these are surveyed in [13]. Notice that the equivalence (1.1) involves the logical duality of the 
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universal and existential quantifiers, as well as circuit/cutset duality. In fact [12] views this as 
identifying the quantifier duality with matrodial duality. (Edmonds and Fulkerson [3] also allude 
to this.) 

To see this in more detail, notice that the formulation 


—(4 circuit X containing e)(Wx € X — e) M(x) 
constitutes a generalized quantifier (in the sense of [9, pages 100-101]). This could be abbreviated 
as OQxM(x) with the quantifier Qx translated as “for all x (except possibly e) of some circuit 


containing e”. Just as ~Vx—M(x) (which is equivalent to JxM(x)) is the logical dual of 
VxM(x), we can dualize this “circuit containing e” quantifier as 


—(J circuit X containing e)(Wx © X — e) M(x) 
which is equivalent by (1.1) to 

—(V cutset X containing e)(4x € X — e)4 M(x), 
and so is equivalent to 

(J cutset X containing e)(Vx © X — e) M(x). 
Therefore these “circuit containing e” and “cutset containing e” quantifiers are logical duals of 
each other. 
The dependence on the distinguished edge e is also a conspicuous feature of (1.1). This 

relativization to e is often incorporated into the graph by treating e as being so distinguished that 


it is not even drawn. Instead, its endpoints—call them e and 6—are distinguished as “‘ terminals” 
of the remaining graph. For instance, G, could be reconfigured as shown in Fig. 2. 


Fic. 2 


The circuits containing e now appear as e,6-chains; i.e., as chains connecting e with 6. The 
cutsets containing e correspond to e,6-cutsets; 1.e., to cutsets separating e from 6. This two-termi- 
nal approach can be built into (1.1) as 


(1.2) (We, 8){(de, &-chain ¥)(Vx € X) M(x) @ (We, d-cutset X)(4x € X) M(x)]. 


As one advantage of this formulation, it can be easily related to the max-flow min-cut theorem 
and other parts of network flow theory. It also emphasizes the very fundamental dual-like 
relationship between connection and separation. 


2. The Tree / Cutset Duality. What makes equivalence (1.2) local is the dependence on the 
vertex pair 2,6. This can be avoided by “distributing” the (We,6) quantifier across the @ 
symbol: 


(We, 6)(Je, d-chain X)(Wx © X) M(x) = (Ve, 6)(Ve, 6-cutset X)(4x © X) M(x). 
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The right side talks about all cutsets of the graph; e and 6 are irrelevant. Letting tree hereafter 
abbreviate “spanning tree,” the left side is easily rephrased as 


(J tree X)(Vx € X) M(x). 
Thus this equivalence is naturally expressible as the following global tree /cutset duality principle: 
(2.1) (J tree X)(VWx € X) M(x) = (V cutset X)(Ax © X) M(x). 
There is an alternate form of (2.1) which is essentially its contrapositive: 
(V tree X)(Ax € X) M(x) © (A cutset X)(Vx © X) M(x). 
Yet there is also a true dual in the sense of classical duality: 
(2.2) (J cotree X)(Vx © X) M(x) © (V circuit X)(4x © X) M(x). 


This occurs automatically because (2.1) holds for all matroids (translating “tree” as “base” and 
“cutset” as “cocircuit’’). Thus there is a global cotree/circuit duality as well. 

Although it is rather jarring to suddenly start thinking of spanning trees as being duals of 
cutsets, there is certainly some sort of intimate relationship involved. A clear intimation of this 
occurs in Carré [2, page 160]: 


It is evident that a spanning tree represents a minimal collection of edges which preserves the 
connectedness of a graph. This concept is in a sense complementary to that of a proper cut set of edges 
(which is a minimal collection of edges whose removal disconnects some nodes from others). These notions 
are related precisely by the following theorem: 

In a connected graph, every cut set of edges has at least one edge in common with every spanning tree. 


The theorem he cites corresponds to the left-to-right direction of equivalence (2.1) by letting 
M(x) be the membership of x in any particular spanning tree. 

From more abstract viewpoints, it is very natural to call this tree/cttset relationship a duality. 
For instance, it is Example 4 of [4], illustrating clutter/blocker duality and so inheriting some 
structure such as Fulkerson’s Max-Min Theorem [4, Theorem 2.1]. It is also an example of 
Woodall’s abstract notion of Menger duality [26]. But what role can this tree/cutset duality play 
in graph theory itself, and how is it related to classical circuit/cutset duality? 

Consider the natural quantifier corresponding to the family of all spanning trees of a graph: 


(Jd tree X)(Vx € X).... 


By exactly the same method as we used in Section 1 to dualize the “circuit containing e” 
quantifier, except using (2.1) instead (1.1), we see that this tree quantifier is the logical dual of the 
cutset quantifier 


(J cutset X)(Vx © X).... 


Similarly, the natural cotree and circuit quantifiers are duals of each other. 

We have thus come full circle. We saw that the classical circuit/cutset duality corresponds 
precisely to logical duality as long as everything is relativized to a distinguished edge e. In 
particular, (1.1) is a localized duality principle. When the localization is dropped from, say, the 
“cutset containing e” quantifier, spanning trees suddenly emerge as the duals in the global sense. 
Moreover (2.1), the fundamental equivalence underlying tree /cutset duality, is precisely the global 
form of the fundamental equivalence of circuit/cutset duality. This is summarized in Fig. 3. 


local duality 


| q . 
global [sa* , circuits 


dualityL, trees | cotrees 


trees cotrees 


FIG. 3 
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Since spanning trees are sometimes called “minimal connectors”, call a set of edges a 
connecting set if it contains a spanning tree of the graph. Call a set a separating set if it contains a 
cutset. Then equivalence (2.1) also holds with “tree” replaced by “connecting set” and “cutset” by 
“separating set”. Thus connecting and separating become duals, and starting with cotrees and 
circuits leads to a similar duality between dependence (i.e., containing a circuit) and coindepen- 
dence. 

Each cutset has at least one edge in common with each spanning tree. It is also important that, 
given any spanning tree T and any edge e in T, there is a cutset which contains e but is otherwise 
disjoint from T. This can be derived from a statement about trees which Whitney [23] took as a 
fundamental property: 


(VW tree X)(V tree Y)(Vx € X-— Y)(Ay © Y-— X)(X-— x+y isatree), 
where X — x + y abbreviates (X — {x}) U {y}. Rephrasing this as 


(V tree X)(Vx © X)(V tree Y)(JAy © Y)| » =x or ( » ¢ Xand X-x+y is a tree) 
allows the application of (2.1), producing | 


(V tree X)(Wx © X)(J cutset Y)(Vy € Y)| » =x or ( » ¢ Xand X-x+y is a tree) 


Taking X = T and x = e produces the desired cutset D = Y, and there still is a bit left over from 
Whitney’s formulation. If the graph has n vertices, each spanning tree has n — 1 edges, so each 
edge y of D produces a different spanning tree containing the n — 2 edges of T — e. Moreover, 
each such spanning tree must meet D for its final edge, and so each determines a different edge 
from D. Thus Whitney’s property plus (2.1) corresponds to the following theorem of Hakimi [5], 
which is particularly interesting since it states a tree/cutset relationship somewhat resembling the 
classical max-flow min-cut theorem. 


PROPOSITION. Suppose G is any graph having n vertices. Every cutset of G contains at least k 
edges if and only if every set of n — 2 edges from any spanning tree of G is contained in at least k 
spanning trees. 


3. An Illustration: Maximal Distance Between Trees. The family of all spanning trees of a 
graph received a good deal of attention during the 1960’s in the literatures of both graph theory 
and electrical engineering; see [1, Sect. 164]. A key idea is a notion introduced by Ore [18, page 
104] of distance between two spanning trees: namely, the number of edges in either but not the 
other. Equivalently, each connected graph G determines a tree graph T(G) having vertices which 
correspond to spanning trees of G. Two vertices of T(G) are joined by an edge precisely when the 
corresponding spanning trees agree on all but one interchanged pair of edges; 1.e., when they are a 
distance one apart. Tree graphs enjoy many pleasant properties (e.g., they are always hamiltonian) 
and reflect a great deal of information about the original graph (e.g., they determine the original 
up to circuit isomorphism). 

Our approach yields a self-dual circuit/cutset formulation of the maximal distance between 
spanning trees of a graph or, equivalently, the diameter of its tree graph. 


THEOREM. The maximal distance between spanning trees of a connected graph G is less than k if 
and only if every set of k edges of G contains either a circuit or a cutset (i.e., is either dependent or 
separating). 


Proof. The property 
(V tree X)(V tree Y)(|X — Y| < k) 


can be re-expressed in the following logically equivalent reformulations, each preceded by a 


1985] THE LOGIC OF GRAPH-THEORETIC DUALITY 463 


universal quantification over all sets S having k edges: 
(V tree X)(V tree Y)[SCX ~SOY# @, 


(V tree X)(V tree ¥)| (vx ¢ X)(x €S) > (Ay E Y)(y s)}, 
(V tree X)(V tree )| (ax ¢ X)(x ES) or (AVEY)(ye s)| ; 


(Vv tree X)(4x ¢ X)(x ES) or (VWtree Y)(Ay € Y)(y ES), 
(V cotree X)(4x © X)(x ES) or (WtreeY)(JyE Y)\(yeES). 
Using equivalences (2.1) and (2.2) on the last, we get for every set S of size k 
(J circuit X¥)(Vx € X)(x ES) or (Acutset Y)(Vy € Y)(y ES), 


thereby proving the theorem. 

The k = 1 case is trivially true, since the maximal distance between spanning trees is zero if 
and only if the graph is (except possibly for loops) itself a tree, and this is clearly equivalent to 
every edge either being a circuit (i.e., a loop) or a cutset (i.e., an isthmus). If is the number of 
vertices of G, the k = n case is also trivial: Every spanning tree is a maximal circuitless subgraph 
and has n — 1 edges, so every set of n edges contains a circuit. (Dually, if g is the number of 
edges of G, the k = g—n+2 case is trivial, since every set of g — n+ 2 edges contains a 
cutset.) 

But the cases adjacent to these trivial ones are interesting. When k = 2, the theorem 
characterizes every two distinct spanning trees being distance one apart; i.e., the tree graph being 
complete (see [10]). This happens if and only if each pair of edges contains either a circuit or a 
cutset. If there are no loops, these are trees and the unicyclic graphs. - 

The k = n — 1 case-gives the following corollary. (Other approaches to pairwise intersection of 
spanning trees occur in [20] and [17], and in [11].) 


COROLLARY. Every two trees intersect if and only if every set of n — 1 edges contains either a 
circuit or a cutset. 


This corollary can also be reached directly, since the pairwise intersection of trees can be 
expressed as 


(3.1) (V tree X)(V tree Y)(Jy € Y)(y € X) 
and so by (2.1) is equivalent to 
(3.2) (V tree X)(J cutset Y)(Vy © Y)(y € X). 


The latter simply says that every spanning tree contains a cutset, which is essentially the content 
of the corollary. If a set does not contain a spanning tree, its complement must contain a cutset, so 
(3.2) is also equivalent to “every set either contains a cutset or its complement contains a cutset’”’. 

It is intriguing that the tree /cutset dual of the above—the pairwise intersection of cutsets—is 
equivalent to the graph being complete (i.e., every two vertices are joined by an edge) if there are 
no multiple edges. The discussion following the corollary has parallels for this version; 1.e., “every 
cutset contains a tree” and “every set either contains a tree or its complement contains a tree”’. 
But there does not seem to be a parallel version of the corollary itself. 
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151. MISCELLANEA 


An old mathematician can grow indefinitely, what he cannot do is keep up. Cultural 
conservatism is becoming in an older mathematician: anything else is cosmetics anyway. If he 
whores after the new thing, he will only get it wrong and wind up praising the latest charlatans. 


— Wilfrid Sheed (adapted by R. P. Boas). 


(The original had “writer” instead of “mathematician’’.) 


152. MISCELLANEA 
There are more than twice as many left-handed architects and mathematicians as one would 
predict from the general population. 


—From a review by Michael S. Gazzaniga of Cerebral Dominance by Norman Geschwind 
and Albert M. Galaburda (Harvard University Press, Cambridge, MA, 1984). 
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Is the mathematics that this man does honest statistics or honest measure theory? (See p. 490.) 
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(1 — yx)w = (1 — yx)(1 + yzx) 
= 1 — yx + yzx — yxyzx 
=1-yx+y[(1 — xy)z]x, 
which is 1, because the quantity in brackets is 1. 
A closer look at this computation shows something else: we have only used the assumption that 


z is a right inverse of 1 — xy and have deduced that w is then a right inverse of 1 — yx. The 
same is true with left in place of right, because 


(1 + yzx)(1 — yx) =1— yx + y[z(1 — xy) ] x. 
Thus we get the following result, which actually does more than just answer our two questions: 


THEOREM 1. Jf z is a right [left] inverse of 1 — xy, then 1+ yzx is a right [left] inverse of 
1 — yx. 


As we just saw, the proof of this theorem is a total triviality. But if we had been unwilling to 
use infinite series in a context where they may make no sense, how difficult would it have been to 
discover 1 + yzx? 

When y = 1, our two questions are of course pointless, but Theorem 1 tells us something even 
then that we might not have noticed otherwise, namely: 


If z is a right inverse of 1 — x, sois1 + zx. 


Let us now assume that 1 — x has a unique right inverse z. Then it follows that 1 + zx = z. 
This implies z(1 — x) =1, so that z is also a left inverse of 1 — x! In other words, 1 — x is 
invertible. Since every element of R can be written in the form 1 — x, we have arrived at the 
result to which the title of this note alludes. 


THEOREM 2. The invertible elements of R are precisely those that have unique right inverses in R. 


Of course, the same is true with left in place of right. 
The following well-known fact from linear algebra is also an immediate consequence of 
Theorem 1: 


If A and B are n-by-n matrices over some field, then AB and BA have the same eigenvalues. 


P.S. After completing this note, I was told that the geometric series trick of finding 1 + yzx is described in [1] 
and [2]. However, no conclusions about one-sided inverses are drawn there. In the context of Banach algebras (where 
series do make sense) 1 + yzx occurs in an exercise on p. 259 of [4]. The referee has pointed out that Theorem 2 
appears as Exercise 6 on p. 89 of [3]. 
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Dedicated to Professor G. Choquet on the occasion of his seventieth birthday 


Let @ be a symmetric compact convex set in the euclidean plane, D its diameter, L its 
perimeter, 2 its area. The circle [ of radius D/2, with the same center as @, surrounds @ and 
touches it at antipodal points A and B (see Fig. 1). Hence, it is obvious that = < 7D*/4 and 
2D < L < mD. Are there other inequalities among these three numbers? Yes, of course, and many 
of them were found many years ago. They correspond generally to figures solving an extremal 
problem; for instance, if 2 and D are given, what is the maximum of L? (Favard, 1929, [2]). Or, 
if 2 is given, what is the minimum of L? (the isoperimetric inequality: L* > 47, proved by 
Weierstrass). 

About a new inequality [5], we shall compare old and new methods, geometric and analytic 
points of view, and give elementary partial solutions, useful for teaching aims, or more sophisti- 
cated ones. 


r 
A B = 
0 D 


FIG. 1 Fic. 2 


An elementary but useful remark is that, if we know the diameter D of @, then @ is defined by 
a concave continuous function f on the interval [0,D], satisfying the inequality f(x) 
< /x(D — x) (see Fig, 2). In such a situation, we denote by / the length of the graph of f, and 
by S its area, defined by S = (f(x) dx; obviously L = 2] and = = 2S. With this point of view, 
geometric problems on figures become analytic problems on functions. 


Marc Rogalski: I was born in Paris, France, in 1940, I received my “Thése d’Etat” in 1972 from the University of 
Paris 6, under the supervision of Professor G. Choquet, for works in compact convex sets theory and ordered 
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The literature about convex sets is very extensive. We refer the reader specially to [1] and [3], 
and to their bibliography. 


1. Old inequalities. We start with two inequalities which are useful as a guide toward our 
problem. 
(a) Favard’s first inequality (1929, [2)). 


By solving geometrically the problem to determine the convex sets (not necessarily symmetric) 
of maximal perimeter L for 2 and D given, Favard proved the following inequality: 


(1) LD-23 <2D’. 
(b) Favard’s second inequality (1929). 


In the same paper, he proved for a concave continuous function on [0, D], null at 0 and D, the 
following inequality 


(2) ID—-28 <yD* + 4S 

(in the case where D is the diameter of the symmetric convex set @ defined by f, ie., f(x) 
< /x(D — x), inequality (1) implies inequality (2)). Because the maximum of f on [0, D] is less 
than //2, it is obvious that 

(3) 0</D-2S. 


Favard determines the function f that maximizes / for given S. His proof is geometric in 
nature, and easy analytic arguments are omitted. Later we shall give a modern complete proof of 
Favard’s inequalities, in the symmetric case. 


2. A new problem. Inequality (1) is equivalent to 
(4) => D5 - D), 
and from (2) and (3) we derive 
D(L L 
> abl + b\(5 — D), 
which is less good. 


This suggests a question. Obviously, if @ is a segment, L/2 = D and = = 0. In the other case, 
5 7 D> O-and & > 0, and it is reasonable to think that there exists a constant C > 0 such that 


(5) => c(£-p) 


(the square is for homogeneity). This question was put to us by A. Pajor, and its positive answer is 
a particular case of a conjecture about mixed volumes and means of volumes in any dimension. 
, , L 
Actually the answer is yes. From inequality (4), we could obtain (5) if C an D) < D,orC 
1 D 1 


is increasing on [o, 5 . and 7707 


D . x 
<7 for x = —; but the function x ~ ; 
2 x L 7 —xXx 


1 1 
hence, on [+.5] , 
gv 2 


and the inequality (5) is true for the value C, of C given by 
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2 
(6) C= a5 ~ 1519... 


But the proof of inequality (4) that is given in [2] is not elementary. Hence we shall give first an 


elementary proof of the existence of C. Then, our problem will be to find the greatest constant C that 
satisfies inequality (5). We shall study these problems only in the case of symmetric convex sets. 


> 
REMARK 1. The quantity 2 is not bounded from above, even for polygons which are 
[3-2] 


inscribed in the circle of diameter D: for a rectangle of diagonal D and side a, ——————- ~ 


L 2 
(7-9) 

when a —> 0. 

3. An elementary proof of the existence of the constant C 

PROPOSITION 1. For a symmetric convex compact set @, 

2 ; 
2 0<8<7/2 2/9 — 1+ cos@) 
, | (1 — D)’ | 
Proof. (a) We want to prove the inequality S > an an , for f concave and continuous on 
D . 

, , satisfyin x)< yx —x).Let m= = Sin e the maximum oO! /. We have 

[0, D] isfying f(x) (D ).L 7 Sing be th ff. Weh 
Dm D? 
S 2 2 = “4 sin 6, 
and 
D D ; 
l< of 54 f+ F cos = D(6@ + cos@) (see Fig. 3); 
hence 
(1-D) D2, 2 
—z 5 (8 1+ cos@)’. 
2 

We shall have § > CU 5 ay ie 


2 2 
2» sind > cz (8 —~1+cos6)’. 
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The greatest C compatible with this condition is 
C= Inf ——S2? _ 7 
0<8<7/2 2/9 — 1 + cos@) 


and this number is strictly positive. 
(b) It is easy to see that C, > 1: the function 


u(@) = sin@ — 2(0 — 1 + cos0) 


satisfies u(0) = 0 and uw’ > 0 on 0, >|: hence u > 0, and C, > 1. 


But if we define g by 


sin 6 
o() = Sin? __ 
2(@ — 1 + cos@) 
h by 
ay (8 — 1+ cos)” 
h(a) = 2g'(9) S04 + cos) 
and V by 


V(0) = cos*6h’(6), 


we have W’(0) = cos 0(—3x* — 2x + 3), with x = sin@ © [0,1]; the only root in [0,1] is x = 
(¥10 — 1)/3. Hence, it is elementary to deduce that g is decreasing from infinity to a minimum 


; | and that 4, is in the interval [0.935, 0.937]; as the function 
@ ~ 6 — 1 + cos@ is nondecreasing and nonnegative, it is easy to find that C, > 1.436. 
Of course, because of its elementary nature, this proof of the existence of C gives a bad value 


for C,, lower than C,. 


on ]0, 4], and increasing on | 


4. How to work with polygons? It is often useful to work with polygons rather than with 
arbitrary convex sets. This is possible for two reasons. First, the perimeter of a convex set is 
approached by the perimeter of a polygon whose edges are short chords of the convex set (by 
definition). Second, the area of such a polygon is an approximation to the area of the convex set 
(obvious from the formula } {2%o7(0) d@ for the area, in polar coordinates). 

As an illustrative example, this method gives an easy proof of the fact that if @C @’, then 
L(@) < L(@’). The result is obvious if @ and @’ are polygons; otherwise there exist polygons 
Pc@ and P’c @’(1 + €) (we suppose 0 € @), with P’ > P, and (1 — e) L(%@) < L(P) < 
L(P’) < (1 + «)L(@’). From this result, we deduce another which will be useful: 


LEMMA 1. On the set of concave (or convex) continuous functions on [a,b], the function 
f ~~ l(f), where l(f) is the length of the graph of f, is 4-lipschitzian for the uniform norm, hence 
continuous. 


If f—e<g<f+e, and a < Inf, f(x) —«, we apply the preceding result to the convex 
delimited by the segment y = a, the lines x = a and x = b, and, respectively, the graph of f — e, 
g and f + e; we obtain then |/(/) — /(g)| < 4e (see Fig. 4). 

Thus, if for instance the diameter D of a symmetric convex set is given, any inequality 
depending continuously on = and L is true if it is true for symmetric polygons. As an application 
of this remark, we give a result which will be useful for an analytical proof of the Favard’s first 
inequality, and for the determination of the greatest constant C satisfying the inequality (5). 


PROPOSITION 2. Let & be the convex set of concave continuous functions f on [0, D] satisfying 
0 < f(x) < yx(D — x), and let »: = R be a function that is continuous ( for the uniform norm) 
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and concave. If ~(f) > a when f is a polygon with its vertices on the circle y = |x(D — x) , then 
p(f) >a for every fin €. 


To prove that p(/) > a for every f in @, it is sufficient by Lemma 1 to prove this inequality 
when the graph of f is a polygon; for a vertex of such a polygon, we shall talk of “‘a vertex of f”’. 
We shall prove that if f has not got all its vertices on the circle, then there exists a polygon g in & 
such that p(g) < »(/), and such that either g has fewer vertices than f, or has the same vertices 
as f, except for one, and this one is on the circle for g but not for f. 

If a vertex m of f is not on the circle, three figures are possible (if the nearest vertices p and g 
of m are on the circle, only the case (c) is realized). Let f be the function whose graph is a 
polygon with all the vertices of f except m (see Fig. 5). Then there exists a continuous piecewise 
affine function u and t, €]0,1[ such that f = f + tou. For ¢ © [0,1], let g, = f + tu. For a good 
(and clear) choice of u, g, © & for every ¢ € [0,1]. Because the function V(t) = y(g,) is concave 
and continuous on [0,1], its minimum is attained at 0 or 1. Hence, for g=g, or g=g,, 
P(g) < 9(g,,) = PCS) and g is the function we wish. 


(a) (b) (Cc) 


By applying this procedure a finite number of times, we find g, which satisfies p(g) < p(/) 
and which either is the null function (which has its vertices on the circle!), or a genuine polygon 
with its vertices on the circle; we have p(f) > »(g) > a, and this proves the proposition. 


COROLLARY 1. (a) Favard’s first inequality for convex symmetric sets is true if and only if it is 
true for symmetric polygons inscribed in the circle of diameter D. 


(b) A constant C satisfies X > AS — D| for every symmetric convex set if and only if it 


satisfies this inequality for symmetric polygons inscribed in the circle of diameter D. 


It suffices to apply Proposition 2 to the function y, and q, defined by 
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ei(f) = S(f) — (UCL) ~ D) {where S(f) = f? f(x) de), 


and 


~(f) = S(f) — C5 
indeed, f ~ S(/) is affine, and 


fof) = Sup ¥ (2 +] (20) - (Eo) 


is convex, hence f ~ /(f) — D is convex positive, and (/(f) — D)* is too. 


[(f) - DD]. 
2 


COROLLARY 2. (a) Favard’s first inequality for convex symmetric sets is equivalent to the relation 


(7) >) sind, — o{ sin _ ) >0, for 06.>0, VO=7. 
i=1 1 1 
L 2 
(b) The greatest constant C satisfying the inequality © > 5 — D| for symmetric compact 


convex sets is the same as the greatest constant C satisfying 


n n . 2 
(8) Y sind, ~ 4c[¥ sin 5 ~ 1] >0, for 6.>0, VO=7c. 
] 1 


i=l] 


This is obvious, because the first members of’(7) and (8) are exactly the quantities tal f) 


n° 


and 3 Pal f), respectively, for f an inscribed polygon with angles at the center 0,,0,,...,9 


5. An analytic proof of Favard’s first inequality. We prove two lemmas in order to establish 
inequality (7). 

LEMMA 2. Define p(x) = 2sinx — 4 sin2x, and, for0 < a < 7/2, a(x) = p(x) + p(a— x). 
Then, if 0< x < a, a(x) < p(a). 
a 


5 | We have o(0) = p(a), and 


It suffices to look at the case where x © lo, 
o’(x) = p'(x) — p'(a — x) = 2cosx — 2cos(a — x) —[cos2x — cos2(a — x)] 


_a./a a a 
— 4sin$ sin( 5 — x}|2cos$ cos{ $ - x} -1]; 


2 cos cos{ 5 — x | —l> 2.cos* > —l=cosa> 0. 


Hence o is decreasing on [0.5], and o(x) < o(0) = p(a). 


LEMMA 3. If a,,...,@, are nonnegative and Lia; = > then Lip(a;) < (5). 


We prove inductively that if a, > 0 and Lia, < 7/2, then Lj'p(a;) < p(Qj"a;). It is clear for 
m = 1, and the step from m — 1 to m is a direct application of Lemma 2. 


Proof of Favard’s first inequality for symmetric convex sets. It is an easy consequence of Lemma 
3, for we have 
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n Q. n n Q. 
4¥ sin — Vsind, = 2¥ 05] < 20( = | = 4, 
i 2 t 1 2 2 


6. The greatest constant C: first method, by Favard’s first inequality 
THEOREM 1 [5]. For every symmetric compact convex set, we have 
L 2 
(9) o> — (5 - D| 
(mn — 2) \2 
for the euclidean ball, we have equality. 


This means that 7/(a — 2)* is the greatest constant which satisfies inequalities (5) and (8). 
Note that 7/(a — 2)* ~ 2.4106... > C,. 
We shall use a critical but elementary lemma: 


LEMMA 4. Define the function u on [0,7] by sinx = x — x? u(x). Then u is (strictly) decreasing 
on [0, 7]. 


Its proof is obvious if we remark that, by Taylor’s integral formula, 
l 1 2 
u(x) = af (1 — t)°cos tx dt. 


Other elementary proofs of this result exist, of course. 


Proof of Theorem 1. We have to prove it for a symmetric inscribed polygon only. 
Let S, = 7D*/8 and |, = 7D/8 (case of the circle), and let 


(I= Dy (lg — DY’ 
Then 
1 ? 2 2 2 
4 = ——._{ s|(, - D)’ -(1- D)’] -(- D)*( - 8)}; 
(/— D) (4, - DY 
but, because /, > / > D, we have 
A> + 
(/ i D) (lo — D 
By Favard’s first inequality, 2S > D(/ — D); hence 
1 
A > + [D(y- I) -(% - S)]. 
(Jo i D) 

Inserting the expression for / and S in terms of the angles 6, > 0, U@, = 7, we have 


cca (4, ~ 2sin ‘) - X (0, - snd) 


8(/, — D)’ 


weno Dla) - | 


1286 — NCI D) ~~ DIS ~ 5)]. 


6, 
By Lemma 4, u A > u(@,); hence A > 0, and 


S 5 
27 4 
(J-D) (4, - DY’ 
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this last number is 7/2(7 — 2)?, and it follows that 


S T (1— D)° 
* 7 op 2 | 


REMARK 2. For a symmetric polygon, the inequality is strict; this follows from the proof of 
Proposition 2 and Lemma 4. 


7. The greatest constant C: direct proof by variational method. Variational methods are often 
used in the proof of inequalities about convex sets (see [2] for instance). We shall give a variational 
proof of Theorem 1, which does not depend on Favard’s first inequality. This is a natural proof, 
but it uses calculus, because the point where an extremum is attained is not completely 
determined. 

We shall prove that if 0 < C < 7/(a — 2)’, 0, > 0, X76, = 7, then the quantity 


. - . 0. ; 8 C 2 
(0) ays +208) = Sesind, ~ 4c Yesin} ~ 1] - <|s- $u- py 


is not negative. This is obvious for n = 1, and for n = 2 it is equivalent to the inequality 


Te pp 28nd LC 
2. 9) 7 ) 0 T ) 
(7 — 2) 0<0<7 |. 0 6 0<0<n ; (5 4 _ 
4) sin> + cos, 1 V2sinl 5 + 3 1 
T 1 
i... ————_~ < ———, and this is true. Hence, we suppose n > 3. 
(7-2) 6-4y2 
Let 
K, = { (B,---4)18, > 0, Y 6-2}, 
1 
and 


K, = {(6,,...,9,) €K,|8>0 wi}; 


it is clear that Infg ¥, <0 (6, =7,6,=0 for i> 2). We shall prove the following lemma: 


Lemma 5. If ©, attains its greatest lower bound on K,,, for n > 3, this lowerbound is strictly 
positive. 


Proof of Theorem 1 from Lemma 5. As Infx ¥, < 0,¥,, will attain its minimum on K,, at a 
point of 0K, = K, \ K,. Thus this minimum will be the minimum of W,_, on K,_,. By 
continuing this procedure, we come to the minimum of VY, on K,, which is positive, and ¥,, > 0, 
for all n. 


Proof of Lemma 5. If ¥, attains its greatest lower bound on K,, at (0,,...,9,), we have 


” G/2 . 6 
dV, =) d0,, i.e., cos 0, — sceoss | Yin} — 7 =i, 
1 1 


OT 


6 6 , 6 
2. cos” — 4Cp cos -A-1=0 wher L= di sin5 — i} 
1 


It follows that there are only two possible values for the 8,, a and B: K times a and (n — K) 
times B. We may suppose that we have 0 < a < 7/n < B < m. In such a point, we have 
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, , _ a _ B 2 
WV, = Ksina +(n — K)sinB — 4C Ksin5 +(n — K)siny — 1]. 


REMARK 3. If we knew that a = 8B =7/n, then we should have V, > 0 at the point 
(1/n,...,2/n) for every C < 1/(a — 2)° if and only if it were true that 


. 7 
nSIN— 
n 7 


> —— for n>2. 
) (7 — 2) 


_ 
4 nsin —] 


This inequality is easy to prove with Lemma 4. 
In any case, by putting t= K/n, 0 <¢ <1, Lemma 5 will be true if we show that A, > 


a /(1 — 2)*, where 
n[tsina +(1 — t)sinB] 


A, = | 
4| n(tsin$ +(1- ‘sin | - i] 


n 


With the function u of Lemma 4, we have 


_ 1 —n| to?u(a) +(1 — 1) B?u(B)| 
[5-2 sles) col) 
But we have 


a Boeke 
_Ksinz +(n — K)siny —l=p(/ D) > 0, 


and, if we let 
X= n| to? u( a) +(1 — t)B?u(B)| =7—- Ksina —(n — K)sinB < 7, 


then 

T X 

5 —-li1- 7 > 0 

a B 
From Lemma 4, uf 5] > u(a) and (5) > u(B). 
Hence 
n— X 
A,, a x 4? 
1 
GF 


and it is sufficient to prove that this quantity is greater than 7/(a — 2)’, or 
8 
(*) | X< ait 24 - 7). 
The proof of Lemma 5 will be complete after studying two cases. 


1. First case. n — K > 2. 


As B< and |u(x)| < 2, it follows that 


7 
n—- K 
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and it suffices to see that 


1 1 48 

oa tg Salt 24-2), 
or 

1 48 1 

— < — —_— — -_ —_—_-—lc 

~ _a(™ 2)(4 — a) ran 

1 

or n > —= ~ 2.072..., and this is true for n > 3. 


Y 

2. Second case. n— K = 1. 
We have then 7 = (n — 1)a + B. Three possibilities exist for B: 

7 8a 
(a) 5 <B< 9. 
We have 

X =a —(n-1)sina — sinB=a- —. 
The relation (*) to check is hence 
8S 


8 
D2 > 1 hm — 2)(4— 7) = 5 = 0.646... . 


; ) 
But 8S/D* > 2sinB, and it suffices to have 7 — B > arcsin 59 OF 


5 <B<a- arcsinS = 2.81..., 
and this is true, for “= = 2.79.... 
7 
(b) B< 5° | 
We wish still to have 8S/D? > 8; but 8S/D? > (n — 1)sina, and (n — 1)a > 1/2; hence 
8S 2 T 
—_ > — _ l, 
D? (7 7 2(n — 1) 
which is greater than 6 (note that a < 7/2). 
8a 
(c)B> 9° 
We have 
_ 2 
WV (a,a,...,a,8) > sinB — ac(sin5 +2 5 B_ i , 
which will be strictly positive if 
sin h T 
4 cos, + 5 i 
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hence the first number is greater than 


. 
s9 1 9:0 0 
a n h\? 9h’ 
9 a5 


which will be greater than 7/(a — 2)? if 


OT 


(7 - 2)” ae 
5 9 sin 9 7 
and this is true if B > 87/9 = 2.79.... 
As pointed out in Remark 3, this proof would be much shorter if the following question had a 
positive solution: 


B>ar- 2.735..., 


PROBLEM 1. Among the symmetric convex polygons inscribed in the circle, with 2 sides, is the 


quantity minimum for the regular polygon? 


2 
[2 ->) 

We will use Theorem 1 to give a partial solution to a problem of Favard about the maximum of 
L when D and »¥ are given. 


8. Favard’s problem on maximality of the perimeter of a convex set. In [2], Favard sets the 
problem: if D and» are given, what is the maximum possible for L, and for which convex set is it 
attained? There is no exact general solution, but Favard proved that the maximum, if © < 7D?/4, 
is attained for a polygon inscribed in the circle of diameter D, which has all its sides equal, except 
for one. But the calculation of L may be difficult. 


As an example, he proves Favard’s first inequality, which states that L < 2D + 25, But this 


inequality is not sharp, for if X = 7D? /4, it gives only L < (2 + 5 )D, which is without interest, 
for L < wD in every case. 


With Theorem 1, we have another inequality: 


(10) L<2D+27 =f. 
V0 


This inequality is sharp for large values of / for the bound is at most 7D, with equality for 
D’, then it 


L = 7D*/4. But, in the case where Favard’s first inequality is useful, ie., if LX < 7 
2 
(7 _ 2) D?. 


is better in the range 0 < Y < 7 
Hence, we can assert an improved bound for the symmetric case of Favard’s problem. 


PROPOSITION 3. For a compact convex symmetric set, we have 


_{L aw—-2 
(11) 2D << 2D + 2minl 5, ie i}. 


9. Favard’s maximal triangle lemma and second inequality. Favard’s maximal triangle lemma is 
the following: 


THEOREM 2 [2]. Consider the convex curves 1° contained in triangle ABC, with C>B. Among 
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those that have endpoints B and C and that delimit with BC a given area, the one that has the 
greatest length is a triangle BAC, where A, lies on CA (see Fig. 6). 


FIG. 6 


Favard uses his lemma, for instance, to prove that if a convex set in the circle of diameter D 
has the greatest perimeter for a given area, then the arcs that bound it inside of this circle are 
necessarily polygonal, because they are locally polygonal: between two nearby support lines, the 
boundary is a triangle (make a figure!). 

Favard’s proof was of a geometric nature. We give proof with functional ideas. 

We shall first determine, by functional methods, the extreme points of a convex set of concave 
functions. , 

Let C,([0, a]) be the space of continuous functions on [0, a] null at 0 and a. 


PROPOSITION 4. Let K be the set of concave continuous functions f on [0, a] satisfying 0 < f(x) 


2 
< Min( px, g(a — x)), where p,q > 0. Let S €]0,So[, where S) = > oF 
hyperplane H, of C,((0,a]) defined by H, = {f\fofdt =S}. Then the set Ky; = KO Hy is a 
compact convex set in the topology of uniform convergence and its extreme points are functions of the 


form ; 


, and consider the 


f(x) = Min| px, q(a —x),ux+v,u’'x+ v'|, 
forvu>0, v’ > 0, uat+vu>0, wat+v' > 0. 


Recall that a point f, is extreme in Ky, if it is not possible to have fy = (f, + f,)/2 unless 
fo =fhi=h(h> fr, © Ks). We have then eight possible forms for extreme points of K,: see Fig. 7. 


A DAA 
AAA A 


FIG. 7 


Proof. (a) The set K, is norm closed and bounded and the functions f in K, are Max( p, q)- 
lipschitzian, hence equicontinuous. By Ascoli’s theorem, K, is a compact convex set. 
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(b) It is almost obvious that the functions Min[ px, g(a — x),ux + v,u’x + v’], with the 
prescribed conditions, are extreme in K,, and we do not give this easy proof. 

(c) Define the mapping 7 from K, into the set of positive Radon measures on [0,a] by 
Tf = —f”, where f” is the second derivative of f in distribution sense, or equivalently, the 
Stieltjes’ measure df; associated to the nonincreasing function f/, the right derivative of f. The 
measures p that are in the range of T must satisfy three conditions, easy to establish: 


1. fo(a—s)dp(s)<p (ie, f’(0) < p and fi f’(t)dt = 0); 
2. fosdu(s)<q (ie, f(a) > —4@); 


3. p4 =) ays) = S (ie, [of(t)dt = S). 


Hence, ||p|| < p + g. Let L be the convex compact set of positive measures on [0, a] satisfying 
llu|| < p + q, with w({0}) = w({a}) = 0, for the topology of duality with the space C)((0, a]), and 
let be E,, E, and Mg the closed half-spaces and hyperplane defined by the conditions 1, 2, and 3. 
Then, the range of T is the set 


Ly = LONE, E,O Mg, 


and T is affine and one-to-one from K, on L,. Thus, the set &(K,) of extreme points of K, is 
exactly the set T~'[&(L,)], where &(L,) is the set of extreme points of L,. But we know the set 
of extreme points of L: they are exactly the measures || p + q\|d,, for x €]0,a[, and the mea- 
sure 0. 

Suppose that we have proved the following lemma: 


LEMMA 6. Let L be a compact convex set in a locally convex Hausdorff topological vector space X, 
and E,,..., E, closed half-spaces of X,M,,...,M, closed hyperplanes of X. Then every extreme 
point of the convex set L= LONE, N-+» NE, AM,N--: OM, is the barycenter of at most 
k + 1+ 1 extreme points of L. 


Then every extreme point f of K is the barycenter of at most four functions f,,..., f, such 
that f”, = —A,6x;, with A, > 0; thus, the graph of f is a convex polygon with at most four 
vertices. But it is easy to see that if two consecutive vertices are inside the triangle made with the 
lines y = px, y = q(a — x), y = 0, then the function f is not extreme in K, (see Fig. 8). Thus, 
the only possibilities are those described in the proposition. 


FIG. 8 


Proof of Lemma 6. Let x € &(L); if x is not a barycenter of at most k + / + 1 extreme points 
of L, then x is inside a (k + / + 1)-dimensional simplex S included in L. But FE, N --- NE, A 
M, 1 --- OM, then contains a subspace of dimension at least one, whose intersection with L 
contains a segment ]x,, x,[ that includes the point x. Thus, x cannot be extreme in L. 


Proof of Favard’s maximal triangle lemma from Proposition 4. If the angle C is greater than 
1/2, we change the triangle ABC into ABC’, with C’ < 7/2 and AB > AC, adding to the area S 
and the length / numbers S, and /,; the problem is not changed and it suffices to solve it in the 
new situation (see Fig. 9). 
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‘A 


; 
~ 
~ $0 /o 
~ 


ce 


FIG. 9 


In the new triangle, convex curves are graphs of functions of K,. Let /: f~ 1(f) be the 
functional defined by the length of the graph. Then / is a convex continuous function by Lemma 
1. We use now a well-known result: 


LEMMA 7 (Bauer’s maximum principle). Any convex upper semicontinuous function | on a 
compact convex set X in a locally convex Hausdorff linear space attains its maximum on an extreme 
point of X. 


The shortest proof comes from [4]:{x © X|/(x) < Maxy/} 1s a convex relatively open set, 
hence its complement in X contains an extreme point of X. 

Applying Proposition 4, we see that / is maximum on a function of K, with polygonal graph, 
which has 1, 2 or 3 vertices. It is not possible that there is a vertex inside the triangle. Otherwise, if 
M is such a vertex, the others being F, and F,, the parallel line to F, F, through M intersects BA 
and CA in C, and C;; ‘one of these points, say C,, is outside the ellipse with foci F, and F, 
passing through M; hence C, F, + C, KF, > MF, + MBE,, but the areas of the triangles F, MF, and 
FC, F, are the same, a contradiction to the maximality of / (see Fig. 10). 

Thus, the function in K, whose graph has a maximum length is polygonal, with two vertices 
on BA and CA, or is a triangle with its vertex on BA or CA. But an obvious geometrical 
argument, which is in [2], shows the e triangle BAyC has maximal length, for if M, Ao is parallel to 
BM, and B < C, then M, BM, < A,M,B 9M, B, and BM, + M,M, < BA, + AyM, (see Fig. 11). 


Fic. 10 Fic. 11 


Proof of Favard’s second inequality. By density, it suffices to work with concave function f 
satisfying 


f(x) < Min[ px, q(D — x)], 
and then let p and g tend to infinity. If we take S fixed, / is less than the maximum, given by 
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Favard’s maximal triangle lemma. 
But an elementary calculation gives h = 2S/D (see Fig. 12); hence we have 


l<h 1+ >t 1+(2-2 


Or 


Fic. 12 


If g tends to infinity, we obtain the result: 
ID <2S + V4S? + D4. 
10. Conclusion and problems. We recall the first problem already set: 


PROBLEM 1. Is the quantity minimum for the regular symmetric polygons, among 


2 


57D 


symmetric inscribed polygons with 2 sides? 
In connection with Remark 2 we also pose the following problem: 


PROBLEM 2. If, for a symmetric compact convex set, we have the equality 


Eo eco?) 


is this set the euclidean ball? 


Many other fascinating problems exist about plane convex sets. The methods for studying them 
are various: geometrical or analytic, elementary or functional analytic. We hope that the reader 
will be encouraged to look at some of the problems that are set in the many papers cited in [1], [2], 
or [3]. 
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MODIFIED ITERATION AND PROBABILITY 


LAWRENCE J. WALLEN 
Department of Mathematics, University of Hawaii, Honolulu, HI 96822 


It is always a pleasant event when apparently disparate areas of mathematics intersect: when a 
theorem in one discipline translates into, proves, generalizes, simplifies or “explains” a theorem in 
another. This phenomenon occurs frequently between probability theory and various aspects of 
classical analysis. Examples are so abundant that it is perilous even to begin a list. This note 
presents another such interaction. 

Iteration of functions produces fixed points, at least under suitable conditions. We are going to 
present a modification of the usual iteration procedure which has the pernicious property of 
retarding or even destroying the convergence to the fixed point and ascertain (purely analytically) 
just when this convergence is destroyed. Our modification comes about not purely by malice, but 
turns out to correspond to picking a non-optimal strategy in a certain iterative game of chance. 
The analytic criterion then reappears as an immediate corollary to an elementary pointwise 
convergence theorem. 

Here is the analytic set-up. G(w) is a strictly convex, continuously differentiable function on 
the interval 0 < w <1, with G(0)>0, G1) =1, and 0 < G’(w) <1 for 0<w<1. These 
conditions are more than enough to ensure that if n,,, = G(n,) (% = 9), then n, > 1, the 
fixed point of G. 

Let w, be a sequence with 0 < w, < 1. Modify the n,, to get a sequence &, by defining €, = 0 
and 


(+) ntl = G(», 41) +(&, 7 W +1) GM 41) 


that is, (&,, €,,,) lies.on the tangent line to G through (w, ,,, G(w,+,)). The strict convexity of G 
implies that €,,, < G(&,,) with equality exactly when w,,, = &,. Since G is a strictly increasing, 
€,, < 1,. The scheme reduces to ordinary iteration when w,,, = &, = ,- 

The question arises, for which sequences { w, } does €, — 1? It is clear pictorially that w, must 
tend to 1. In the case where G’(1) = 1 (the only interesting one), if we pick w,,, = 1, then 
€,41 = §, and no progress has been made. We expect therefore that €, — 1 only when w, doesn’t 
tend to one too rapidly. This is exactly what happens as the following theorem shows. 


THEOREM 1. §, — 1 if and only if w, > 1 and 
bX (1 — G(w,)) = 0. 
n=0 


Proof. Assume 1 — §, — 0. We want to show the series diverges or, equivalently, [17°G’(w,) = 
0. Now 


1 — En+1 =1- G(w, +1) —(&, -1+1- Wri1)G'( Wy 41) 
=1 —[G( my 41) +(1 a W741) G’(%,41)] +(1 a E,) G'( wy 41)- 
Owing to the convexity of G, the term in brackets is majorized by 1 so that 


(1 -€ 41) > (1 - &,) G41) 


Lawrence J. Wallen: My vital statistics read as follows: B.A. from Lehigh University, Ph.D. from M.I.T., stints at 
Bell Laboratories, Stevens Institute of Technology, The University of Michigan, and, finally, the University of 
Hawaii. I was lured into operator theory (in part) by the seductive lectures on ergodic theory given by the present 
MONTHLY editor. Most of my research has involved operators, often semigroups of them. Of late, my interests have 
become more diffuse, my present passion involving some connections between functional analysis and geometry. I 
watch birds, fish, bugs, stars, sometimes play the fiddle in quartets and sometimes play basketball (in quintets). 
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and 


n+1 
(1 — 41) > (1 - &) I] G'(w,), 
j= 
therefore [1°G’(w,) = 0. 
For the sufficiency, since G(w) > w, we get 
G(w) — wG’(w) > w(1 - G’(w)). 


Pick N so large that w, > 1 — e for n > N. Then for these n, 


f, — §,-1G'(,) = G(w,) — w,.G'(», ) > (1 — «)1 — G(,)) 
En-1 7 €,-2G'(w,-1) 2 (1 ~ e)(1 ~ G'(w,—-1)) 
fv — §y-1G'(wy) > (1 — e)(1 — G(wy)). 


Now multiply the second inequality by G’(w,), the third by G’(w,)G’(w,_,),... and the last 
by G’(w,) G’(w,_1) °°* G’(wy 41). Adding, we see that both sides telescope and we get 


fy — fv IL G'() > (0 of - To}. 


Letting n > oo, we get liminfé, > 1 — e since II¥G(w,) = 0. 
The theorem yields an instant corollary. 


COROLLARY 1. (a) L(1 — G’(n,,)) = 0 and 
(b) (1 — ,) = 00 if G’(w) < Mfor0 <w<1 andG‘(1)=1. 


Proof. For (a), simply set W,+1 =, and use the fact that , = 7, — 1. For (b), the mean 
value theorem yields 1 — G’(n,,) < M(1 — ,,) and then use (a). 


We now make the promised probabilistic connection. The iterative game proceeds as follows. 
We announce at the outset a sequence of numbers w,,w,,...,0 < w, < 1. Pick a number X, 
randomly from the unit interval, X, having the continuous distribution F(x). We win Y, which is 
X, if X, > w, and Oif X, < w,. Now pick X, (using the same F(x)) and win Y, where Y, = X, 
if X, > w, and Y, = Y, otherwise. Continue the process ad infinitum. 

More rigorously, { X, }"_, are independent, identically distributed random variables with 
continuous distribution F(x) satisfying F(0) = 0, F(1) = 1 and F is strictly increasing on [0, 1]. 
{Y, }°_ 5 is a sequence of random variables defined by Y) = 0, 

Y= i If X41 > Wisi 


Y if X,44<w 


n n+1° 


The w,’s are called thresholds. Note they are picked before the game starts. 
An ancient question concerns the way to pick w,,w,,...,w, SO as to maximize E(Y,) (E£ is 
expectation). Compute E(Y,,,,) as follows: 


é if E> Way, 


E(Y,411Xne1 — é) _ UY) ifé< Ws: 


Hence 


E(Yae1) = [OE YnalXor = 8) (8) + f° BC 1l%rna = 8) AF(8) 


Wnh+t 
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= E(Y,) F(a.) +f dF (8). 


Set E(Y,) = &, and G(w) = 1 — {) F(é) dé. Integrating the last equation by parts gives 
En 4d — G( W,+1) +(&,, a W +1) G'( Writ) 

since G’ = F. Since F is strictly increasing, we see that G is exactly the sort of function we 
considered at the outset. The maximum value of E(Y,) is, since &, < n,, precisely n,, and is 
achieved by picking the w,,, =; = E(Y,), j = 0,...,” — 1. Thus if we play infinitely often, our 
winnings (Y,,) tend to one. 

If, however, we don’t play optimally for some reason (there may be additional strictures on 
choosing the w,’s), we can still ask when our winnings tend to 1. 

The following theorem answers the question completely and affords the promised probabilistic 
proof of Theorem 1. 


THEOREM 2. Still assuming w, — 1, the following are equivalent: 
(a) Y, > 1 with probability one (a.s.). 
(b) E(¥,) > 1. 
(c) L(1 — G’(w,)) = LL — F(w;)) = 00. 
Proof. Let (&, P) be the probability space underlying the process. For w € Q, let 
N’(w) = {v: X,(@) > w,}. 
The Borel-Cantelli lemma [1, p. 41] shows that N’ is almost surely infinite or finite according as 


Y P(X, > w,) =D (1 - Fm) 


diverges or converges. 
For each n, let v(n) be the largest member of N’ satisfying v(n) < n (set Xp = 0, v(n) = Oif 
N'(w) = ©). Then Y, = X,,,,. Thus, when 


~(1 — F(w;)) — 0; 
we get convergence to one a.s. since w, — 1 whereas when 

Yi (1 - F(w,)) < 0, 
dim Y,(w) = Xy(w) where M = max{v: v © N’(w)} and so the bounded convergence theorem 
yields 

lim E(Y,) = E( X,,) < 1. 
Note that in any case €, = E(Y,,) converges. 
Actually, this proves Theorem 1 only in the case where G’(0) = 0 and G’(1) = 1 but the other 

cases are easily disposed of anyway. 


Suppose now we are allowed to choose the w, (and the associated Y,) using hindsight. That is, 
take 


w, = w,(X,, Xo5--+, Xy-1) 
and define Y, as before. Then, there is an obvious best strategy. Namely, define 
w, = max( X,,..., X,_1) 


and let Y, be the corresponding Y,, (starting with Y, = 0, #, = 0). A trivial induction argument 
shows that 


Y, = max( X,,..., X,). 


n 
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If {w,, Y, } 1s any other choice, then since each Y,(w) is one of S X, “on j =1,...,n, we have 
Y > Y, and consequently E(Y,) > E(Y,). In particular, E(Y,)> 


t 


Now E (Y,) is easily computed (see (*) below) and is given by 
E(¥,) =1- [-F"(w) dw 
0 


so that 
1-4, > [la(w)]’ dw. 
0 


This can also be seen analytically by noting that if G, denotes the jth iterate of G, then 
1d p72! ; Lp, h 
Ny =f Ty On) dw -{ Ile (G(w)) dw > | [G’(w)]" dw 
since G(w) > w. This in turn gives a sharpening of vey 1(b), namely, 


>) 1 — 7, = oo provided [ 


Gm" CO. 


We conclude by asking how the best a priori choices of the w,, namely w, = 7,,_,, and the 
corresponding expected values, 7,,, compare with the best strategy using hindsight, namely E(Y, ). 
For fixed n, let X1, X?,..., X" be the order statistics associated with the X’s. That is, 


X) = min( X,,...,X,), and X/ = the jthsmallest X 


(see 3, Chapter 9, for a treatment of order statistics). The expectation of X* is given by the 
formula, 


ky —,{("2—1) ft p1 k-1¢4 _ ,,\"7k 
( *) E( xy) nl i) fF (u)us "(A -—u) “edu. 
Consider the family of distributions F,(w) given by 
F(w)=1-(1-w)”, p>0. 


The asymptotic nature of 7, for the corresponding G,(w) is known: 


i++ 1/p 


1-7, ~ 7 


A neat proof of this is given in (4, p. 223). Putting F, into (*), routine calculations give 


(+ *) E( X*) = n( 1) f° ~(1 = u)'”) wk = 0)" du 
_, tine Mtr htt 


r{n += +1] Mn k +1) 


Thinking of n — k as fixed, say n — k + 1 = 6, and using the fact that 
T(n+z)~n’T(n) [2, p. 212], 
we look for k so that 1 — E(X*) ~ 1 — 9, or, using (*) and (* *), we try to solve 


rla+- Vy 
es) -(244)" 


1985] NOTES 485 


Here are a few solutions (the last three being approximate): 


p = 1 (umform), k=n-—l1, 

p=1/2, k =n — 1.50, 
p= 173, k=n— 2.08, 
p= 2, k =n — 0.73. 


For the uniform distribution, using the best a priori choices, we do surprisingly well: namely as 
well as the expectation of the second largest X; for p = 1/3 we do almost as well as the third 
largest X, and so on. The moral seems to be that the utility of hindsight becomes more 
pronounced as the probability of getting a large number increases. 
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ANOTHER NOTE ON THE INCLUSION LE?(p) ¢ L£4(p) 


ALFONSO VILLANI 
Seminario Matematico, Universita di Catania, Viale A. Doria 6, 95125 Catania, Italy 


Throughout this note ({2,.%, 2) will be a positive measure space and, for each p &€ (0, oo], 
L?() will denote the space of all »/measurable real functions f on @ such that ||/||, < 00, 
where 


1/p 
Ifll, = [fur an) forp € (0,00) and |lfIl, = ess sup|/| 


and as usual we identify two functions which differ only on a set of measure zero. When endowed 
with the metric d, of convergence in pth mean, i.e., 


d,(f,g) =|lf- gll, forp €[1,oo] and d,(f,g) =|lf— gil? for p € (0,1), 


L?(p) becomes a complete metric space. We obtain a new characterization of the spaces 
(Q, 7, w) for which the inclusion L?() C L7() holds. This result simplifies both the conditions 
and the proofs already given in [1] and [4]. 

We begin with a well-known lemma. 


LEMMA 1. Let p,q € [1, 00]. The set theoretic inclusion L?(p) C L4() implies that the inclusion 
map i: L?(p) > L4(p) is continuous. 


Proof. lf f, > f in L?(p), then { f,} has a subsequence which converges pointwise almost 
everywhere to f; see [3], Theorem 3.12. The desired result now follows easily from the Closed 
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Graph Theorem; see [2], Theorem 2.15. @ 


REMARK. Lemma 1 also holds for p,q € (0, co], with the same proof, even though L’?(,1) is 
not a normed space for 0 < p < 1. 


Let .%, denote the collection of all sets A € Y with positive measure. Then we have 
THEOREM 1. The following conditions on the measure space ({), %,) are equivalent: 
(1) L?(u) C L%(u) for some p,q & (0, 0] with p < q, 

(2) inf, c,H(E) > 0, 

(3) L?(u) C L%(p) for all p,q € (O, 0] with p < q. 


Proof. (1) = (2). Since L?(w) C L7() implies L?'() C L”(p) for every t € (0, 00), we can 
assume p > 1. Then L?() and L4() are normed spaces, and by Lemma 1 there exists a positive 
constant k such that || fll, < X\|f||, for every f © L?(p). In particular we have 


{w(E)}'4 < k{ p(B}, 
and hence p(E) > k?4/(?~® for every E €.% with 0 < p(E) < oo. This proves (2). 


(2) = (3). Let fe L?(p) and let E, = {|f| >}, 1 =1,2,.... By Chebyshev’s inequality, 
u(E,) > 0 as n > oo, hence, by condition (2), there is an index ny such that n(£E,,) = 0 for 
n >No, 1.., |f| < No p-a.e. Thus L?(p) C L%(p), and this easily implies that L?(p) C L4() for 
every g © [p, oo]. 


(3) = (1). This is trivial. @ . 

Let x, denote the collection of all sets A € # with finite measure. Then we have 
THEOREM 2. The following conditions on the measure space (Q, ¥%,) are equivalent: 
(1) L?(p) > L4(p) ~for some p,q € (0, 00) with p < q, 

(2) supge.w, (EL) < 0, 

(3) L?(p) > L4(p) ~~ for all p,q € (0, 0) with p < q. 


Proof. (1) = (2). As in Theorem 1, we can assume p > 1, so by Lemma 1 there is a positive 
constant k such that || fll, < Allf|l, for every f © L7(p). It follows that 


w(E) <k?V4-P) for every EE &,,, 
and hence condition (2) holds. 
(2) = (3). Let f © L4(p) and let 
E, = {1/(n+ 1) <|f| < 1/n}, n=1,2,.... 
Then 


m(E,) < (n+ 1)" fl? dp < 0 for every n = 1,2,..., 
Q 


and hence, by condition (2), L*°_,y(E,,) < oo, because the E,,’s are pairwise disjoint. Now for 
Pp < q we have 


(oe) 


_ ; 
P du = P du + Pdu< [\f\t'dut+ ¥, —n(E,) < o. 
furan fo Wedus Df ifedn<fierdn+ 2 pat.) 


u= 


(3) = (1). This is trivial. @ 
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SPACES WHERE ALL CONTINUITY IS UNIFORM 


M. ARALA CHAVES 
Departamento de Matematica, Faculdade de Ciéncias, 4000 Porto, Portugal 


Elementary topology courses normally include a proof that all continuous functions from a 
compact metric space to a metric space are uniformly continuous. We abbreviated this by saying 
that for compact metric spaces all continuity is uniform. The aim of this note is to give several 
equivalent conditions, which are necessary and sufficient for all continuity to be uniform. It is 
easy to check that compactness is not such a condition, because necessity fails. 

The conditions are stated formally in the following theorem. 


THEOREM. For a metric space (X,d), the following conditions are equivalent: 

1. Every continuous function from X to any metric space is uniformly continuous; 

2. every open covering of X has a Lebesgue number; 

3. for every sequence (x,,) into X which has no convergent subsequence, the only sequences (x? ) 
in X such that lim d(x,,, x;,) = 0 are those which are almost equal to (x,,), in the sense that x, = x} 
for all but a finite set of indices; | 

4. for any infinite svibset A of X without accumulation points (in X), the infimum of the distances 
between (different) points of A is greater than 0. 

The following observations will help to explain how the theorem comes about and how the 
proof is constructed. Conditions 2 and 3 of the theorem were motivated by a careful analysis of 
two standard proofs of uniform continuity on compact metric spaces. In fact, one of these proofs 
[2; 11.4, p. 234] merely uses the property of compact metric spaces that every open covering has a 
Lebesgue number, 1.e., a number 6 > 0 such that each 6-ball is contained in a set of the covering. 
That property, which is precisely our Condition 2, is strictly weaker than compactness, as can be 
easily seen by considering an infinite set with the discrete metric. It turns out, in fact, that uniform 
continuity of all continuous functions is equivalent to the assertion that every open covering has a 
Lebesgue number. The other proof of the uniform continuity on compact metric spaces [1; 3.16.5, 
p. 58] uses the characteristic property of compact (metric) spaces that every sequence has a 
convergent subsequence. It is easy to see that the proof still works if we assume the weaker 
Condition 3. The interesting point is that once more we have a condition—the third one of our 
theorem—which is not only sufficient, but also necessary for all continuity to be umform. 
Condition 4 is a slightly different and perhaps more suggestive version of Condition 3. 


Proof. We shall prove our theorem by showing that] > 4>3>2=>1. 

To prove that 1 implies 4, we will begin by assuming the existence of an infinite subset A of X 
with no accumulation point in X and such that the infimum of the distances between different 
points of A is zero. The existence of such a set will enable us to define a continuous function from 
X to R, which is not uniformly continuous. The general lines for the definition of such a function 
are as follows: 

We first construct a locally finite sequence of balls (B(x,,, R,,)) such that: @) x, © A, for all n; 
(ii) each ball B(x,, R,,) has at least one point x’ of A distinct from x,,; (iii) the sequence (R,,) of 
the radii converges to 0. For each n, we then define a real function f, with support contained in 
B(x,,R,,) and such that f,(x,) = 1 and f,(x,,) = 0. These choices can be made in such a way 
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that each function vanishes on all points x’,. The sum of these f, is the continuous function f we 
wanted to define: the fact that (d(x,,, x;,)) converges to zero, and that f(x,) > 1 and f(x/) = 0, 
ensures that f is not uniformly continuous. 

Let us now look into the technical details. Let 7 = 1; assuming r, > 0 to be defined, let 
R,., = min(v,,1/(i + 1)). By hypothesis, there exist in A two distinct points x,;,, and x;,,, such 
that d(x;.1,%/41) < R,.13 we choose a positive r,,, < d(x,,1,X/,,), such that the r,, ,-balls 
centered at x;,, and x/,, intersect A precisely in their centers. Such an r,,, exists, because x;,, 
and x/,, are not accumulation points of A. Let us now see why the choices made imply that 
Xjy XpyX) 9%; are all different for i#j. In fact x;,x; (and x,,x;) were chosen distinct; by 
symmetry it is then sufficient to show that, if i < /, the points x;,x; are different. Now, if i <j, 
we have R, <r, and so the ball B(x,,7;) will have at least two points of A, x; and x‘; since 
B(x;,7,) has just one point of A, we have x, # x,. Let us prove that the sequence (B (x,. R,,)) 1s 
locally finite. We first remark that, for each a © xX, there exists an R > 0, such that B(a, R)\ {a} 
does not intersect A. If a = x, for some no, then the balls B(x,, R,) and B(a, R/2) are disjoint 
for n > max(n,,2/R). If no term of the sequence equals a, then the same assertion is true if we 
take n, = 1. This concludes the proof that (B(x,,, R,,)) is locally finite. By defining, for n € N 
and x € X, 


f, (x) = max(0,1 — d(x,,x)/r,), 
r,) is included in B(x,, R,,). 


we have a continuous real function f, on X whose support B(x,,, 7, 

As the sequence of these supports is locally finite, it makes sense to define f= Uf, and this 
function f is continuous. To finish the proof that 1 implies 4, we prove that f is not uniformly 
continuous. As d(x,,x,) < R, < 1/n, it is sufficient to check that, for each n, d(f(x,,), f(x, )) 
>1. From f,(x,) =1 it results trivially that f(x,) > 1 (and it is easy to see that in fact 
f(x,) = 1, although this is not essential to the proof). Let us now show that f(x’) = 0. For each 
iE N, x, is the only point of A in the ball B(x,,7,). Therefore x’, does. not belong to B(x,, 7,), 
since it is a point of A distinct from x, as we have seen. But, as f is zero Outside B(x,,7,), we 
conclude that f(x,,) = 0, so f(x,,) = 0. 

To prove that (4) implies (3), let (x,,) and (x/) be sequences such that: (i) (x,,) has no 
convergent subsequence; (11) (d(x,,, x/,)) ~ 0. We want to show that the two sequences are almost 
equal. We first remark that (x/) has no convergent subsequence either. Let A be the union of the 
ranges of the two sequences. A is an infinite subset of X with no accumulation points (in X). 
Condition (4) applied to this subset A ensures the existence of an e > 0, such that the distances 
between different points of A is greater or equal to e. For this e, there exists a k € N, such that 
d(x,,x/;)<e if n>k. But d(x,,x,,) < e implies x, = x/; so the two sequences are almost 
equal. 

Assuming now that X satisfies Condition (3), let us prove that every open covering of X has a 
Lebesgue number (Condition (2)). Suppose the contrary: let (U,);_<, be an open covering of X 
with no Lebesgue number. Then, for each n € N, 1/n is not a Lebesgue number of this covering; 
therefore there exists a ball of radius 1/n — B(x,,1/n)—not included in any of the open sets U.. 
Let i, be such that x, © U;; the relation B(x,,1/n) € U; implies the existence in B(x,,,1/n) 
of an element x/ distinct from x,. The sequences (x,,) and (x*) are not almost equal and 
(d(x, x,,)) converges to zero. To conclude the proof that (3) implies (2) it is sufficient to show 
that (x,,) has no convergent subsequence. If a were the limit of a subsequence (x,,_), we could 
choose an open set U, such that a © U; and a 6 > 0 such that B(a,d) C U;; there would be a 
peEN Suen that n, > 2/6 and x, E B(a,6/2). Therefore we would have B(x, ,1/n,) Cc 
B(a,8)c U,, contradicting the way the x, were chosen. 

For the sake of completeness, we reproduce here the usual proof that the second condition 
implies the first one. Given a continuous function f: X > Y and an e> 0, let d>0 be a 
Lebesgue number of the open covering (f~'(B(y, €/2)), <y of X. Each 6-ball in X is included in 
an open set of this covering, so that the distance between the images of two points of that ball is 
less than e; so d(x, y) < 6 will imply d( f(x), f(y)) < e. 


n°? 
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UNIQUE RIGHT INVERSES ARE TWO-SIDED 


WALTER RUDIN* 
Department of Mathematics, University of Wisconsin, Madison, WI 53706 


A theorem may be hard to discover, even though, once discovered, it is easy to prove. The 
point of this note is to emphasize that completely nonrigorous (some may say nonsensical) 
reasoning is perfectly acceptable in the discovery stage, and that it may furnish clues that enable 
one to make a good guess. Proofs can come later. 

Let R be an associative ring, not necessarily commutative, with unit element 1. Recall that an 
element a of R is said to be invertible if there exists b in R so that ab = 1 and ba = 1. The 
uniqueness of the inverse b is obvious; in fact, if ab = 1 and ca = 1, then 

b = (ca)b = c(ab) =c. 
Consider the following two questions: 

(I) If 1 — xy is invertible, must 1 — yx be invertible? 

(II) If the answer to (1) is yes, is there a simple universal relation between these two inverses, 
i.e., one that holds in every ring? 


Let us try to tackle this by thinking of, say, the complex numbers in place of our ring, where a 
simple relation exists between inverses and geometric series, namely 


1 
——— =1ltxtxr +x tee: 
1-x 


(at least when |x| < 1). It may be nonsense to talk of infinite series whose terms are members of 
our perfectly arbitrary ring R, but never mind. Pretend that the inverse z of 1 — xy is given by 
the geometric series 
z=1+(xy) +(x) +(xy?) + -:- 
=1+ xy + xyxy + xyxyxy tec: 
If we accept this, then the inverse w of 1 — yx, if there is one, ought to be 
w=1+ yx + pxyx + yxyxyx tere. 


Having gone this far, we might as well do a bit of factoring (i.e., assume that the distributive laws 
extend to our possibly nonexistent infinite series), write 


w=1l+y(l+xy+xxyt+---)x, 
and observe that the series in parentheses is the postulated expansion of z. We are thus led to 
w=1-+4 yzx. 


So far, we have proved nothing. But we have found a candidate for w, and can test whether it 
does the job. Indeed, 


* Partially supported by the National Science Foundation and by the William F. Vilas Trust Estate. 
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(1 — yx)w = (1 — yx)(1 + yzx) 
= 1 — yx + yzx — yxyzx 
=1-yx + y[(l — xy)z]x, 
which is 1, because the quantity in brackets is 1. 
A closer look at this computation shows something else: we have only used the assumption that 


z is a right inverse of 1 — xy and have deduced that w is then a right inverse of 1 — yx. The 
same is true with left in place of right, because 


(1 + yzx)(1 — yx) = 1 -— yx + y[z(1 — xy)] x. 
Thus we get the following result, which actually does more than just answer our two questions: 


THEOREM 1. If z is a right [left] inverse of 1 — xy, then 1 + yzx is a right [left] inverse of 
1 — yx. 


As we just saw, the proof of this theorem is a total triviality. But if we had been unwilling to 
use infinite series in a context where they may make no sense, how difficult would it have been to 
discover 1 + yzx? 

When y = 1, our two questions are of course pointless, but Theorem 1 tells us something even 
then that we might not have noticed otherwise, namely: 


If z is a right inverse of 1 — x, sois1 + zx. 


Let us now assume that 1 — x has a unique right inverse z. Then it follows that 1 + zx = z. 
This implies z(1 — x) = 1, so that z is also a left inverse of 1 — x! In other words, 1 — x is 
invertible. Since every element of R can be written in the form 1 — x, we have arrived at the 
result to which the title of this note alludes. 


THEOREM 2. The invertible elements of R are precisely those that have unique right inverses in R. 


Of course, the same is true with left in place of right. 
The following well-known fact from linear algebra is also an immediate consequence of 
Theorem 1: 


If A and B are n-by-n matrices over some field, then AB and BA have the same eigenvalues. 


P.S. After completing this note, I was told that the geometric series trick of finding 1 + yzx is described in [1] 
and [2]. However, no conclusions about one-sided inverses are drawn there. In the context of Banach algebras (where 
series do make sense) 1 + yzx occurs in an exercise on p. 259 of [4]. The referee has pointed out that Theorem 2 
appears as Exercise 6 on p. 89 of [3]. 
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General, P. AAAS Report X: Research & Development, FY 1986. AAAS, 1985, xviii + 270 pp, (P). [ISBN: 
0-87168-272-9] Tenth annual analysis of the federal R&D budget request prepared by AAAS. Includes 
chapters on agency budgets (NSF, DOD, Energy, etc.) and discipline analyses. In this budget, 
mathematics was recommended for a 154 increase at NSF, tilted towards young investigators. LAS 
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Bezuszka. Motivated Math Project Activity, Booklet 13. Boston College Pr, 1984, 240 pp, (P). All 
the max-min problems a teacher would ever want to see. Problems solved without calculus by writing 
computer programs to list all instances and looking for the maximum or minimum values. Emphasis on 
developing the algebraic equation through computation and observation of patterns. Useful as source 
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History, S*, P**, L**. I Want To Be A Mathematician: An Automathography. Paul R. Halmos. 
Springer-Verlag, 1985, xvi + 421 pp, $41.50. [ISBN: 0-387-96078-3] 400 pages of unadulterated Halmos 
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attribute of a genuine professional mathematician"), on writing ("I like words more than 
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and on himself: "My greatest strength as a mathematician is the ability to see when two things are 
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& Bacon, 1985, ix + 678 pp, $40. [ISBN: 0-205-08095-2] Unencyclopedic. For college juniors and 
seniors. On the level of Eves” popular history of mathematics. Accessible to general reader. Cov- 
ers 5000 years through the early part of the 20th century. Some emphasis on personalities. Prob- 
lems at various levels of difficulty are integral to the text and appear throughout. Author’s "per- 
sonal tastes and prejudices held sway" in selection of material. Numerous references, many to 
recent papers and articles, are given in each chapter. Suggestions for further reading. Well- 
illustrated. Altogether readable. JK 


Logic, S(16-18), P*, L**. Cantorian Set Theory and Limitation of Size. Michael Hallett. Logic 
Guides, No. 10. Clarendon Pr, 1984, xxii + 343 pp, $32.50. [ISBN: O0-19-853179-6] A comprehensive 
study of the impact that Cantor’s metaphysical, philosophical, and theological views had on his sub- 
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stitution of sets for numbers as the foundation of mathematics. Constructivist and "creational" 
metaphors in which sets exist in Divine intellect explain Cantor’s thesis that "possibility implies 
existence;" Aristotelian and scholastic teachings on the actual infinite led Cantor to posit an 
Absolute Infinite beyond the ken of human intellect. Corrects a number of common misunderstandings 
and establishes a new plateau for historical comprehension of Cantor’s monumental contribution to 
mathematics. LAS 


Combinatorics, P. Finite Generalized Quadrangles. S.E. Payne, J.A. Thas. Research Notes in Math., 
No. 110. Pitman, 1984, 312 pp, $22.95 (P). [ISBN: 0-273-08655-3] Combinatorial and geometric study 
of a type of point-line incidence structure introduced by J. Tits. General structure, combinatorial 
characterizations, embeddings in affine and projective spaces, collineation groups, coset 
geometries. Analysis of "small" examples. RM 


Number Theory, T(18: 1), S, P. Arithmetic Functions and Integer Products. P.D.T.A. Elliott. Grund. 
der math. Wissenschaften, B. 272. Springer-Verlag, 1985, xv + 461 pp, $64. [ISBN: 0-387-96094-5] 
The aim of this volume is to introduce a new aesthetic into the study of arithmetic functions by 
systematically applying them to certain problems in algebra. For example, the problem of represent- 
ing an integer as a product to certain rationals touches upon the theory of denumerably infinite 
groups. An impressive volume with many new results. Exercises, unsolved problems, and an extensive 
bibliography. CEC 


Multiplikation. Claus-Gunther Schmidt. Springer-Verlag, 1984, x + 96 pp, $7.50 (P). [1SBN: 0-387- 
13863-3 |] 


Number Theory, S(17), P*, L. Exercises in Number Theory. D.P. Parent. Problem Books in Math. 
Springer-Verlag, 1984, x + 541 pp, $38. [ISBN: 0-387-96063-5] A translation of the 1978 French edi- 
tion. A collection of about 200 problems and solutions from 10 different areas of number theory. 
Each section includes a brief introduction to the area. The combined work of a group of French 
number theorists. A valuable resource. CEC 


Group Theory, S$(18), P. Lecture Notes in Mathematics-1084: Structure and Representations of Q- 
Groups. Dennis Kletzing. Springer-Verlag, 1984, vi + 290 pp, $13.50 (P). [ISBN: 0-387-13865-X] 
Investigation of the structure of rational groups (finite groups whose complex representations have 
rationally valued characters, e.g., Weyl groups) and the relationship between the rationally 
represented characters and the permutation characters. Uses techniques based on algebraic geometry 
(local characters, Burnside rings, etc.). RM 


Topological Groups, P. Lecture Notes in Mathematics-l1079: Compact Semitopological Semigroups: An 
Intrinsic Theory. Wolfgang Ruppert. Springer-Verlag, 1984, 260 pp, $13.50 (P). [ISBN: 0-387- 
13387-9] Presents major areas of current research on compact semitopological semigroups and semi- 
group compactifications. Emphasis is on the topological-algebraic structures involved; methods from 


functional analysis are avoided as much as possible. DA 


Topological Groups, P. Transformation Groups on Manifolds. Ted Petrie, John D. Randall. Pure & 
Appl. Math., V. 82. Dekker, 1984, vii + 266 pp, $49.75. [ISBN: 0-8247-7156-7] Gives a discussion 
of some aspects in the study of transformation groups of manifolds. It features equivariant K- 
theory, new material on equivariant surgery, and Smith equivalence. LN 


and Methods. David Benson. Springer-Verlag, 1984, xi + 231 pp, $11 (P). [ISBN: 0-387-13389-5] 
These lecture notes consist of two chapters. The first gives background material from the theory of 
rings and modules, and the second treats representation rings, almost split sequences and the 
Auslander-Reiten quiver, and complexity and cohomology varieties in detail. The material covered 
has little overlap with existing texts. Includes exercises and references. CEC 


Calculus, T(13: 2). BASIC Fundamentals and Style. James S. Quasney, John Maniotes. Boyd & Fraser, 
1984, xvi + 463 pp, $19 (P). [ISBN: 0-87835-138-8] A BASIC programming text which emphasizes princi- 
ples of structured programming. Many skill exercises; excellent programming problems. Gocd chapter 
summaries and self-test exercises. MA 


Calculus, T*(13: 1). Calculus: A Modeling Approach, Third Edition. Marvin L. Bittinger. Addison- 
Wesley, 1984, xi + 559 pp, $31.95. [ISBN: 0-201-11217-5] For a one-term course in calculus. Appli- 
cations in business, economics, the behavioral sciences, the social sciences, biology and medicine. 
Changes include availability of computer software keyed to the text, video tape reviews for self- 
study and a student’s guide to the exercises. "Challenge" problems have been added. Approach 
remains "experience based." Leisurely-paced and readable with nice graphics. If a course you offer 
fits, have a look. (First Edition, TR, June-July 1976; Second Edition, TR, January 1981.) JK 


Complex Analysis, T(18: 2). Compact Complex Surfaces. W. Barth, C. Peters, A. Van de Ven. Erg. der 
Math. und ihrer Grenzgebiete, 3. Folge, Band 4. Springer-Verlag, 1984, x + 304 pp, $38. [ISBN: 0- 
387-12172-2] A nearly complete theory of compact complex surfaces, including a nice historical 
introduction. LN 


Complex Analysis, P. Proceedings of the Conference on Banach Algebras and Several Complex Variables. 
Ed: F. Greenleaf, D. Gulick. Contemp. Math., V. 32. AMS, 1983, xiii + 295 pp, $28 (P). [ISBN: 0- 
8218-5034-2] Twenty-six papers constituting the proceedings of a conference on Banach algebras and 
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several complex variables, held at Yale in June, 1983. The papers treat Banach algebras, function 
algebras, and finite and infinite dimensional holomorphy. PZ 


Partial Differential Equations, T(18: 1, 2), S. Maximum Principles in Differential Equations. Mur- 
ray H. Protter, Hans F. Weinberger. Springer-Verlag, 1984, x + 261 pp, $28. [ISBN: 0-387-96068-6] 
A comprehensive introduction to the idea of maximum principle as it occurs in partial differential 
equations. Four chapters develop the theory in the settings of one dimension, elliptic equations, 
parabolic equations, and hyperbolic equations. With exercises and an extended bibliography. 
Reprint of 1967 Prentice-Hall edition (TR, April 1968). Pz 


Partial Differential Equations, T*(15-16: 1), S, L. Boundary-Value Problems. Ladis D. Kovach. 
Addison-Wesley, 1984, xii + 419 pp, $33.95. [ISBN: 0-201-11728-2] Separation of variables and 
transform (Laplace and Fourier) methods for solving second-order linear partial differential equa~- 
tions which arise in applications. Light on theory but honest. References provided for omitted 
proofs. Exercises to clarify the text and to fill in omitted details, to provide practice in using 
the methods and to extend the theory. Some are of a computational nature. Numerous footnotes with 
historical tidbits. Readable, non-overpowering, of manageable length in one course and _ student- 
friendly. JK 


Functional Analysis, $(18), P. Boundary Methods: An Algebraic Theory. Ismael Herrera. Applic. 
Math. Ser. Pitman, 1984, viii + 136 pp, $32.95. [ISBN: 0-273-08635-9] An outgrowth of work on dif- 
fraction problems that occur in geophysics and seismic engineering. The algebraic theory is formu- 
lated in the setting of general functional-valued operators defined on arbitrary linear spaces which 
do not generally possess an inner product or a metric. Employs Trefftz procedure and Green’s formu- 
las. MU 


Analysis, $(18), P. On a New Method of Analysis and Its Applications. Paul Turan. Wiley, 1984, xvi 
+ 584 pp, $49.95. [ISBN: 0-471-89255-6] The author’s theory of "power sums" (linear combinations of 
powers of complex numbers) arose from his study of the Riemann hypothesis. In Part I, the minimax 
theory of power sums is developed. Part II is applications, principally to analytic number theory, 
but also to numerical analysis, complex analysis, and differential equations. There are two sec- 


tions on unsolved problems. PZ 


Analysis, T(18: 1, 2), S*, P. Exercises in Integration. Claude George. Problem Books in Math. 
Springer-Verlag, 1984, x + 550 pp, $38. [ISBN: 0~-387-96060-0] Eleven chapters of graduate-level 
exercises, each with a clear and detailed solution. Nearly all integrais are with respect to Lebes- 
gue measure on R. Fourier analysis figures prominently. Zeroth chapter outlines the necessary 
theory, without proofs. PZ 


Analysis, P. The Geometric Theory of Ordinary Differential Equations and Algebraic Functions. 
Georges Valiron. Transl: James Glazebrook. Lie Groups: History, Frontiers & Applic., V. XIV. Math 
Sci Pr, 1984, xiii + 576 pp, $75. [ISBN: 0-915692-38-4] This classic treatise of analysis provides a 
fascinating exposition of Riemann surfaces, elliptic integrals, Weierstrass” preparation theorem in 
two complex variables, as well as a detailed study of differential equations and of the calculus of 
variations. Bad typing has disfigured this work, however--everywhere numerous lacunae stand as 
ghosts of missing clauses, absent symbols make formulae meaningless, and even characters have under- 
gone odd permutations. Valiron’s masterpiece deserved better care. YN 


Algebraic Geometry, P. Lecture Notes in Mathematics-1056: Algebraic Geometry, Bucharest 1982. Ed: 
L. Badescu, D. Popescu. Springer-Verlag, 1984, 380 pp, $16 (P). [ISBN: 0-387-12930-8] Contains 
twelve research papers in algebraic geometry which were presented at an August 1982 conference heid 


at the University of Bucharest. LN 


Algebraic Geometry, T(16-18: 2). Geometry of Projective Algebraic Curves. Makoto Namba. Pure & 
Appl. Math., V. 88. Dekker, 1984, x + 409 pp, $83.50. [ISBN: 0-8247-7222-9] Discusses algebraic 
plane curves including a classification of the singularities of curves of low degree. After intro- 
ductory material on general varieties, the monograph gives most of the classical theory of curves 
without using sheaf cohomology. LN 


Algebraic Geometry, P. Lectures on Results on Bezout’s Theorem. W. Vogel. Springer-Verlag, 1984, 
132 pp, $7.30 (P). [ISBN: 0-387-12679-1] The main result shows that the product of the degrees of 
two intersecting subvarieties of projective n-space can be expressed in terms of degrees of sub- 
varieties of the intersection and lengths of primary ideals. This generalizes results on Bezout and 


is valid even if the intersection iS excess. LN 


Algebraic Geometry, T(18: 2), P. Isolated Singular Points on Complete Intersections. E.J.N. Looi- 
jenga. London Math. Soc. Lect. Note Ser., No. 77. Cambridge U Pr, 1984, xi + 200 pp, $22.95 (P). 
[ISBN: 0-521-28674-3] An organized text on isolated singularities of complete intersections. Con- 
tains a treatment of Milnor fibrations, monodromy, critical spaces, deformations, and the Gauss- 
Manin connection. LN 


Algebraic Geometry, P. Introduction to Intersection Theory in Aigebraic Geometry. William Fulton. 
CBMS Reg. Conf Ser. in Math., No. 54. AMS, 1984, v + 82 pp, $16 (P). [ISBN: 0-8218-0704-8] An 
introduction to the modern theory of excess intersection theory of algebraic varieties. Gives a 
readable presentation with many examples and applications. LN 
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Algebraic Geometry, P. Gohomology of Quotients in Symplectic and Algebraic Geometry. Frances Clare 
Kirwan. Math. Notes, No. 31. Princeton U Pr, 1984, 210 pp, $17.50 (P). [ISBN: 0-691-08370-3] 
Kirwan develops a general procedure for computing the rational cohomology of quotients of groups 
acting on nonsingular complex projective varieties. She has two approaches--one algebraic and one 
based on Morse theory and symplectic geometry. LN 
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Peter F. Stiller. Aspects of Math., V. E5. Friedr. Vieweg & Sohn (US Distr: Heyden & Son), 1984, 
194 pp, $14 (P). [ISBN: 3-528-08587-8] Deals with the question of how many curves there are on an 
elliptic surface up to algebraic equivalence. Includes a large number of examples for which the 
Picard number is explicitly determined. Includes new results. References. CEC 


Differential Geometry, P. Harmonic and Minimal Maps: With Applications in Geometry and Physics. 
Gabor Téth. Ser. in Math. & Its Applic. Halsted Pr, 1984, v + 342 pp, $70. [ISBN: 0-470-20127-4] 
Studies the geometry of harmonic and minimal maps into spaces of constant curvature. Gives applica- 
tions to physics and geometry. LN 


Geometry, S*, P*¥*, L*¥*. Mathematics and Optimal Form. Stefan Hildebrandt, Anthony Tromba. Scien- 
tific American Books (Distr: WH Freeman), 1984, xiii + 215 pp. [ISBN: 0-7167-5009-0] A stunning 
presentation of the geometry of optimal shapes--minimal paths and surfaces, soap films--together 
with stories of the mathematicians who discovered them. Profusely illustrated with both drawings 
and photographs, this first mathematics volume in the Scientific American Library is a dazzling mas- 
terpiece of exposition and geometry. LAS 


Geometry, P, L. Geometry of Spatial Forms. Peter C. Gasson. Ser. in Math. & Its Applic. Halsted 
Pr, 1983, xii + 561 pp, $64.95. [ISBN: 0-470-20011-1] Intended for shape design architects and 
engineers, this “encyclopaedic coverage of all the important concepts and formulae of applicable 
geometry” would be extremely useful to anyone interested in computer-aided graphics. JNC 


Algebraic Topology, P. Lecture Notes in Mathematics-1051: Algebraic Topology, Aarhus 1982. Ed: I. 
Madsen, B. Oliver. Springer-Verlag, 1984, x + 665 pp, $25.50 (P). [ISBN: 0-387-12902-2] Proceed- 
ings of an August 1982 conference at the Mathematics Institute at Aarhus, Denmark. Includes papers 


in algebraic K~ and L-theory, geometry of manifolds, homotopy theory, and transformation groups. LN 


Operations Research, T**(14), L. A First Course in Mathematical Modeling. Frank R. Giordano, Mau- 
rice D. Weir. Brooks/Cole, 1985, xvii + 382 pp, $32.50. [ISBN: 0-53-403367-9] Provides an early 
modeling experience assuming only one-variable calculus. Discussion of creative and empirical model 
construction, model fitting, models requiring optimization techniques, and the modeling of dynamic 
behavior. Excellent examples. Suggestions for projects and for further reading. Highly readable. 


JRG 


Optimization, 1T(14-15). Linear Programming: An Introduction to Finite Improvement Algorithms. 
Daniel Solow. Elsevier Sci, 1984, xv + 392 pp, $37. [ISBN: 0-444-00912-4] Why another linear pro- 
gramming book? The author’s distinctives: greater emphasis on proofs (including an entire chapter 
on the nature and methods of mathematical proof); a downplay of the tableau method in favor of an 
emphasis on conceptualization (who does these by hand anyhow?); an account of most recent computa- 
tional techniques. An attractive book! AWR 


Statistics, S(14-18), P. Applied Factorial and Fractional Designs. Robert A. McLean, Virgil L. 
Anderson. Statistics, V. 55. Dekker, 1984, ix + 373 pp, $65. [ISBN: 0-8247-7154-0] About 100 pages 
of non-theoretical discussion of factorial and fractional designs followed by a 260-page appendix 
containing three out-of-print publications on designs from the National Bureau of Standard and 
tables for partially nonorthogonal designs. FLW 


Statistics, T(13-14: 1). Basic Statistics for Nurses, Second Edition. Rebecca Grant Knapp. Wiley, 
1985, xii + 418 pp, $15.95 (P). [ISBN: 0-471-87563-5] This revision of this non-mathematical intro- 
duction contains new material on estimation, the power of a test, and experimental design. It also 
includes a decision tree to help guide one to the appropriate analysis. (First Edition, TR, 


December 1978.) FLW 


Statistics, T, $(14-15). Computer Modeling for Business and Industry. Bruce L. Bowerman, Richard T. 
O’Connell. Statistics: Textbooks & Mono., V. 59. Dekker, 1984, vii + 219 pp, $23.75. [ISBN: 0- 
8247~-7296-2] Self-contained chapters on basic descriptive statistics and the chi-square test, 
regression analysis, elementary time-series, and linear programming illustrate the use of SAS and 
LINOD to solve problems in statistics and operations research. Intended as a supplement to business 
statistics and quantitative analysis courses. JRG 


Statistics, P. Design of Experiments, Ranking and Selection: Essays in Honor of Robert E. Bechhofer. 
Ed: Thomas J. Santner, Ajit C. Tamhane. Statistics, V. 56. Dekker, 1984, xxii + 302 pp, $45. 
[ISBN: 0-8247-7274-1] A biographical essay and bibliography for Robert E. Bechhofer followed by 21 
research papers on multiple comparisons, selection and ranking, and estimation and testing. FLW 


Statistics, T*(14-15: 1, 2). Probability and Statistics for Engineers, Third Edition. Irwin Miller, 
John E. Freund. Prentice-Hall, 1985, x + 530 pp, $34.95. [ISBN: 0-13-711938-0] Revision of the 
author’s 1977 Second Edition (TR, November 1977). Major changes include an updating of the applica- 
tions, notation and terminology, a revision of the exercises (particularly in the earlier chapters), 
new material on combinatorial methods and stem-and-leaf plots, a revised treatment of statistical 
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inference, and the addition of a check list of key terms at the end of each chapter. RSK 


Statistics, P*, Lecture Notes in Statistics-25: Time Series Analysis of Irregularly Observed Data. 
Ed: Emanuel Parzen. Springer-Verlag, 1984, 363 pp, $21 (P). L[ISBN: 0-387-96040-6] Proceedings of a 
symposium held at Texas A&M University in February, 1983. Contains 16 papers by active researchers 
in this important srea., RSK 


Statistics, T(1). Business Statistics: Concepts and Applications, Secoud Edition. William J. 
Stevenson. Harper & Row, 1985, xvii + 637 pp, $31.95. [ISBN: 6-06-046442-9; Probability, sampling, 
estimation, hypothesis testing, regression, correlation, index numbers and time series. New topics 
in Second Edition (First Edition, TR, December 1978): multiple regression, decision theory, two-way 
analysis of variance. Chapters, not sections, are basic units. Informal approach--no proofs. Each 
chapter includes outline, objectives and review. Study guide available. MA 


Statistics, TC1\7-18: 1, 2), P*¥. Statistical Inference Based on Ranks. Thomas P. Hettmansperger. 
Wiley, 1984, xvii + 323 pp, 537.50. L[ISBN: 0-471-88474-X] In the Wiley Series in Probability and 
Mathematical Statistics. Major goal is "to develop a cchercnt and unified set of statistical 
methods (based on ranks) for carrying out inferences in various cxperimental situations." Includes 
one and two-sample location models, one and two-way iavouts, a systematic treatment of the general 
linear model, and a final chapter on multivariate location models. Emphasizes both efficiency and 


robustness of these methods. Good set of references. RSK 


Statistics, P, L*, Handbook of Applicable Mathematics. V. VI, Statistics. Ed: Emlyn Lloyd. Wiley, 
1984, $85 each [ISBN: 0-471-90024-9]. Part A, xxxviii + 541 pp [ISBN: 0-471-90274-8]; Part B, xxxvi 
+ 486 pp. [ISBN: 0-471-90272-1] Fifth volume to appear of this six-volume series aimed at profes~ 
sional users of mathematics (Volume V: Geometry and Combinatorics has yet to appear). (For descrip- 
tion of series, see TR, December 1980, of Volume I: Algebra.) Part A deals with basic principles of 
estimation and testing. Part B treats linear models, sequential analysis, distribution~free 
methods, Bayesian statistics, multivariate analysis, time series, decision theory and Kalman filter- 
ing. RSK 


Statistics, $(15-17), P. Applied Statistics: A Handbook of Techniques, Second Edition. Lothar 
Sachs. Trans]: Zenon Reynarowych. Ser. in Statistics. Springer-Verlag, 1984, xxviii + 707 pp, 
$54, [ISBN: 0-387-90976-1] Modest revision of the .author’s 1982 First Edition (TR, June-July 1983). 
Changes include some corrections and insertions in the text, and some additions and deletions from 
the indices and bibliography. RSK 


Statistics, P=. The Collected Works of George E.P. Box. Ed: George C. Tiao. Stat. & Prob. Ser. 
Wadsworth, 1985. Volume I, xiv + 657 pp, $42.95 [ISBN: 0-534-03307-5]; Volume II, xiv + 710 pp, 
$44.95. [ISBN: 0-534-03308-3] Includes a major portion of his articles. Volume I contains papers in 
Lwo main areds: statistical inference, robustness, and modeling strategy; and experimental design 
and response surface methodology. Volume If has three parts: time series analysis and forecasting; 
distribution theory, transformation of variables, and nonlinear estimation; and applications of 
statistics. R&K 


Computer Literacy, S, P, L*. The Sachertorte Algorithm and Other Antidotes to Computer Anxiety. 
John Shore. Viking, 1985, xvi + 270 pp, $16.95. [ISBN: 0-670-80541-6] A sound, informative and 
well-written introduction to computers for computer~anxious laymen. Totally avoiding the cute 
patronizing tone of so many books of this genre, physicist Shore offers a perceptive, wise and bal- 
anced view of computers, citing appropriate examples and authorities from both computer science and 
society. A worthy sequel to Weizenbaum’s Computer Power and Human Reason. LAS 


Computer Programming, T(14: 1), S$, L. Problem Solving with Fortran. Richard W. Dillman. Holt, 
Rinehart & Winston, 1985, xiii + 354 pp, (P). [ISBN: 0-03-063734-1] Introduction to programming 
using ANSI FORTRAN 77. Use of structured control primitives gives the book much of its strength. 
Good programming style. Emphasis on stepwise refinement is lacking. FA 


Computer Prograsming, S(15~-16). MACRO 11 Programming. C.C. Zammit. Adam Hilger (US Distr: Heyden & 
Son), 1984, xi + 119 pp, $19 (P). LISBN: 0-85274-769-1] A comprehensive tutorial on Macro-1ll assem- 
bly language programming. Sections describe the PDP-ll architecture, the assembler, the instruction 
set, the use of macros, I/0 and modular programming. FA 


Computer Programming, T(14-16: 1), S, P, L. Understanding C. Bruce H. Hunter. Sybex, 1984, xiv + 
320 pp, $17.95 (P). [ISBN: 0-89588-123-3] Written for a beginning C programmer (who knows BASIC, at 
least), this introduction moves quickly through data types and pointers to an extensive treatment of 
C functions. Intended for the microcomputer (CP/M or MSDOS) world, it includes extensive tables 
comparing various C compilers. No exercises. LAS 


Software Systems, S, P. The TK!Solver Book: A Guide to Problem-Solving in Science, Engineering, 
Business, and Education. Milos Konopasek, Sundaresan Jayareman. Osborne/McGraw-Hill, 1984, vi + 
458 pp, $19.95 (P). [ISBN: 0-88134-115-0] A thorough introduction to "TK!Solver," the most commonly 
used "tool kit" package for solving algebraic equations. The system stores "rules" relating vari~ 
ables to each other, then solves (directly or by iteration) for any one given adequate data on the 
others. This package has enormous potential for imaginative pedagogy in high school mathematics. 
LAS 
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Computer Science, P. Lecture Notes in Computer Science-170: 7th International Conference on 
Automated Deduction. Ed: R.E. Shostak. Springer-Verlag, 1984, vi + 508 pp, $22.50 (P). LISBN: 0- 
387-96022-8] Proceedings of a May 1984 conference held in Napa, California. The keynote address 
gives an overview of the field. LN 


Computer Science, P. Lecture Notes in Physics: WOPPLOT 83: Parallel Processing: Logic, Organization, 
and Technology. Ed: J. Becker, I. Eisele. Springer-Verlag, 1984, v + 189 pp, $10 (P). [ISBN: 0- 
387-12917-0] Proceedings of a June 1983 workshop held at the Federal Armed Forces University in Mun- 
ich. LAS 


Computer Science, P. Synthesis of Digital Designs from Recursion Equations. Steven D. Johnson. MIT 
Pr, 1984, xv + 207 pp, $30. [ISBN: 0-262-10029-0] Functional languages (Scott-Strachey formalism) 
and applicative notation are used as basis for digital design. Design process viewed as transforma- 
tion from specification to realization, with self-referential realization corresponding to feedback 
in physical implementation. 1983 ACM Distinguished Dissertation. RM 


Computer Science, P. Automated Theorem Proving: After 25 Years. Ed: W.W. Bledsoe, D.W. Loveland. 
Contemp. Math., V. 29. AMS, 1984, ix + 360 pp, $30 (P). [ISBN: 0-8218-5027-X] Expanded proceedings 
of a special session on automated theorem held at the 1983 Denver meeting of AMS. First article is 
an introduction to and overview of the field. LN 


Computer Science, T(13-14). Computer Mathematics. D.J. Cooke, H.E. Bez. Computer Sci. Texts, No. 
18. Cambridge U Pr, 1984, xii + 394 pp, $49.50; $19.95. [ISBN: 0-521-25341-1; 0-521~-27324-2] A 
textbook to acquaint computer science students with needed mathematical structures and techniques, 
e.g., algebraic structures, matrices, graph theory, languages and grammars, finite automata, com- 
puter geometry. LN 


Applications (Artificial Intelligence), P. Developments in Expert Systems. Ed: M.J. Coombs. Com- 
puters & People Ser. Academic Pr, 1984, xv + 253 pp, $15. [ISBN: 0-12-187580-6] 13 articles on the 
methods and applications of expert systems, knowledge-based computer systems for solving problems at 
an expert level, employing casual knowledge, heuristics, user interaction, explanations, set cover- 
ings and natural language understanding techniques. Applications include electronic circuits, medi- 
cal diagnosis, business operations, and processing assistance, biographies and forecasting. RWN 


Applications (Economics), P. Practical Statistical Sampling for Auditors. Arthur J. Wilburn. 
Statistics, V. 52. Dekker, 1984, x + 410 pp, $39.75. [ISBN: 0-8247-7124-9] Non-mathematical 
description of various sampling procedures appropriate for use in auditing, intermixed with discus- 
sions of related auditing problems. Includes a presentation of the author’s flexible sampling stra- 
tegy, a Bayesian approach which 18 particularly suitable in this context. RSK 


Applications (Engineering), T(17). Urban Transportation Networks: Equilibrium Analysis with 
Mathematical Programming Methods. Yosef Sheffi. Prentice-Hall, 1985, xvi + 399 pp, $45.95. [ISBN: 
0-13-939729-9] The flow pattern of traffic through an urban network is analyzed by modeling travel 
decisions and network congestion. The book grew out of notes used in undergraduate and first year 
graduate courses in traffic management at MIT. AWR 


Applications (Physics), $(18), P. Mathematical Aspects of Superspace. Ed: H.-J. Seifert, C.J.S. 
Clarke, A. Rosenblum. NATO ASI Ser. C, V. 132. D Reidel, 1983, xii + 214 pp, $39. [ISBN: 90-277- 
1805-9] These papers represent an effort to unify and extend the mathematics underlying modern rela- 
tivity and quantum theory. Such questions are addressed as "Does there exist a superspace having 
all the properties that physicists require of it?" Very densely written, requiring familiarity with 
a great deal of mathematics and physics. MU 


Applications (Social Science), T*(15~-16: 1), S*, L. Probability in Social Science. Samuel Goldberg. 
Math. Modeling, No. 1. Birkhauser Boston, 1983, xii + 119 pp, $18.95. [ISBN: 3-7643-3089-9] Revi- 
sions of seven of fifteen CUPM modules originally distributed in an MAA sourcebook in 1977. Topics 
range from power indices to glottochronology (rate of change of language), and employ primarily 
undergraduate probability. Each module concludes with careful exercises and an extensive bibliogra- 
phy making contact with current social science research. LAS 
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St. Olaf; FLW: Frank L. Wolf, Carleton; PZ: Paul Zorn, St. Olaf. 
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Section Reports 


An asterisk (*) by the title of a paper indicates that copies of the paper are available from 
the author. Papers presented under special sponsorship as part of joint meetings are so noted in 
parentheses, 


Eastern Pennsylvania and Delaware Section 


The annual spring meeting of the Eastern Pennsylvania and Delaware Section was held on March 
30, 1985 at Bloomsburg University, Bloomsburg, Pennsylvania. There were approximately 90 registra~ 
tions. 


Invited Lecture: 


"Mathematical Education in America," by Garrett Birkhoff, Harvard University. 


Workshop: 


"Discrete Algorithmic Mathematics," by Stephen B. Maurer, Swarthmore College. 


Florida Section 


The eighteenth annual meeting of the Florida Section was held on March 8-9, 1985 at Stetson 
University in DeLand, Florida. There were 147 registrants. 


Invited Addresses: 


"Classification of the Simple Groups," by Daniel Gorenstein, Rutgers University. 

"Renewing Collegiate Mathematics," by Lynn Arthur Steen, St. Olaf College. 

"Some Problems Neither My Computer Nor I Can Solve--YET," by Richard V. Andree, University of 
Oklahoma. 

"What’s the Delay? An Introduction to Differential Equations Involving Delays," by Mary Parrott, 
University of South Florida. 

"An Interesting Linear Transformation and Its Generalizations," by Sidney Kung, Jacksonvil 
University. 

"Space-Time Geometry.,"' by Gregory Galloway, University of Miami. 

"Aspects of Legislation Affecting Mathematics Education," by T.K. Wetherell, Chair, Committee on 
Higher Education, Florida House of Representatives. 


Short Presentations: 


"An Introduction to TEAM, Teaching Experimental Applied Mathematics," by David Lawson, Stetson 
University. 

"Inner Models of Zermelo-Fraenkel Set Theory with Urelements (ZFU) and Standard Models of Arith- 
metic," by David Lawson, Stetson University. 

"A Connected Locally-Connected Countable Space Satisfying the Separation Axiom T, - (l/n) for 
Every n," by Jennifer L. Davidson and Gerhard Ritter, University of Florida. 

"Discovering Mathematical Concepts Through the Coriputer," by» Gareth Williams, Stetson University. 

"Some Notes on Heron Triangles," by Alan Wayne, Pasco-Huernando Community College. 

"Computers for Communication in Natbematics,' by Beverly Brechner, University of Florida. 

"Matrices That Multiply Like Rabbits," vv '.P. Miles, Florida State University. 

"Using the Computer in Analysis--Chasing Delta Through Dirichlet’s Function on the MacIntosh," by 
Harold Dansberger, Stetson University. 

"Arcade Games, Kidney Stones and Reflections on an Ellipsoid: A Computer Graphics Demonstration in 
Color," by Arthur Cummer, University of Florida. 

"The Geometry and the Group Theory of Special Relativity," by Ernest A. Thieleker, University of 
South Florida. 


Special Sessions: 

"Articulation Session," organized by Ernest R. Ross, Jr., St. Petersburg Junior College. 

"TCME-5 Panel Discussion," organized by Don Hill, Florida A&M University; Robert Kalin, Florida 
State University; Don and Betty Lichtenberg, University of South Florida. 

"Special Session on Certification," organized by Louis M. Edwards, Valencia Community College; Don 
Lichtenberg, University of South Florida; Tom Denmark, Florida State University; Clem Boyer, 
Florida Council of Teachers of Mathematics. 

Regional Meeting Session, organized by Elizabeth Magarian, Stetson University. 

Florida Two-Yeer College Mathematics Association Meeting, organized by Dale Grussing, Miami-Dade 
Community College, North. 

Women in Mathematics Meeting, organized by Marilyn Repsher, Jacksonville University: "Education 
and Examination Policies of the Actuary Societies," by Ruth Frew, American Society of Pension 
Actuaries. 

Florida Association of Mathematical Educators Meeting, organized by Robert Kalin, Florida State 
University. 
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Student Papers: 


"Two Telescoping Series," by Jan Figa, University of Central Florida. 
"Fantastic Feminine Figures," by Kimberly Kailing, University of Tampa. 


At the business meeting an award was presented to Dwight B. Goodner of Florida State University 
for outstanding contributions to mathematics and the mathematical community of the Florida Section. 


Southern California Section 


The spring meeting of the Southern California Section was a joint meeting with SIAM, held on 
March 2, 1985 at California State University Northridge, at Northridge, California. There were 103 
registrants. 


Invited Addresses: 


"Computation of Small Scale Phenomena in the Solution of Differential Equations," by Bjorn 
Engquist, University of California at Los Angeles. 

"Statistical Inference for Discovery Process Models," by Louis Gordon, University of Southern Cal- 
ifornia. 

"A Survey of Work on the Bieberbach Conjecture," by Carl Fitzgerald, University of California at 
San Diego. 

"Computational Complexity of Linear Programming," by Kenneth Lane, Colby College. 

"The Effect of Battery Additives on Numerical Analysis," by John Todd, Cal Tech; Magnus Hestenes, 
University of California at Los Angeles. 


Panel Discussion: 


"Mathematics or Computer Science: What Does Industry Really Want?" by Roy Danchick, TRW (Modera- 
tor); Diane Schwartz, CSUN; Stavros Busenberg, Harvey Mudd College; Barney Krinsky, Hughes Air- 
craft Company. 


Northern California Section 


The annual meeting of the Northern California Section was held on February 23, 1985 at Menlo 
College at Atherton, California. There were 224 registered participants. 


Invited Addresses: 


"Unsolved Problems in Convexity: Some Very Easy Questions With Very Hard Answers," by David Bar- 
nette, University of California at Davis. 

"Reminiscences on the Origins of Linear Programming," by George Dantzig, Stanford University. 

"Mathematics, Statistics, and Computers," by Bradley Efron, Stanford University. 

"Cosets, Spinsters, Clusters, and the Schroeder~Bernstein Theorem," by Paul Halmos, University of 
Santa Clara. 

"A Non-Mathematician’s Apology," by Constance Reid. 


Maryland-D.C.-Virginia Section 


The fall meeting of the Maryland-D.C.-Virginia Section was held on November 9-10, 1984 at the 
U.S. Naval Academy in Annapolis, Maryland. 


Invited Addresses: 


"Shakespeare or Bacon? Submarine or Whale?" by Herbert Wilf, University of Pennsylvania. 
"Mathematics and The Art of Escher," by Doris Schattschneider, Moravian College. 


Short Presentations: 


"Applications of Jacobi’s Method for Computing Singular Values," by T. Hoy Booker, Gallaudet Col- 
lege. 

"Computers in the Mathematics Classroon: A Good Idea? Maybe," by Ephraim Salins, Montgomery Col- 
lege. 

"An Extension of Euler’s Theorem," by William Wardlaw, U.S. Naval Academy. 

"TEAM (Teaching Experiential Applied Mathematics)," by John Smith, George Mason University. 

"Percolation in Continuous Systems," by James R. Kirkwood, Sweet Briar College. 

"Teaching a Calculus of Programming," by Harlan D. Mill, IBM and University of Maryland. 

"Inertial Flight Instruments: Mathematics Applied to Flight Training Simulations: A Case Study," 
by Tom Allan, Sperry Corporation. 

"L2 on the Unit Circle: Which Rotationally Invariant Subspaces are Hardy-like?" by Richard B. 
Tucker, Mary Baldwin College. 

"Computer Science Curricula: Where’s the Math?" by Richard W. Dillman, Western Maryland College. 

"Sine, Cosine, and Binomial Coefficients," by Michael Hoffman, U.S. Naval Academy. 

"Ring Theory and Exact Integer Arithmetic," by George Mackiw, Loyola College. 
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"Using the Computer Package SAS in an Undergraduate Course," by Raymond Geremia, Goucher College. 
"A Monte Carlo Simulation Involving Alzheimer’s Disease," by John C. Hennessey, Loyola College. 
"Mathematics Liberally Applied for/by the Non-Mathematics Major," by Sister Helen Christensen, 


Loyola College. 

"Hyperbolic Boundary Value Problems Solved by Special Functions Methods," by Peter McCoy, U.S. 
Naval Academy. 

"Synthesizing the Discrete and Continuous: A Potential First Year College Math Course," by Stephen 
B. Seidman and Michael D. Rice, George Mason University. 

"A Moth to the Flame," by Robert A. Maynard, Tidewater Community College. 

"RMS Without Calculus," by Robert A. Maynard, Tidewater Community College. 

"An Application Course for the Terminal Mathematics Student," by Thomas Sonnabend, Trinity Col- 
lege. 

"Subparticles and Their Mathematical Generation," by Robert A. Herrmann, U.S. Naval Academy. 

"Tax Brackets, Loans and Effective Interest Rates," by John Milcetich, University of the District 
of Columbia. 

"Are the Japanese Really Ahead in Mathematics Education?" by Donald R. Peeples, Mary Washington 
College. 

"Pragmatics, Transactions, and Consensic Logics: Applications in Programming Languages and Artifi- 
cial Intelligence," by John Hays, Naval Research Laboratory. 


Student Papers: 


"Pursuit Games," by Young Lee, Goucher College. 
"Optimum Strategy for a Two Person Poker Game," by Laura Lamb, Goucher College. 
"Guessing a Number with Lying," by Susan Emily Imber, Goucher College. 


Louisiana-Mississippi Section 


The sixty~second annual joint meeting of the Louisiana-Mississippi Section of MAA and the 
Louisiana-Mississippi Branch of NCTM was held on February 15-16, 1985 in Biloxi, Mississippi. The 
University of Southern Mississippi served as host; there were 209 registrants. 


Invited Addresses: 


"Count or Think," by Paul R. Halmos, University of Santa Clara. 
"How to Paint Triangles," by Paul R. Halmos, University of Santa Clara. 


Contributed Papers: 


"Estimation of the Pareto Density Function," by Badiollah R. Asrabadi, Nicholls State University. 

"Hyperbolas of Statistical Confidence," by Margaret W. Maxfield, Louisiana Tech University. 

"Comparing Two Estimators of Reliability for the Gamma Failure Model," by Laurie Meaux and Thomas 
L. Boullion, University of Southwestern Louisiana. 

"Comparing Estimators of Reliability for the Uniform Distribution," by Sharon Navard and Thomnas 
L. Boullion, University of Southwestern Louisiana. 

"Unitary Cycles Less Than Four," by Dale Woods, Central State University, Edmond, Oklahoma. 

"Equivalent Induced Topologies and Semi-topological Properties," by Charles Dorsett, Louisiana 
Tech University. 

"A Characterization of Infinite Dimension for Vector Spaces," by Henry Heatherly, University of 
Southwestern Louisiana. 

"A General Principle for Fixed Points of Contractive Multi-valued Mappings," by Hideaki Kaneko, 
Mississippi State University. 

"Connected Sums on the Hyperbolic Plane," by Margaret M. LaSalle, University of Southwestern 
Louisiana. 

"Constructing Covering Spaces of 3-manifolds," by William Ortmeyer, University of Southwestern 
Louisiana. 

"Automorphism Groups II," by Gary L. Walls, University of Southern Mississippi. 

"A Ring for Quantum Mechanics," by Fred Rolfes, University of Southwestern Louisiana. 

"Periods of Sums of Periodic Functions," by James Caveny, University of Southern Mississippi. 

"Multisolute Network Flow Problems," by J.B. Garner, Louisiana Tech University. 

"A Note on Frobenius Type Solutions of Second Order Linear Differential Equations," by William F. 
Hoyt, Mississippi State University. 

"Sufficient Conditions for Convergence to Zero of Solutions of Second Order Differential Equa- 
tions," by J.R. Graef and P.W. Spikes, Mississippi State University. 

"The House Number Problem and Its Variations," by Joey Paul, Copiah-Lincoln Junior College. 

"Some Remarks on Equations of Matrices," by Cecil E. Robinson, Jr., Univ. of Southern Mississippi. 

"Remarks on an Open Question in LCP," by M.W. Jeter and W.C. Pye, Univ. of Southern Mississippi. 

"Who Takes Science? Characteristics of 342 Students," by Bill Attebery, M. Maxfield, M. Livings- 
ton, and T. Portis, Louisiana Tech University and Grambling State University. 

"Calculators Are Not Obsolete," by N. Jane Carr, McNeese State University. 

* "Consistency," by W. Ed Stephens, McNeese State University. 
* "Ts It Working? A Five-year Evaluation of the Developmental Mathematics Program at McNeese State 

University," by Robert T. Yellott, McNeese State University. 

"Are Perfect Numbers Really Perfect?" by Steve Ligh, University of Southwestern Louisiana. 

"A Necessary and Sufficient Condition for Convergence of a Positive-term Series," by Robert 
Heller, Mississippi State University. 
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Student Papers: 


"Difference Equations and Probability," by Kate Abernethy, Mississippi University for Women. 

"Remarks on the Convergence of TT = 4(1-1/3+1/5-1/7+...)," by Sharon Crook, University of Southern 
Mississippl. 

"The Elastica Problem," by Deanna M. Caveny, University of Southern Mississippi. 

"Bezout’s Theorem," by Susan M. Gibson, University of Southern Mississippi. 

"Disk Storage Efficiency," by Aubrey Griffin, Xavier University of New Orleans. 

"The Method of Curvature," by Donna L. Huch, University of Southern Mississippi. 

"Power Series Solutions of the Lane~Emden Equation," by Randall A. Stevenson, Northeast Louisiana 
University. 

"Improper Integrals with the Greatest Integer Function," by Carolyn Autrey, University of Southern 
Mississippl. 

"A Number-theoretic Result from Abstract Algebra," by Scott J. Beslin, University of Southwestern 
Louisiana. 

"Covering Theorems for the Alternating Group," by Karen Fawcett, Univ. of Southern Mississippi. 

"Some Properties of P-matrices," by Richard Godbold, University of Southern Mississippi. 

"A Generalized Thomas-Fermi Theory of Ionized Atoms," by Yiu Chung Hon, University of Southwestern 
Louisiana. 

"Irrational Numbers Viewed as Continued Fractions," by Lida G. McDowell, University of Southern 
Mississippi. 

"An Elementary Solution to Equations Such as x2 = x (mod pk) and x3 = x (mod pk), by D. Straub, 
University of Southwestern Louisiana. 


Panel Discussions: 


"Everything You’ ve Always Wanted to Know About Challenging Your Bright Students, But Were Afraid 
to Ask," by Burrow Brooks, Delta State University (Moderator); James Porter, University of Mis- 
Sissippi; Duane Blumberg, University of Southwstern Louisiana; Donald Ryan, Northwestern State 
University of Louisiana. 

"Discrete Mathematics: Something New in the First Two Years of Undergraduate Education?" by Jerome 
A. Goldstein, Tulane University (Moderator); R.D. Anderson, Louisiana State University; Bettye 
Anne Case, Florida State University. 


Short Course: 


"Curves and Surfaces--The Building Blocks in Computer Aided Design and Numerical Simulation," by 
Wayne Mastin, Mississippi State University. 


Special Report: 


"Status of the Profession," by Bernard Madison, University of Arkansas. 


Eastern Pennsylvania and Delaware Section 


The annual fall meeting of the Eastern Pennsylvania and Delaware Section was held on November 
17, 1984 at Swarthmore College. There were 163 persons in attendance. 


Invited Addresses: 
"Mordell”s Conjecture: Ideas at the Confluence of Arithmetic and Geometry," by Stephen S. Shatz, 
University of Pennsylvania. 
"Renewing Undergraduate Mathematics," by Lynn A. Steen, St. Olaf College. 
"Solutions of Diophantine Equations and the Class Number Problem of Gauss," by Don Zagier, Univer- 
sity of Maryland. 
"Remediation: A Waste Or a Gold Mine?" by Raymond Coughlin, Temple University. 


Oklahoma-Arkansas Section 


The forty-seventh annual meeting of the Oklahoma-Arkansas Section was held on March 29-30, 1985 
at the University of Tulsa at Tulsa, Oklahoma. There were 148 registrants. 


Invited Addresses: 


"Some Functions That Count," by Gerald L. Alexanderson, University of Santa Clara. 
"What is Geometry: Euclid, Riemann, Weil, or Bing?" by Saunders MacLane, University of Chicago. 


Short Presentations: 


"Need Retraining in Computer Science? I Have an Answer," by Edward N. Mosley, Arkansas College. 

"A Study of the Effect of Math and Science Preparation on ACT Scores," by Darryel Reigh, Univer- 
sity of Science and Arts of Oklahoma. 

"Graphing Absolutely--Taxi and Maxi Geometrics," by Robert DeMille, Holland Hall School. 

"Pythagorean Triples and Trigonometric Identities," by Thomas Peter, University of Arkansas at 
Little Rock. 
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"Optimizing Gauss-Seidel Iteration," by John M. Woods, Oklahoma Baptist University. 

"LOGO for Third and Fourth Graders," by John Watson, Arkansas Tech University. 

"Improbable Probabilities with “‘f and e," by Robert DeMille, Holland Hall School. 

"Sub-surface Electrical Measurements About Plane and Cylindrical Interfaces," by D.J. Boyce and 
Dale Woods, Central State University. 

"A Model for the Bending of a Plastic Trough," by Larry Crawford, Oklahoma State University. 

"A Problem of a Perimeter Estimation for an Elliptic Partial Differential Equation," by Kip Holm, 
University of Oklahoma. 

"On the Relationship Between Levinson Recursion and the R and S Arrays for ARMA Model Identifica- 
tion," by Brenda J. Roberts, University of Tulsa. 

"Radar Assisted Turns for Helicopter," by James Yates, Central State University. 

"Singular Perturbations," by Walter Kelley, University of Oklahoma. 

"Lotka-Volterra Competition Equations: Numerically," by Jeff Thompson, Oklahoma State University. 

"On Inclusions Between Lp(/”) Spaces," by Joe Swartz, Oklahoma State University. 

"Boundedness of Solutions of Difference Equations and Application to Numerical Solution of 
Integral Equations," by Zkzislaw Jackiewicx, University of Arkansas. 

"Zermelo’s First Proof," by Herman Burchard, Oklahoma State University. 

"Positive Integral Operation," by Calvin Piston, John Brown University. 

"Lotka-Volterra Competition Models Revisited--I," by Marvin S. Keener, Oklahoma State University. 

"Lotka-Volterra Competition Models Revisited--II," by Stanley F. Fox and Keith W. Dorschner, 
Oklahoma State University. 

"Population Equilibrium and Stability in Plant-Animal Pollination Systems," by Harrington Wells, 
University of Tulsa. 

"On Groupoids," by Naoki Kimura and Jane Sheu, University of Arkansas. 

"A Problem in Linear Algebra Related to Azumaya’s Exact Rings," by Joel Haack, Oklahoma State U. 

"Lattices of Transformations," by Sinisa Crzenkozic and Michael Dunze, University of Arkansas. 

"Topology of p-adic Fields," by Paul T. Young, Oklahoma State University. 

"A Taste of Differential Topology," by Bobby N. Winters, Oklahoma State University. 

"A Quadratic Transformation on the Plane," by T. Sekiguichi, University of Arkansas. 

"A Topological Proof of the Compactness of the Propositional Calculus," by Michelle Penner, 
Oklahoma State University. 

"C{0,1] Has No Minimal Topology As a Riesz Space," by Daryl Ezzo, University of Tulsa. 

"Stability Analysis of Linear Multistep Methods for Delay Differential Equation," by V.L. Bakke, 
University of Arkansas. 

"Banach Spaces on Which Every Completely Continuous Operator is Compact," by Tommy Leavelle, John 
Brown University. 

"Sharp Weak-type Inequalities for Analytic Functions on the Unit Disc," by Boguslaw Tomaszewski, 
Oklahoma State University. 

"The Moment Problem and C®-scalar Operators," by Ralph de Laubenfels, University of Tulsa. 

"TEAM Materials for the Classroom," by Dan Hansen, Northeastern Oklahoma State University. 

"Use of the MAA Placement Exam at ECU," by Ray Hamlett, East Central State University. 

"James Clerk Maxwell and the Development of Vector Analysis," by Richard Greenhaw, Oklahoma Chris- 
tian College. 

"How Do Electronic Machines Perform Arithmetic?" by Gary V. Millsap, East Central University. 

"Mathematics and Statistics in Medicine and Epidemiology," by Elisa T. Lee, University of Oklahoma 
Health Sciences Center. 

"On Nicholson’s Blowflies and Other Bugs," by Carlos Castillo-Chavez, University of Tulsa. 

"Stability Results for a Discrete Predator-Prey Model," by William Ray, University of Oklahoma. 


Student Papers: 


"Strategic Metal Stockpiling," by Steven Cudd and Lisa Pfannestiel, Oklahoma State University. 

"Mathematics: Agony or Ecstasy," by Tammy Tucker, Hendrix College. 

"Computer Assisted Enrollment," by Donnie J. Glass, University of Science and Arts of Oklahoma. 

"A Characterization of a Class of Stone Algebras by Factorization Lattices," by Scott Roberts, 
Hendrix College. 

"Predator-Prey Systems and Optimal Harvesting," by Gordon Clark, Ryan Cross, and David Patocka, 
Oklahoma State University. 

"Applying Prime Factorizations: Amicable and Perfect Numbers," by James Hart, Hendrix College. 


Rocky Mountain Section 


The sixty-eighth annual meeting of the Rocky Mountain Section was held March 15-16, 1985 on the 
campus of Casper College. 


Invited Addresses: 


"Women Mathematicians: Grace Chisolm Young and Emmy Noether," by Lida Barrett, University of 
Northern Illinois. 

"Bieberbach, Riemann, and the Class of 1990," by Kenneth Gross, University of Wyoming. 

"Topology and Material Science," by Lida Barrett, University of Northern Illinois. 


Short Presentations: 


"A New “Problem Solving” Course," by Ben Roth, University of Wyoming. 


"The Probability of Obtaining All Roots of a Polynomial Equation to be Real (Part 1)," by Hung C. 
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Li, University of Southern Colorado. 

"Getting Beyond the Formulas--Generating Graphical Representations of Fourier Series Solutions for 
the Heat Equation," by Joan Hundhausen, Colorado School of Mines. 

"The Bootstrap with Some Applications," by Martin Hamilton, Montana State University. 

"Calculation of Critical Values for the Studentized Range," by Richard Lund. 

"Probabilistic Design in Engineering," by Leon Borgman, University of Wyoming. 

"Two-sided Distribution-free Tolerance Intervals and Accompanying Sample Size Problems," by Wil- 
liam Guenther, University of Wyoming. 

"Planning, Implementing and Obtaining Results for a Statewide Mathematics Assessment Test in Mon-~ 
tana," by Ken Tiahrt, Montana State University. 

"Results on the Structure of the Polynomial Ring klx]," by Karen Whitehead, South Dakota School of 
Mines & Technology. 

"The Technique of Nonlinear Variation of Constants--Conceptual and Quantitative Results," by 
Michael Lord, Colorado School of Mines. 

"Comments on Two Mathematical Experiences for Non~mathematically Oriented Students, by John 
Hodges, University of Colorado at Boulder. 

"Isomorphisms Can be Non-trivial," by Stephen Bronn, University of Southern Colorado. 

"Quarternion Propagation Anomaly," by Thomas Kelly, Colorado School of Mines. 


Student Papers: 


"Revised Fast Fourier Transforms," by Shane Hubler, Colorado College. 

"An Application for Multiple Regression," by Donna Johnson, South Dakota School of Mines & Tech. 

"Probabilistic Rocket-engine Propellant Sizing," by Lloyd Best, Colorado School of Mines. 

"What Happens to Students Who Start Out with Remedial Math?" by Kim Gehring, South Dakota School 
of Mines & Technology. 

"The Random Walk Program," by Steve Kelly, George Washington High School, Denver, Colorado. 

"A New Technique for Identifying Transfer Function-noise Models," by Paul Anderson, Colorado 
School of Mines. 

"Patterns in Bernoulli Numbers," by Shane Hubler, Colorado College. 

"Mathematics in Hungary," by Louis Kovari, Colorado School of Mines. 


Kansas Section 


The seventieth annual meeting of the Kansas Section was held at Kansas Wesleyan in Saline, Kan-~ 
sas on March 29-30, 1985. There were 80 registrants. 


Invited Addresses: 


"The Reality of Students” Mathematical Behavior," by Alan H. Schoenfeld, University of California 
at Berkeley. 

"Teaching a Problem Solving Course," by Alan H. Schoenfeld, University of California at Berkeley. 

"What High School Seniors Know and How They Learned It," by Curtis McKnight, Univ. of Oklahoma. 


Short Presentations: 


"On the Existence and Nonexistence of Periodic Orbits in a Small Neighborhood of a Homoclinic 
Orbit in R2 and R3," by Mohammad Riazal-Kermani, Fort Hays State University. 

"More on Super Heronian Triangles," by Ruth Meyer, Kansas State University. 

"An Introduction to Strongly Regular Graphs and Association Schemes," by Cecil Andrew Ellard, Kan- 
sas State University. 

"Practical Teaching Suggestions for Improvement of Problem Solving Skills in Mathematics," by 
Carolyn Ehr, Fort Hays State University. 

"Synthetic Division for Non-linear Polynomial Division," by Kim Gattis, Wichita State University. 

"Running Races and Recursion," by Dan Fitzgerald, Kansas Newman College. 

"Calculating Kendall”’s Tau Correlation Coefficient Using Weighted Data by SPSS," by Merillee Car- 
lyle, Wichita State University. 

"Combinatorial Investigations on 0-1 Matrices," by Dale Hignes, Wichita State University. 

"Sober Spaces," by Dian Palenz, Wichita State University. 

"On the Boundary Behavior of Non-parametric Minimal Surfaces in R3," by Kirk Lancaster, Wichita 
State University. 


Student Paper: 


"An Application of Linear Interpolation in Two Variables for Brain Wave Displays," by Douglas P. 
Bogia, Washburn University. 


Panel Discussion: 
"Testing for Mathematics Proficiency," by John Carlson, Emporia State University (Moderator); 


Elton Beougher, Fort Hays State University; Rose Buschman, Garden City High School; Richard 
Driver, Topeka Public Schools; Ted Zerger, Kansas Wesleyan. 
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X1, X5,..., X, of exchangeable r.v’s. Let T be a stopping-time defined on the sequence. 


LEMMA 1. If P(T <n-—k)=1, the random variables (Xrp.,,...,X74,) and (X,...,X,) 
have the same distribution. 
Proof. Let A, be the set of outcomes in R, that defines the event T = f¢; that is 
{T=t} = {(X%,...,X,) €A,}. 
Letting B be a set in R, we have 
PIC Xp a pse-s Xan) © B| 


n—-k 


SPUNK) A, A Xai Xeag) © Bl 


t=] 


m—k 
=) PICX,,...,X%) € 4, A(X, 445---,X,) € B 
t=] 


PI(X, p40) X) e B| = P[(X,...,X,) e BI. 


The result in Example 2 follows from a corresponding lemma for an infinite sequence of 
exchangeable r.v.’s: 


LEMMA 2. For any k, the v.v.’s (X741,...,Xr4,) and (X,..., X,) have the same distribution. 


We can express the results in Lemma 1 and Lemma 2 succinctly as follows: Exchangeable r.v’s 
are invariant in distribution under stopping-time shifts. 

Kallenberg [2] has proved the very interesting result that such invariance characterizes 
exchangeability; an infinite sequence of r.v.’s is exchangeable if and only if it is invariant under 
stopping-time shifts. 


I am indebted to O. Kallenberg for valuable remarks. 
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SIMULTANEOUS COMPLEMENTS IN FINITE-DIMENSIONAL VECTOR SPACES 


N. J. Lorp 
Wolfson College, Oxford, England 


It is very well known that if U is a subspace of the finite-dimensional vector space V, then U 
has a complement in V; that is, there exists a subspace X of V such that the direct sum of U and 
X,U ® X,is V. (See [1], p. 143.) Shghtly less well known is that two subspaces U, W of V, both of 
the same dimension, have a common complement. (A quick proof is by finite induction on the 
codimension of U; the inductive step resting on the observation that if U# W then UU W is 
never a subspace of V ([i], p. 136). Choose x € UU W and use the inductive hypothesis to 
obtain a common complement to span(U, x) and span (W, x).) 

This note arose from the observation that there does not, in general, exist a common 
complement to three subspaces having a common dimension. Thus, for example, if V is a 
2-dimensional vector space over Z,, then V has but three 1-dimensional subspaces; so it is 
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manifestly impossible to find a 1-dimensional subspace complementary to all three. 

However, one is left with a feeling that this counterexample is somewhat “degenerate” in 
nature. (It does raise the interesting combinatorial problem: if VY is an n-dimensional vector space 
over a finite field, and if 1 < k < n — 1, what is the maximum number of distinct k-dimensional 
subspaces of V that have a common complement?) Normally in elementary linear algebra one 
seeks fairly obvious counterexamples in familiar Euclidean spaces: the proof that one cannot find 
a counterexample in this situation turns out to be a pleasant exercise in what might be called the 
“generic method”, blending topology and algebra. Indeed, we are able to prove rather more: in a 
finite-dimensional Euclidean space, given an arbitrary finite number (or even a countably infinite 
number) of subspaces all of the same dimension, there will exist a subspace that is a complement 
to every subspace in the given family. 

For example, in 2-dimensions, this means that given any countable family of lines through the 
origin, there exists a line through the origin distinct from all of them. In this special case, an ad 
hoc argument can, of course, easily be supplied (the set of points on the unit circle which lie on 
the given lines 1s countable; the unit circle is not) which, in addition, suggests that if there exists 
one common complement there will, in some sense, exist many—a statement that will be made 
precise in the course of the proof. In 3-dimensions the result means that, given any countable 
family of lines through the origin, there exists a plane through the origin which contains none of 
them; and that given any countable family of planes through the origin, there exists a line through 
the origin which lies in none of the planes. 


THEOREM. Let V be an n-dimensional vector space over the real numbers IR (or the complex 
numbers), and let {U.:i EN} be a countable family of subspaces of V each having the same 
dimension, k. Then there exists a subspace, X, of V which is a complement in V to all of the 
subspaces in the family. 


Proof. We write out the proof in the real case. 


Let (u, ,,...,U,.,) be a basis for U,, let V"~* denote the (m — k)-fold Cartesian product of V 
with itself, and define f:V"~* > R by: 
fi (0p 5-005 Uy 4) = det (uy yee Up Vises Oey): 
Then f is continuous (for any topology induced by a norm ||: || on V"~*), so that O. = 
f- '(R — {0}) is an open subset of v"~* Tt is nonempty because there exists (v,,...,U,_,) 
extending the basis for U, to a basis for V. 
We claim that O, is dense in V"~*. Let v = (v,,...,0,-,) be in V"* and fix any ¢ in O.. 


Consider p(x) =f,(xv +c) where x €R. Then p is a polynomial function which is not 
identically zero since p(0) = f,(c) # 0 by the choice of ¢. So there exists a positive R such that 
p(x) # 0 for all x > R. Thus xv + ¢, and hence v + x” ’e, is in O, for all x > R. But 


lv —(v + xe) || = x7" |le|| ~ Oas x > 00, 


which establishes the claim. 

The Baire Category Theorem then implies that N°%,O, is dense in V"~* and therefore 
nonempty. Choose any (@,,...,€,-,) in M%2,O,, and, to complete the proof, let X= 
span(é€),...,€,—,): 

Moreover, our preliminary suspicion that, if there were one common complement, there would 
be many, is confirmed in the strong sense that 12,0, (the set of possible choices) is dense 
in V"~*. 
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A SHORT PROOF OF CHEBYSHEW’S THEOREM* 


N. COSTA PEREIRA 
Departamento de Fisica, Faculdade de Ciencias, Universidade de Lisboa, Lisbon, Portugal 


If we define y(x), for x > 0, by W(x) = X,«., log p for all primes p and positive integers a, 
Chebyshev’s classical theorem states that 
WO) 


(1) lim inf vs) <1 and limsup > 


x>0o  =0oX vow x 


The prime number theorem is equivalent to the statement that lim, _,,,(x)/x = 1. Chebyshev’s 
theorem is both weaker and easier to prove (see, e.g., Ingham [3], Hardy [2], or Ellison [1]). The 
purpose of this note 1s to give a direct and especially short proof of Chebyshev’s theorem. We will 
use three elementary results: 


(2) lognt = ) (4), 

k>1 . 
(3) logn! = nlogn + O(n), 
(4) 7 = logn + O(1). 


Let m be a fixed integer greater than one and v an arbitrary integer greater than m. From (2) 
we get the double inequality: 


n 
[n/m | l (=) log n! ; [11 /m | l v( 


H l 
— < < 7 + — 
©) 2 kon n a kon ae koR 
k k k 
If we define L,, L,A,, by: 
L,, = inf vy) , L = sup vO) , A, = sup v(x) 
x2m x x>m x l<x<m x 
it follows at once from (5) that: 
[n/m] [n/m] n 
] 1 1 
L,Y 7< 8 <Ln de Gt Am Lg: 
k=1 k n A=] c k=[n/m]+1 c 


From identities (4) and (5) we then get 
L, (log n — logm + O(1)) < logn + O(1) 
< L7 logn — logm + O(1)) +A,, Clog m + O(1)). 
Hence we have 
_ logn + O(1) 
” “logn — logm + O(1) 
and 
It > logn + O(1) —A,, (log m + O(1)) 
mes logn — logm + O(1) 


Since these inequalities are valid for every n greater than m and X,, 1s finite, we conclude that 


*Supported in part by a grant from Secretaria de Estado da Cultura, Lisbon, Portugal. 


1985] NOTES 495 


~ = int YOO gy and Le = sup YO) 54, 


as m approaches infinity we obtain (1). 
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PLANAR METRIC INEQUALITIES DERIVED FROM THE VANDERMONDE DETERMINANT 


GENG-ZHE CHANG 
Department of Mathematics, University of Science and Technology of China, Hefei, Anhui, 
The People's Republic of China 


This paper presents some new results that illustrate the power of complex numbers in planar 
geometry. 

We begin with some notation. 

Let N denote the set of the first » natural numbers. For k = 1,2,...,n —1, let S be a subset 
(51, 575...,5,) Of N such that 1 <5, <5, < +++ <5, <n. We call S a k-shuffle of N. The 
(n — k)-shuffle (4,,t5,... ) is called the complementary shuffle of S with respect to N and is 
denoted by S” if 


? tak 


(t1,t),...5¢,-JUS=N. 


After proving our main result, Theorem 1, we will give several geometric applications. 


THEOREM 1. [f 2,, Z5,...,2Z, are distinct complex numbers, then fork = 1,2,...,n — 1, we have 


k 
(2,2 ty ty, ) 


(1) ver 
U [] (2,-—2 ,) 


=] /=] 
where the sum is extended over all k-shuffles S = (5,,55,...,5,) and S” = (t,,t5,....t,-,). 
Proof. We use the Vandermonde determinant 


Z Z4 eee Z 


1 
(2) V(N) = . =[][(z,- 4). 
zi} zi} a zn t 
If the Laplace expansion (see, for example, [6], p. 298) is applied to the first k rows of V(N), 
then we obtain 


(3) VIN) = ECP a a VCS) V(S>), 


where the sum is extended over all ("") of the k-shuffles S, |S] =s, + 5, + +--+ +s,, and V(S) 
and V(S~) are the Vandermonde determinants determined by z, , and 2,.2),5-++42, 
respectively. 

Note that z,,z,,...,2Z, are distinct. When V( NV) is replaced by the expression in the right-hand 


Zope Zy 
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side of (2), and V(S) and V(S~ ) by similar expressions, we see clearly that 


(4) MN) _ gy TG, 2). 


V(S)VCS™ )— j=l l= / 


where 
A n-k 


o=sign|] [| (t,—s,). 
j=l l=] 


By an inversion of the sequence (/,, /5,...,/,,), a permutation of N, we mean a number pair 
(j,»Jy) with p < q but j, > j,. Hence o = (—1)":* ?, where 7(S, S~) designates the number of 
inversions of the sequence 


CS, S7) = (51,8250 Sli lasers tyg): 


In the sequence (S,S7 ), (s, — 1) inversions are produced by s,, (s, — 2) inversions are produced 
by 55,...,(s, — &) inversions by s, and no other inversion exists; hence we have 


T(S,S~) =|S|}-(1 +24 --+ +k) 
and (4) becomes 


V(N) 7 7 k n-k 


=(-1)"' (l+2+ TTT T(z, -4,). 


V(S)V(S~) j=l l=1 


Upon dividing both sides of (3) by V(N) and using (5), we find that the identity (1) follows. 
Theorems 2 and 3 show some of the geometric consequences of Theorem 1. 


(5) 


THEOREM 2. Let P,, P,,...,P, ben distinct points in the Euclidean plane and let P be some point 
in the same plane. Then for k = 1,2,...,n — 1, we have 
h 
([PP,|--: IPP, |) 


(6) » i n—-kh “ >i, 
> TY ETP, Pa 


j=l l= 


where the sum is extended over all k-shuffles S = (8,,55,...,5,)s 
between points P, and P,, and S” = (t),t),---5t,—;). 


; 


|P, P,,| denotes the distance 


Proof. Set a complex coordinate system on the plane with point P as the origin. Assume that 
P,, P,,..., P, have complex coordinates z,,z5,...,Z,, respectively. Upon taking the modulus of 
each term in the left-hand side of (1), we obtain 


(7) » k n-k >I, 


which is equivalent to (6) in the present case. 
Equality in (6) may occur, as will be shown in Theorem 3. 
REMARK. Putting k = 1 into (6) we have 

|PP ~ 

|P.P >| 7 


(8) 
r=1 j4r| 

Inequality (8) establishes the Euclidean conjecture of K. B. Stolarsky [10] in the case n = 3; for 

n > 4 this conjecture is still open. In fact, inequality (8) for 4 points was originally established by 
T. Hayashi [5S] in 1913. In 1975 M. S. Klamkin proved it via inversion and the use (see [8]) of polar 
moments of inertia; he later discussed Hayashi’s method and developed it further in [9]. Whether 
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a “one line proof” of Hayashi’s result can be obtained by rearranging the Cayley-Menge1 
determinant remains open: see the discussion in [14], p. 233. 


THEOREM 3. Let P,, P),...,P, denote n distinct points on a circle of unit radius. For each 
k-shuffle S = (81, 55,...,5,) with its complement S~ = (t,,to,...5¢,—;), define 
k n—-k 
(9) a(sy=|h LLP 
j=l i= 

Then 
(10) s- — >1 

5s a(S) 


where the sum is extended over all k-shuffles of N. Equality is attained if and only if the points form a 
regular n-gon. 


Proof. Inequality (10) follows directly from (6) if we put the point P at the center of the circle. 
It remains to consider unger what conditions equality holds in (10). 

Suppose that P,, P,.. form a regular n-gon. Without loss of generality we may assume 
that the complex coordinate CrP is z, =w”, m=1,2,...,n, where 


w = exp(i2a/n) = cos(2a/n) + isin(2a7/n). 
We are going to show that 


(11) Ul rH — 2,2,) 


or equivalently 


n—k 


(12) I} IT ¢ (1 — wi") 


is a positive number. 
Since we have 


Tl (1 — w® 2) LT] @ -— wi) = (1 -— w)(1 - we) ee = wt) = a, 


mF] 
multiplication of (12) and 


(13) TH Ta wr) 


J=1 m4; 
yields n*, a positive number. It is obvious that (13) is equal to 


—S s,—S5 2 
TP] GQa-wr)y= TP [d= wren) f > 0. 


l<em#j<k l<y<meh 
It follows that (11) 1s positive and so is each term in the left-hand side of (1). The positivity of 
each term yields equality in (7) and also in (10). 
Conversely, assume that equality holds in (10) and hence in (7). It is necessary that each term 
in the left-hand side of (1) is a positive real number. Set z, = exp(/@,), 7 = 1,2,...,. We must 
have for each k-shuffle S = (s,,55,...,5,) of N that 


n—-k 


(14) UI il (1 7 exp| i( 8, — @,,) |} 


is a real number. Since 
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1 — exp| i(4, — 0,,)| = —2sin|( 6, — 6,,) /2| exp| i (a + O, - 6,)/2|, 
(14) becomes 


n—k 


rT i sin( 0, — 0)/2). oy iG —k)a+ »(6, - 4) 2}. 


Since the above quantity 1s real, we have 


(15) k(n—k)a+¥(0, — 8.) = 2mm, 
yoé 


where m is an integer. Now 


n—-k 


k 
(n—k) 26 —k 6, 
/=] 


Jié j=l 


138, kD 


j=l /= 


MM 
—~ 
~_— 
| 
as) 
SS 
| 


so it follows from (15) that for any two k-shuffles S and S’ 


k k 
(16) 56 — > 60,=2ma/n. 
pal yd 
Since Z,,Z5,...,Z, are distinct, we conclude by (16) that z,,2z,,...,z,, or equivalently 
P,, P,,...,P, are vertices of a regular n-gon. This proves the result. 
In [15], a result due to H. S. Shapiro was stated without proof as “let P,, P;,..., P, denote n 


points on a circle of unit radius, and set d, equal to the product of the distance from P, to the 
other points. Then 


Hi | 
ua?) 


equality being attained if and only if the points form a regular n-gon.” Clearly Theorem 3 of the 
present paper generalizes Shapiro’s statement. 

For a numerical example take n = 5 and let a and d denote the side and the diagonal of the 
pentagon, respectively. 

For k = 1, by Theorem 3 and also by Shapiro’s statement, we have 


ad? = 5. 
For k = 2, by Theorem 3, we get 


5(a? + d*) =(ad)’. 


REMARK. There are many easy to state but open problems on inequalities among the distances 
determined by » points in d-dimensional Euclidean space. The work of J. B. Kelly on hypermetric 
spaces sheds some light on these problems; see [7] and the references on p. 31. In many of these 
problems (as in Theorem 3 of this paper) the points are constrained to lie on the boundary of a 
disc or surface of a sphere; see Alexander and Stolarsky [2], Stolarsky [11], [13], and Harmon [4]. 
Recently some remarkable progress has been made by J. Beck [3]. For an inequality involving 
points near the unit circle (and involving an expression reminiscent of (11)) see Alexander [1]. 
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COMBINATORIAL PROOKS OF PASCAL’S FORMULA 
FOR SUMS OF POWERS OF THE INTEGERS 


JEROME L. PAUL 
Department of Computer Science, University of Cincinnati, Cincinnati, OH 45221 


Pascal’s formula (cf. [1]) involving the sum of the kth powers of the n successive terms 


a,a+d,...,a + (mn -— 1)d in an arithmetic progression can be written: 
r-l on r . 

(1) S >» (p)ara +(j—-l1)d)° =(a+t+ nd)’ —a’. 
k=0 j=l 


The formula is proved by interchanging the order of summation, using the binomial theorem, and 
noting the resulting telescoping sum (see e.g. [3]). The following related formula can be established 
in a similar manner: 


r-~l opr ; . 
(2) y, » (-1) aa ia’ (a + jd)" =(a+nd)" —a’. 

k=0 7=1 
In [2] a combinatorial proof of (1) was given for the special case a = d= 1. This case is of 
particular interest, since the formula then involves 


I+ 2k +--+ +n* = S,(k). 


The proof consisted in enumerating the points in a hypercube of integer lattice points in 
r-dimensional space via a partition of the cube into r disjoint subsets. In this note a combinatorial 
proof of (2) will be given for the case a = 0, d = 1, a case which also involves the sum S,,(). The 
proof will again consist in enumerating the points in a hypercube of lattice points using 7 subsets, 
but here the subsets will not be disjoint, allowing a direct application of the inclusion-exclusion 
principle. Recall that the inclusion-exclusion principle is a generalization to 7 sets of the familiar 
formula 


|A U B|=|A}+|B\/-|A nr Bl, 


where | X| denotes the number of points in the set X. The principle is stated as formula (4), and 
can be established by a straightforward counting argument (see e.g. [4]). 

Geometric motivation for the particular choice of subsets is readily obtained in dimensions 
r = 2, 3, and the reader is urged to draw the appropriate pictures. A description of how these 
proofs can be extended to (1) and (2) for the case of arbitrary nonnegative integers a and d will 
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also be given. Throughout the following, let C’(m) denote the hypercube of integer lattice points 
in r-dimensional space having m points on a side, 


C’(m) = {(x,,...,x,) EZ’: O<x,<m-1,1<i<r}, 


t 


so that |C’(m)| = m", where Z denotes the integers. 
Setting a = 0,d = 1 into (2), replacing k by r — k, and using (") =(_ t )s this special case 
will be proved in the equivalent form 


(3) 3 (-1"'{pjar $k gee tn) nl 
k=1 


For a fixed 1,1 <i <r, let K, be the subset of C’(n) consisting of those points (x,,...,x,) for 
which x, > x,,1 <j <r, The inclusion-exclusion principle applied to the sets K, yields 


’ | k+1 
(4) U K, = » (-1)|K,,.9-- OK, |, 
1=1 l<ip< ++ <a,<r 
where the summation is over all k-combinations of integers in {1,...,r},k =1,...,r. Note that 
for any choice of k indices1 <i, < +--+ <i, <7, 
n 
_ rah 
IK AK = i, 
j=l 
since if x, = ++: =x, =j—1, then the remaining r— k coordinates can have any of the / 
values between 0 and / — 1. Since there are "| ways to choose the k indices1 <i, < ++: <i, 
< r, and 
UK, =|c'(a)l= 0", 
i=] 


note that (3) follows from (4). 

The proof above is easily generalized to establish (2) in the case of arbitrary nonnegative 
integers a and d. For a = 0,d > 0, the generalization amounts to an “expansion” of each point 
of C’(n) into an equivalence class of points in C’(nd), and when a > 0,C’(1 + n) 1s “expanded” 
to C’(a + nd). To see this, let @ be a step function defined on the nonnegative integers by 


—l, O<x<a-—l, 
J; atjd<x<at+(j+1)d-1,j;>0. 


ols) =| 


—a 
d 
fixed 7,1 <i<_r,let K, be the subset of C’(a + nd) consisting of those points (x,,...,x,) for 
which $(x,) > $(x,), 1 <j <r. Now for any choice of k indices1 <1) < +++ <& <P, it 18 
easily verified in a manner analogous to that above that 


Equivalently, ¢(x) = max {| * | -1}, where [y] denotes the greatest integer < y. For a 


r ” . r-hA 
IK, A AK, |=a’+ di d*(a+ jd)". 
j=l 


An application of (4) to the sets K, then yields 


a’ + ¥ d'(a+ jd)" "| =(a+nd)’, 


j=l 


(5) rene(Z) 


which is equivalent to (2). (Note that ear (- 1") = 1). 
With the aid of the function $, the proof of (1) given in [2] when a = d = 1 can be generalized 
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in a similar manner to establish (1) for arbitrary nonnegative integers a and d. The details are left 
to the reader. 
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A NOTE ON COMPLEX ITERATION 


I. N. BAKER 
Department of Mathematics, Imperial College, London, England 


P. J. RIPPON 
Faculty of Mathematics, The Open University, Milton Keynes, England 


The aim of this note is to describe how a recent result about the sequence 


(1) a,at,a’,..., ae, 


might be included in a course on complex analysis (one that 1s based for instance on [1]). The 
study of the sequence (1) goes back to Euler [5] who showed that, for a > 0, it is convergent if and 
only if 

e’<a<e’. 
To discuss the general case it is convenient to let c be some fixed determination of log a and to 
put 


(2) f(z)=e%, 2 EC, 
so that, if f"*! = feof", n=1,2,..., with f’ =f, then 
(3) fr*(0), an = 1,2)... 
is a well-defined version of (1). 
It was shown by Carlsson [4] that if f"(0) ~ w as n > oo and f"(0) # w, n = 1,2,..., then c 


must lie in the closure of a certain cardioid 
D = {te ':|t}< 1}, 


which is illustrated in Fig. 1. 
To prove Carlsson’s result, note that lim, _,,,f"(0) = w imphes that w = f(w) = e“". Thus if 
t = cw, then w = e’ and so c = te '. To prove that |t| < 1 we use the fact that f’(w) = ce®” = ¢. 
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Fic. 1 
Since f"(0) # w, n= 1,2,..., we have 
ff (O)=w\_ yf £CF"O)) = FOP) ) yy 
(4) sim | f"(0) — W | 7 sim | f"(0) —_ WwW ~f (w) ~ f 


This implies that |¢| < 1, as required. Also, if ¢ is not real, then, according to (4), the convergence 
of f"(0) to w is eventually spiral-like. Some illustrations of the kinds of convergence which occur 
are given in [8]. 

We shall now give a sufficient condition for the convergence of (3). Previous results in this 
direction are surveyed in [7]. 


THEOREM. The sequence (3) is convergent whenever c lies in D. 


This result was proved in [2] using the theory of iteration due to Fatou and Julia (see, for 
instance, [6]). We prove it below using only those parts of this general theory which seem to be 
essential. Note that the case c = 0 can be disposed of immediately. 

If c = te’ with O < |[t| < 1, then w = e’ is a fixed point of f with f’(w) = t. Thus 


lim [A=] = lim pte L) =f'(w) =¢, 


ZW Zw — W 


2-0 W 
and since |¢| < 1 there exist r > 0 and A < 1 such that 


f(z) = 


Z—_ WwW 


<A, O<|z—wi<r. 


We deduce that f"(z) — w uniformly as n > oo for |z — wi <r. 

Now consider the set 2 of those complex numbers in some neighbourhood of which the 
sequence f” — w uniformly as n — oo. To prove the theorem we show that 0 € Q). It is clear that 
Q is open, that w € Q and that 2 is completely invariant under f, by which we mean that 


zEQe f(z) EQ. 


Also, by the Heine-Borel theorem, the sequence f” — w as n > oo uniformly. on any compact 
subset of Q. It follows from this that each component of 22 is simply connected. Indeed if [T° is 
any Jordan curve in Q, then f” — w — 0 uniformly on [T° and so f” — w > 0 uniformly in the 
interior of [’, by the maximum principle. This implies that Q has no “holes’’. 

Let Q,, be the component of 2 which contains w and suppose that 0 ¢ Q,,. We shall show that 
this leads to a contradiction. Since Q,, is simply connected, we can choose in Q, [1, p. 143] a 
single-valued analytic branch g of 


f-*(z) = (log z) /e, 
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FIG. 2 


such that g(w) = w. The complete invariance of Q and the continuity of g then imply that 
g(Q2,) CQ, and we deduce, from Schwarz’s lemma, that 
Ig’(w)| <1. 


Thus | f’(w)| = }g’(w)}>* > 1 and this contradiction completes the proof. 
The form of Schwarz’s lemma used here is as follows. 
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LEMMA. Let g be an analytic function which maps a simply connected domain {,, # C into itself 
with fixed point w & (2. Then 


Ig’Cw) | < 1. 


This can be proved by using the Riemann mapping theorem [1, p. 222] to choose a conformal 
mapping ¢ from {2,, onto the unit disc U, such that ¢(w) = 0. The result follows by applying the 
usual form of Schwarz’s lemma to ¢° g° @~', which maps U to U and fixes 0. 

If a more elementary argument is required, then we can choose, as in the first stage of the proof 
of the Riemann mapping theorem [1, p. 222], an analytic function @ from Q, into U such that 
¢’(w) # 0. The sequence of functions ¢, = ¢° 2", n =1,2,..., 1s then uniformly bounded in Q,, 
and so, by [1, p. 122], the sequence ¢/, is uniformly bounded in some neighbourhood of w. But 


Hn 


¢,(w) = ¥(w)(g'(w))”, 
since g(w) = w and, in view of the fact that ¢’(w) # 0, we deduce that |g’(w)| < 1. 


REMARKS. In [2] we also describe the behaviour of the sequence (3) for most boundary points 
of D. It is convergent if c = te~', where ¢ is an nth root of unity, but divergent almost 
everywhere on 0D. The behaviour of (3) for values of c outside D is described in a forthcoming 
paper [3]. Briefly, in the c-plane there are infinitely many disjoint domains, in each of which the 
sequence (3) forms a number of convergent subsequences. In those domains illustrated in Fig. 2 
the label D, indicates that, for the corresponding numbers c, the sequence (3) has k convergent 
subsequences { f"**/(0)}°_,, 7 = 1,2,...,k, each with a distinct limit. 

The domain D, is the cardioid D discussed above. Each of the domains D,, k > 2, is simply 
connected and unbounded, there is a single D,, but for k > 3 there appear to be infinitely many 
D,. The relative positions of these domains are described in more detail in [3], but there remain a 
number of open questions. For example, we do not know whether the union of all these domains 
D, is dense in C. 
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COMPACTNESS AND CLOSEDNESS IN LOCALLY COMPACT HAUSDORFF SPACES 


WARREN PAGE 
Department of Mathematics, New York City Technical College, Brooklyn, NY 11201 


The purpose of this note is to illuminate a reasoning error in point set topology that is 
seductively easy for experts as well as students to make—namely, the belief that 


Tf Y is a locally compact subset of a locally compact Hausdorff space X, then Y (\ K is 
compact for every compact set K C X. 


(A simple counterexample is the open segment (0,1), which is a locally compact subset of the real 
line, although its intersection with the compact interval [0,1] is not compact.) Since this 
unjustified assumption has, in fact, led to circular reasoning in a number of sources in proving the 
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Pontryagin Duality Theorem (e.g., [1], [3]), it should be instructive to illustrate a few contexts in 
which the above conclusion is and is not warranted. We hope that this note, by calling attention to 
these deficient proofs, will also be of value to all who continue to use and refer to Naimark’s and 
Rudin’s classical texts. 

It will be useful to recall the following characterization of closedness in locally compact 
Hausdorff (LCT, ) spaces. 


THEOREM 1. A subset Y of an LCT, space X is closed in X if and only if Y QO K is compact for 
every compact subset K C X. 


Proof. If x» € Y —Y and u is a relatively compact neighborhood of x, € X, then 
x E(¥Nu)-(YN@). 


Thus, Y \ w 1s not closed and therefore not compact in X. Assume, conversely, that Y is closed in 
X and let K be compact in X. If 


YOKCU(YNO,) 
A 


for X-open sets {O,:a © A}, then 
YOKCU(KOO,). 
A 


Since YM K is compact in K, we have 


YOKCU(KNO,) 
\ 


for some finite subset B C A. In particular, 


YOKCU(YNO,) 
B 


and Y  K is seen to be compact in X¥.O 


Now consider the Pontryagin Duality Theorem for locally compact Hausdorff abelian (LCT, A) 
groups. If X is the second dual group (i.e., the dual group of G’) associated with an LCT, A group 
G, the natural embedding 6: G— X with {86(g)}(x) = x(g) for every x © G’ is an into 
topological group isomorphism. (See, for instance, [1], [2], [3].) What remains is to establish that 
the LCT,A subgroup Y = 6(G) is all of the LCT,A group X. One approach is to first prove that 
Y is closed in X and then show (independently of this) that Y is dense in X. It is in this first step 
where several noted texts go wrong. 


EXAMPLE 1 ((3], p. 29). Suppose x, © Y and let u be an X-relatively compact neighborhood of 
X,. Since Y is LCT,, the set Y \ u is compact in X. Therefore, Y M wu is closed in X, and 


XE YNuU=YNUCY. 
Thus, Y is closed in X. But this italicized assertion is unjustified, and so this proof is untenable. 


EXAMPLE 2 ((1], p. 418). Let Y, and xX, be the one-point compactifications of Y and X, 
respectively. Then, Y,, is also a topological subspace of X,,. Since Y,, is closed in X,,, it follows 
that Y is closed in X. But, although every Y-compact set is X-compact, Y  K cannot be assumed 
compact in Y for an X-compact set K. Thus, although the topology of Y,, is weaker than the 
X,,-induced topology, the reverse inclusion has not been established. The italicized assertion is 
therefore unjustified. 


It is interesting to note how the above deficiencies can be overcome, and the proofs repaired, 
by reordering the final two steps in these erroneous proofs. Implicit in this approach (or any other 
proof of the Pontryagin Duality Theorem) is the realization that topological arguments alone, 
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without using the group’s structure, are not sufficient for proving Y closed in X. The proof that 
Y = 6(G) is closed in X is simple once it has first been established that Y is dense in X. 


THEOREM 2. If Y is a dense subgroup of a T, group X and if Y is LC in the topology induced by 
X, then Y = X. 


Proof. For any y © Y, there is an X-neighborhood v(y)) such that the Y-closure 
Z (= YOY v) of YN v 1s compact in Y, hence compact in X, and therefore closed in YX. 
Now suppose p & vu, and let u be a neighborhood of p. Then uO v is a nonempty neighborhood 
which, because Y is dense in XY, must intersect Y. Thus, uw intersects Y Nv C Z. Since Z is closed 
in X, it follows that p © Z. Thus, v C Z and hence uv C Y. Finally, if xe X= Y, the 
neighborhood vx~' contains some y € Y. Therefore, x € y ‘v. Since Y is a group and v C Y, it 
follows that x © Y.O 


Note (by way of contrast with Example 1) that when Y =X, we are assured that YO w is 
compact. This, of course, can also be proved without requiring that Y be dense in YX. Indeed, 


THEOREM 4. An LCT, subgroup Y of an LCT, group X has the following equivalent properties: 
(a) Y is closed in X, 

(i) Y O K is Y-compact for every compact set K C X, 

(11) Y,, is a topological subspace of X,,. 


Proof. An LCT, subgroup Y is complete and therefore closed in X. Thus, (1) and (Theorem 1) 
its equivalent (11) hold. Moreover, (i1) = (iii) = (4) based on the type of arguments used in 
Example 2. 


The author wishes to thank George Bachman for calling his attention to the incorrect proofs in the literature. 
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etc. 
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A publishable solution must, above all, be correct. Given correctness, elegance and conciseness are preferred. 
The answer to the problem should appear right at the beginning. If your method yields a more general result, so 
much the better. If you discover that a MONTHLY problem has already been solved in the literature, you should of 
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ELEMENTARY PROBLEMS 


Solutions of these Elementary Problems should be mailed in duplicate to Professor D. H. Mugler, Department 
of Mathematics, University of Santa Clara, Santa Clara, CA 95053, by January 31, 1986. Please place the 
solver’s name and mailing address on each (double-spaced) sheet. Include a self-addressed card or label (for 
acknowledgement). 


Correction to E 3090 (May 1985 issue, p. 359). The first part of the problem should read: 


Let f:[0, +00) — (0, 00) be a continuous function. Let h > 0 be given and let g:[0, +00) > R 


be defined by 
_ (_ fi) _ 
Wx) =| Oe i} 


(The remainder of the problem 1s as stated.) 


E 3099. Proposed by Weixuan Li and Edward T. H. Wang, Wilfrid Laurier University, 
Canada. 


Let a, <a, < --: <a, be n nonnegative reals (nm > 2) such that 


n 
» 4,4,41) = 1 (ay,41 — a;). 
i=l 


Determine the minimum value of 17_,4,. 

E 3100. Proposed by C. J. Smyth, James Cook University, Townsville, Australia. 

Given a rational number p/q, show that there are conjugate algebraic numbers, a, a’, with 
ja] # 0,1 and Jal?”“? = |a’|. 

E 3101. Proposed by Charles Vanden Eynden, Illinois State University. 

Let d(n) be the number of positive divisors of n > 0. Show that the number of solutions x to 
d(nx)=nislif n=4, tl if n is the product of ¢ distinct primes (¢ > 0), and infinite otherwise. 

E 3102. Proposed by J. P. Lambert, University of Alaska. 


Prove or disprove: For all integers m,n > 1, 


» (27 - 1)" 
j=l 


Im 2n+1 
a Tmt) 
\2n m 
d (2) 
j=l 
E 3103. Proposed by Douglas B. Tyler, Fullerton, California. 
Show that 
§(2k) =Inz7—1, 


gay k(2k +1) -27* 
where ¢ is the Riemann zeta function. 
E 3104. Proposed by Jay Hook, Florida International University. 
Let A be a closed and bounded subset of R of Lebesgue measure 1. 


(a) Show that A contains two points an integer distance apart. 
(b) Show that (a) becomes false if ““and bounded” is omitted in the definition of A. 
(c)* Does (a) remain true if R is replaced by R”? 
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SOLUTIONS OF ELEMENTARY PROBLEMS 
Ellipses from Circles 


FE 2914 [1981, 763]. Proposed by R. C. Lyness, Southwold, England. 


A circle B lies wholly in the interior of a circle A. S is the set of all circles each of which 
touches B externally and A internally. 

(i) Find the locus of the internal center of similitude of the pairs of circles from S. (ii) Prove 
that every point of the locus, except one, is the i.c.s. of exactly one pair of circles from S. 


Solution by O. P. Lossers, Eindhoven University of Technology, Eindhoven, The Netherlands. 

(i) Let R and r stand for the radii of A (center M) and B (center NV), respectively, and let a, 
and o, be members of the family S. The center and the radius of o, will be denoted by K, and p, 
and the point of contact of o, with A and B by C, and D_ respectively (i = 1,2). Then we have: 


MK, =R-p,, NK,=r+o,. 


Hence K,M + K.N = R +r. The locus of the centers of the circles of S is therefore the ellipse e 
with M and WN as its real foci and R + r as the length of its major axis. The internal center of 
similitude J of o, and o, lies therefore in the interior of e and it may be conjectured that the 
locus of the internal centers of similitude of all pairs of S is in fact the whole interior of e. To 
prove this we observe that C, is the external and D, the internal point of similitude of o, with A 
and B, respectively. Therefore C, and C, as well as D, and D, are collinear with the external 
point of similitude FE of o, and o,. It is well known that o, and o, are interchanged under an 
inversion with center E and a real circle of inversion. If m is the radius of this circle, we have 
EC, X EC, = ED, X ED, = m?. This means that FE has equal powers with respect to 4 and B. 
The point EF is therefore on the radical axis / of A and B. Since J and EF are harmonically 
separated by K, and K,, we infer that FE 1s on the polar line p of J with respect to e. 

(1) Starting from any point J in the interior of e, we can construct its polar line p with respect 
to e. Its intersection FE with / gives us the external point of similitude of two circles of S having 
their centers on JE (note that E may be at infinity), provided that p does not coincide with |, that 
is: provided that I does not coincide with the pole L of | with respect to e. L is the exceptional point 
mentioned in the problem. Any line through L intersects e in two points which are centers of 
circles belonging to S. 


Also solved by J. Dou (Spain), L. Kuipers (Switzerland), W. A. Newcomb, A. Niyenhuis, and the proposer. 
A Triangle Inequality That Didn’t Make It 


FE 2924* [1982, 63]. Proposed by Jack Garfunkel, Flushing, NY. 


Triangle A,A,A, is inscribed in a circle; the medians through A,[A,] meet the circle again at 
M,[M,]. The angle bisectors through A,[A,] meet the circle again at T,[T,]. Prove or disprove: 
JA; M, — A, My) < |A,T, — Azth|. 


Solution by O. G. Ruehr, Michigan Technological University. The conjecture is false, as is shown 
by the following counterexample. Take the circle to be the unit circle centered at the origin and let 
the triangle be the inscribed 30° — 60° right triangle with vertices 


A, =(-1,0), 4, = (53). A, = (1,0). 


We find routinely that 


7, = (5 3) T, = (0, —-1), and M, = { - 
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Then 4,T, = A,T, = V2 + y3 and A.M, = 2. Clearly the segment A, M, has length < 2 since 
it is not a diameter. Thus 


|A,M, — A,M,| > 0 = |A,T, — A,T)}. 
Also solved by L. Bankoff, W. A. Gilbert, H. J. Ludwig, and J. W. Reed. 
Geometric Closure and the Convex Hull 


E 2970 [1982, 698]. Proposed by J. H. Conway, University of Cambridge, and B. Reznick, 
University of Illinois. 


Find all finite sets of points in the plane (P = { p,,...,p,}) with the following “closure” 
property: If the line segments p, p, and p, p, intersect in a single point p, then p € P. 


Solution by B. Richter (student), University of Waterloo, and H. Shank, Waterloo, Ontario, 
Canada. It is easiest to characterize a finite set S = { p,,..., p, } having the closure property in 
terms of the number m of vertices of its convex hull, c(S). 


CHARACTERIZATION. We must have m < 4 and 

(1) if m = 1, then n = 1; 

(2) if m = 2, then the points p, are all collinear; 

(3) if m = 3, then either (2) n = 6 and we have the situation illustrated in Fig. 1; or (ii) there is 
a line L through a vertex of c(S) such that all the non-vertices of S lie on L; or 

(4) if m = 4, then n > 5; a fifth point is the intersection of the diagonals of c(S) and all other 
points of S lie on the same diagonal. | 


Fic. 1 


The proof of this characterization revolves around the following two observations. The first is 
immediate. 


INHERITANCE LAw. Let S be a subset of the plane with the closure property and let T be a 
subset of S. Then Sf\c(T) has the closure property. 


THEOREM. Let S be a finite subset of the plane. Suppose S has the closure property and suppose, 
further, that c(S) has four vertices. Let p € S be on the border of c(S'). Then p is a vertex of c(S). 


Proof. Let v,, v5, v3 and v, be the four vertices of c(.S). Let them be labelled so that p is on 
the side v,v,. Let g be the intersection of the diagonals. (See Fig. 2.) Let Q, be c(S). The 
segment v,p meets v,g in some point, say g,, of S. Note that g, 1s in the interior of QO). Now 
consider the quadrilateral Q, determined by the points p, v,, v, and q,. The point g lies on the 
segment g,v,. Thus we can repeat the argument to find a point q, in S that is also in the interior 
of Q,. Inductively, we can find quadrilaterals Q,, and points g, in S such that gq, 1s a vertex of 
Q,, and g,,, 18 in the interior of Q,,. But this implies that S is not finite. Q.E.D. 


COROLLARY A. If m = 4, then any member of S that is not a vertex lies on a diagonal. 


Proof. (See Fig. 3.) The quadrilateral v, pv,v, and the inheritance law would violate the 
theorem. Q.E.D. 
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FIG. 2 FIG, 3 


COROLLARY B. Jf m = 4, then any two members of S that are not vertices lie on a common 
diagonal. 


Proof. (See Fig. 4.) Consider the quadrilateral p, p,v,v,. The diagonals meet at, say, g,. Now 
414g meets p, p, at, say, p,. Now p, p,v,v, violates the theorem. Q.E.D. 


Corollary B is (4) of the characterization. 


a) U> 


bv, 
Fic. 4 Fic, 5 
COROLLARY C. Jf m = 3, then any two points of S that are not vertices of c(S) lie on a line 
through a vertex. 


Proof. (See Fig. 5.) Let g be the intersection of the diagonals p,v, and p.,v,. Let g, be the 
intersection of v,g and p, p,. Then p, pjv2v; violates the theorem. Q.E.D. 


From Corollary C, one can easily deduce (3). 


COROLLARY D. Jf S is a finite set with the closure property, then c(S) has at most four vertices. 


FIG. 6 


Proof. (See Fig. 6.) Let v,,...,v5 be vertices of c(S). Let p, be the intersection of vv, with 
v,v,. Observe that the quadrilateral v,v,v,v, violates the theorem. Q.E.D. 


The cases (1) and (2) of the characterization are trivial. It is also straightforward to verify that 
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any set of points satisfying any one of (1) through (4) is indeed a finite subset of the plane, with 
the closure property. 


Also solved by W. Janous (Austria), O. P. Lossers (The Netherlands), J. G. Mauldon, V. Pambuccian (Romania), 
R. Patenaude, P. J. Ryan (Canada), A. J. Schwenk, E. Uerhoeff (The Netherlands), and the proposer. 


An Essentially Circular ODE 


E 2971 [1982, 698]. Proposed by Barry L. Zaslove, Northeastern University. 
Find the solution curves of the ODE, 


y” = y'(3y 


412 


_ y my y’) 
Solution by O. P. Lossers, Eindhoven University of Technology, The Netherlands. The differen- 
tial equation may be written as: 
y “(1 4 y”) _ 3y’y’”? 
(1 4 pry? 


where the left-hand member is obviously the derivative of 


yt 


y 
(1+ y"y” 


1.e., of the curvature K of the solution curves. Hence K is constant and the solution curves are the 
straight lines and the circles of the plane, with the exception of the lines x = constant. 


= 0, 


Also solved by 51 other readers and the proposer. 


A Problem Due to Steiner 


E 2992 [1983, 286]. Proposed by Jordi Dou, after a suggestion of the late M. S. Knebelman. 


Find the shape of a contour of length L that encloses the largest possible area and is 
constrained to pass through three given points. 


Comment by M. S. Klamkin, University of Alberta, Canada. This problem is due to Steiner (see 
J. Steiner, Gesammelte Werke, II, pp. 75-91) and has also been treated in my Olympiad Corner 
(Crux Mathematicorum, 6 (1980) 280-282). 


ADVANCED PROBLEMS 
Solutions of these Advanced Problems should be mailed in duplicate to Professor D. H. Mugler, Department 


of Mathematics, University of Santa Clara, Santa Clara, CA 95053, by January 31, 1986. The solver’s full 
post-office address should be on each sheet. 


6499. Proposed by Doug Hensley, Texas A & M University. 


Suppose X,, X,,... are independent random variables all uniformly distributed on [0,1]. Show 
that 


n> co 


lim Pro vem < 7 = 6, 
k=1 


where y is Euler’s constant. 
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6500. Proposed by Eugene M. Luks, University of Oregon, and Michael B. Ward, Bucknell 
University. 


In the first printing of an algebra text, the following problem appeared: 


Let G be a group, and H a normal subgroup of G. Prove: If every element of G/H has a 
square root, and every element of H has a square root, then every element of G has a square root. 


The statement is true if G is abelian or finite. Is it true if G/H and H are abelian? Is it true if 
G is torsion? Is it true in general? 


Do you have a problem? Submit it to the Advanced Problems Section. This section is running 
short of challenging and attractive problems. Advanced problems should have reasonably wide 
appeal and have non-tedious solutions. They may involve results unfamiliar to senior under- 
graduate students, but problems that are too technical or too specialized or whose solutions 
require arcane, little-known results are not in general suitable. 


SOLUTIONS OF ADVANCED PROBLEMS 


A Metric Space Having a Given Group of Isometries 


6398 [1982, 503; 1984, 146]. Proposed by Edgar Feldman and Michael Vulis, CUNY Graduate 
Center, New York. 


Is it true that for any group G of cardinality less than or equal to that of the continuum, one 
can construct a metric space M such that G is its full group of isometries? *What if |G] has 
greater power? 


Solution by Randall Dougherty (graduate student), University of California, Berkeley. Such a 
metric space M exists no matter how large G is. 

Given G, choose a one-to-one function F mapping G into some limit ordinal A. Let z be any 
object which is not in G. Choose (G U {z}) XA as the underlying set for the metric space M. 
Define the distance function d for M as follows: if a # B, let d((z, a),(z,B)) =1.1; if aE G 
and a < B, let d((a,a),(z,B)) = 1.2; if ae G and a> B, let d((a, a),(z, B)) = 1.3; if ae G 
and a # B, let d((a,a),(a,B)) =1.4; if a,beE G, a#b, and F(a 'b)=<a, let d((a, a), 
(b,a + 1)) = 1.5; if d(p, q) has not yet been determined, let d( p,q) = 1.6. 

First note that the set {(z,a): a © A} is distinguishable in M (using the 1.1 distances), and 
hence fixed under all isometries of M. Now, a point in M has the form (a,0) with a € G if and 
only if it is at distance 1.2 from all points (z, a); the point (z,0) is the only point in M at distance 
1.3 from all points in © other than those of the form (z, a) or (a, 0); the points (a,1), a © G, are 
just those points at distance 1.3 from (z, 0) but at distance 1.2 from all other points (z, a); and so 
on. In general, for any fixed B < A, the points (a, B), a © G, are just those points at distance 1.3 
from all (z, a) with a < B, but at distance 1.2 from all other points (z, a); and (z, 8) is the unique 
point not in {(z, a): a < B} which is at distance 1.3 from all points of the form (a, a) with a € G 
and a > B. Therefore, by transfinite induction, for each B < AX, the set {(a, 8B): a © G} and the 
point (z, 8) are fixed under all isometries of MM. 

Using the 1.4 distances, we now see that for any isometry i of M, there is a permutation 7 of G 
such that i((a, a)) = (aa, a) and i((z, «)) = (z, a) for all a € G and a < X. Let g = ze, where e 
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is the identity element of G; then ze = ge. Given any x € G other than e, let a = F(x). Then 
d((e,a),(x,a + 1))=1.5, so d((me, a),(7x,a + 1))=1.5, so F((we) '7x) = a; since F is 
one-to-one, x = (ze) ‘wx = g ‘ax, so mx = gx. Conversely, given any g € G, it is easy to see 
that the permutation 7 defined by 7x = gx induces an isometry of M. Therefore, M has G as its 
full group of isometries. 


Also solved by M. J. Pelling (England). A solution by the proposers to the first part of the problem (i.e., with G 


having cardinality less than or equal to that of the continuum) was published in this MONTHLY, 91 (1984) 146, 
#6398. 


Two Series 


6450 [1984, 59-60]. Proposed by J. O. Shallit, University of California, Berkeley. 
(a) Show that 

k-—nlk/n} _ log n, 
ko, k(kK +1) 


where n is a positive integer. 
(b) Let S(1, k) denote the sum of the digits of & when expressed in base n. Evaluate 


y S(n,k) 


way K(K +1) 
Composite solution. 
(a) ‘~ k—n[k/n] _ se 1 _— ey j 
k=1 k(k + 1) k=1 k+1 j=l k=jn k(k + 1) 
nN—\ N-1 
1 1 1 
SE 
fa, kK +1 VAM (it dn 
‘~ 1 N-1 
= ——. > logn asN-> oo. 
kel k+1 ja J + ] 
(b) If we write 
k= Yen, 0<¢<n, 
i=0 
then 
[An/] = Leen’, 
i=] 
so that 


{| 
Ms 
O 
M- 

se 

J 

| 
M 
NS 
= 
3 

| 
— 

we” 


Y [kn 


1 
bk —_ S(n,k)). 
It follows that 


S(n,k) =k —(n—1) E [kn] 
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=(n-1) y n/(k—n/[kn~]), 


j=l 
and consequently, by (a), that 


— S(n,k) = —~ k—n’/[kn’] 
2 ~Dhe anes k(k +1) 


=(n-1) y n~/ log n/ 


j=l 


= ” jog n 
n—-1 028” 


Solved by the proposer and forty-four others. 


A Function Satisfying Certain Conditions 


6451 [1984, 144]. Proposed by C.L. Mallows, Bell Laboratories, Murray Hill, NJ. 


Construct a function f on [0,0o) that satisfies f(0)=0, f) =1, f(x) =f — x) for 
O< x < 2, f(x) = 2f(2x) for x > 0, and f(x) f 0 as x > oo. 


Solution by Thager Bang, University of Copenhagen, Copenhagen, Denmark. We shall prove 
that there exists a unique continuous function f on (0, 00) satisfying f(1) = 1, f(x) = f(2 — x) 
for 0 < x < 2, and f’(x) =2/f(2x) for x > 0. The other conditions are redundant, for if f 
satisfies the above conditions, then by induction, _ 


f(x) =(-1)"f(2"*! — x) forn =0,1,2,...,0< x < 277}, 


and consequently the range of |/f(x)| on [m,m + 1] is independent of the integer m; it follows 
that f(x) ~ 0 as x > oo and, by continuity, that f(0) = f(2) = 0. In addition, such a function 
satisfies 


f(x) = fC s(d) at forx > 0 
0 
and, since 


2f(2x) = 2f(2— 2x) = f'(x) =f" — x), 


it also satisfies f(x) + f —x)=1for0<x<l1. 
Now consider a function g such that 


(1) g iscontinuous on [0,1] and ¢(0) = 0, 

(2) g(x) +e(1-—x)=1 for0<x <1. 

This function can be uniquely extended to a function on [0, 00) satisfying 

(3) g(x) =(-1)"2(2"*'-— x) forO0<x<2""', n=0,1,2,.... 


We then also have 2" (2) dt = 0 for n = 0,1,2,.... 

Denote by S the space of functions g on [0, oo) satisfying (1), (2) and (3). Then S is a closed 
subset of the Banach space of bounded continuous functions on [0, oo) with the uniform norm. 
Given any g € S, define 


g*(x) = [sl t)dt forx >0. 
0 


Then ¢*(0) = 0 and, forO < x <1, 
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g*(x) + gl — x) =2f g(r) dr = 1. 
0 
Also, for0 < x < 2”t!,n =0,1,2,..., 
2x n [2"*2-2x n n 
(4) g(x) = fo°g(t) de =(-1)" [7 g(2) dt = (-1)"g*(2"*? = x). 
0 0 
Hence g* € S. Moreover, since g*(0) = 0 = g(0) and g*(1/2) = 1/2 = g2(1/2), it follows from 


(4) that, for n = 0,1,2,..., ¢*(n/2) = g(n/2) = c,, where c, is independent of g. 
Suppose now that g,, g, © S. Then, for x > 0, 


2X 
git(x) — g(x) = f(a) - g(a) at, 
where n is the integer nearest to 2x. Hence, for the uniform norm, 


1 
ler — gill< Zl181 — Ball 


and so the *-operation is a contraction. It follows that there is a unique f € S such that f* = f 
and this f has the required properties. 


9 


Also solved by Patrick J. Fitzsimmons, Tomas P. Schonbek, V. Daniel Zurkowski, and the proposer. 


A Sequence of Random Variables 


6452 [1984, 144]. Proposed by Giovanni Emmanuele and Alfonso Villani, Catania, Italy. 


Construct a sequence of nonnegative random variables {€,}n.>1 such that sup, .,E&? < oo for 
all p > 0, and P(sup, 31h, < 00) = 0 whenever 


Len, <ng< c+ <n<.... 


Solution by Lajos Takacs, Case Western Reserve University, Cleveland, Ohio. Let 
£,&5,...5&,,... be a Sequence of mutually independent random variables for which 
P{é,<x}=l-e™* ifx>0. 
Then 


E(éP} = fo x?e* dx =T(p +1) < & 
0 
for all p > 0. In this case 


Pi max £, < x} =(1-—e*)* 


l<j<k 7 


for x > Oand k = 1,2,... . If we let k — oo, then by the continuity theorem for probabilities we 
obtain that 


P{ supé,, < x| =0 forall x > 0. 


j>l 


If we let x — oo, then again by the continuity theorem for probabilities we obtain that 


P/ sup §,,_ < co} = 0. 


jel 


Also solved by A. N. Al-Hussaini & J. R. McGregor (Canada), L. E. Clarke (England), Aleyandro deAcosta, 
Victor Hernandez (Spain), Ellen Hertz, Ignacy Icchak Kotlarski, O. P. Lossers (Netherlands) and the proposers. 
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An Exponential Identity 
6453 [1984, 144]. Proposed by Richard Askey, University of Wisconsin, Madison. 


Show that, for any real x, 


Ax? 
(n+r/) 


Composite solution. For a > 0, A > 0 we have 


lim I] 


A>oo n=0 


1+ 


0 dt — 1 1 00 dt 
i l+a < l+a < l+a +f l+a 
0 (t+A) n=0 (n +A) x 0 (r+A) 
and hence 
lim y a =] 
A 00 n=0 (n + d) “ 
It follows that if z 1s any complex number and e > 0, then 
= eXz — eXz ETA EZ? 
» log =| ~ l+e + of | 
n=0 (n+ A) n=0 (n+Ad) n=0 (n+A) 


—>z asAX->- oo 


1+ 


and consequently 


lim I] 


A> co n=0 


1 + xe = 2” 
(n+r)'** , 


This is more general than the identity in the problem. 
Solved by the proposer and fifty others. 


Evaluation of an Integral 


6454 [1984, 205]. Proposed by the Chico Problem Group, California State University, Chico. 


Evaluate the integral 


I(p) = [°° Otan?(8) 48, 2<p<l, 


in terms of the digamma function. 


Solution by O. P. Lossers, Eindhoven University of Technology, Eindhoven, Netherlands. The 
substitution tan# = x yields 


I(p)= i. tan” x dx. 


0 14+ x? 
Let 


oe) xP _ 
J(a) -| , 2 tan "(ax) dx. 


Then, for —2 < p < 0, 


oo xP *! dx 1 | 0 Pt! 0g? xP tt | 
J'(a) = ae SS —— dx — ———— dx}. 
(4) J, (1+ x*)(l+a*x*)  2(1- a7’) J, J, 
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Substituting x = Vt and x = yt /a, respectively, in the last two integrals yields, for —2 < p < 0, 
l—-—a?’ ;o ze/ —P 
I'(a) = a W a 1 


dt = — FF 
2(1 _ a’) o lt+t 2 sin( p7/2) 1 — qa? 
(see formula 3.241.2 in [1]). Hence, for —2 < p < 0, 


1¢ ?P—] 
I( p) = J(1) = ——— fo =a 
(p) (1) Yin mp) 4 1— 7? 
~(p+l)/2 _ 4-1/2 
_ , vi [+ u du 
4sin( ap/2) Yo l-—u 


= Gases (3) --9")). 


where yW is the digamma function (see formula 3.231.5 in [1]). By analytic continuation the 
identity in fact holds for —2 < p < 1 with 


I(0) = lim 1( p) = 77/8. 


Reference 
1. I. S. Gradshteyn and I. M. Ryzhik, Table of Integrals, Series and Products, Academic Press, 1965. 
Also solved by Paul F. Byrd, M. L. Glasser, A. A. Jagers (Netherlands), Frederic R. Schwab, and the proposers. 


REVIEWS 


EDITED BY ALLAN L. EDMONDS AND JOHN H. EWING 
COLLABORATING EDITOR FOR FILMS: SEYMOUR SCHUSTER 


Linear Algebra and Group Representations, Volumes 1 and 2. By Ronald Shaw. Academic Press, 
New York, 1982. Vol. 1, xi + 269 pp., $32.00. Vol. 2, ix + 270-579 pp., $35.00. 


GERALD J. JANUSZ 
Department of Mathematics, University of Illinois, Urbana, TL 61801 


The representation theory of groups 1s studied because an abstract group, initially defined by a 
set of axioms, takes on a concrete existence when its elements are described as linear transforma- 
tions on a vector space over some field. The structure implied by the group axioms is intertwined 
with the structure of the vector space and all its related objects to give insight into properties of 
groups of linear transformations and of abstract groups in general. What could be more natural 
than to study the subject of linear (and multilinear) algebra simultaneously with the subject of 
representations of abstract groups as groups of linear transformations? In this book (I regard the 
two volumes as one book) the rich subject of linear algebra is laid before the reader not only as a 
subject worthy of study in its own right but also with the goal of providing a framework in which 
the basic results about representations of abstract groups can be efficiently proved. The abstract 
groups involved are not always assumed to be finite, but the emphasis is placed upon algebraic 
results rather than topological, analytic, or arithmetic results. 

What is the structure possessed by a vector space and what are the related objects? The 
beginning student might be overwhelmed by Professor Shaw’s reply: dual spaces, covariant and 
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contravariant tensors, the tensor, symmetric, and exterior algebras, dyads, projections, the various 
types of bilinear forms, hermitian forms, isometries, etc., etc. However, the determined and 
questioning student will be rewarded by a study of linear algebra firstly because the ideas of this 
subject are in the toolbox of every theoretical scientist, and secondly, because of the intrinsic 
elegance of the subject. Shaw gives a highly polished treatment using some ideas not commonly 
seen in other texts. The systematic use of dyads (rank one linear transformations) for example, 
permits convenient descriptions of projection maps and decompositions of the identity map. The 
elegance of this treatment will be apparent to the reader who already knows the subject but, 
unfortunately, may be lost upon (and even confusing to) the beginning student. The presentation 
assumes considerable sophistication on the part of the reader. In somewhat less than 350 pages of 
material on linear algebra, Shaw covers far more material than other texts of comparable length. 
This is achieved by leaving many important results to the exercises (with ample hints). In return 
for this price, the reader is treated to examples and illustrations drawn from Shaw’s background in 
mathematical physics. 

Consider, for example, the Minkowski space; this is a four-dimensional vector space M 
equipped with a bilinear form with signature (1, 3). The associated quadratic form g = x? — y? — 
z* — w’ is not definite and so one seldom finds a detailed discussion of it in standard texts. In this 
text it is treated in great detail, as is its group of isometries, the Lorentz group, because of the 
connections with relativity and quantum mechanics. 

One of the tools used throughout the text is the tensor algebra and the closely related 
symmetric and exterior algebras. For a vector space V over a field F, the tensor algebra T(V) is 
the graded F-algebra which has the r-fold tensor product of V with itself as the rth homogeneous 
component. From it one obtains the symmetric algebra as a homomorphic image (essentially a 
polynomial ring) and the exterior algebra, a finite dimensional algebra having a distinguished 
one-dimensional subspace—an ideal even. These algebras are functorial in V; thus a linear 
transformation on V induces an automorphism on each of the algebras and moreover, the grading 
of each is preserved. This gives, for example, an action of GL(V) upon the distinguished 
one-dimensional ideal of the exterior algebra by which one defines the determinant map. One sees 
universal objects and commutative diagrams—a very modern approach to this very old subject. 

The remaining 200 pages or so are devoted to various results about representations of finite 
and infinite groups. There is an interesting mixture of subject matter and of style in these pages. 
One finds many standard and basic results about representations of finite groups often presented 
in a way that does not use finiteness. But there is much more! Here are two examples of not so 
standard topics. 

Let D denote the natural representation of the special linear group, G = SL(V), of a 
two-dimensional complex vector space V. If { x, y} is a basis for V, then the nth symmetric power 
S,, of V may be viewed as the space of homogeneous polynomials of degree n in the variables x 
and y. The representation of G on S, is denoted by D”. (This notation differs slightly from 
Shaw’s.) We have thereby constructed an infinite list of irreducible representations of G. Now 
consider the tensor product, D” @ D”, of two of these representations. How does it decompose? 
The solution to this problem is elegant. As a special case of a more general result, one knows that 
the vector space of G-homomorphisms from S, into S,, ® S, has the same dimension as the space 
of G-invariant elements in S, ®@ S,, ® S,. Next one finds such a nonzero invariant element for 
fixed m,n and all k on a certain interval. This shows that the corresponding D* are constituents 
of the tensor product D” ® D”. By an act of God, these D* have different degrees which sum to 
the degree of D” ® D” and so the decomposition is accomplished. The details are highly 
nontrivial and make heavy use of algebraic computations with dual bases. The study of these 
invariant elements goes back to Clebsch in 1872 and is in sharp contrast to the modern ideas 
mentioned above. 

Another pretty result is the isomorphism between the orthogonal group of positive definite 
three-dimensional form over the complex field and the quotient of the two-dimensional group of 
isometries of an alternating form modulo its center, the projective symplectic group. This 
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well-known isomorphism is efficiently proved by making use of the analysis of the Minkowski 
space mentioned above. 

One of the most attractive features of complex representation theory of finite groups is the 
theory of characters along with the arithmetic results which can be obtained by exploiting the fact 
that the values of the characters on group elements are algebraic integers. One such result is the 
fact that the degrees of the irreducible characters divide the group order. It is unfortunate that 
Shaw does not give more than token attention to characters and does not mention any arithmetic 
results such as this one. In fairness it must be said that Shaw’s goals do not suffer seriously from 
these omissions. 

I am very pleased with this book, especially with the linear algebra portions. It provides an 
alternative approach to many basic results in the subject where new insights are not easily 
acquired. The applications motivated by the interests of a mathematical physicist provide new 
insights to one educated in the traditional approach. The author states that the book is intended 
for advanced students. I would go a bit farther and state that the book is excellent for the student 
who 1s already quite knowledgeable in the subject and desires additional insights and applications. 


Riemannian Geometry. By Wilhelm Klingenberg. Walter de Gruyter, Berlin, Germany, 1982. 
x + 396 pp. 


H. Wu 
Department of Mathematics, University of California, Berkeley, CA 94720 


Writing a readable introduction to differential geometry is among the most difficult feats in the 
art of mathematical exposition. Such an introduction should ideally fulfill the dual purpose of 
preparing beginners for most contingencies in their march to the frontier of research, and at the 
same time educating them in the ways of geometric life without ramming facts down their throats. 

Now, since any mathematical work exceeding 1000 pages (perhaps even less) automatically 
invites suspicion, there would also be a natural upper bound on its size. This being the case, the 
two preceding boundary conditions may well be inherently incompatible. At the very least, no one 
has yet succeeded in simultaneously achieving these two objectives and, given the uncontrolled 
growth of the field in the past decade, the chance of anyone succeeding in the future would seem 
even more remote. 

The eminently quotable Michael Spivak, as one who has thought deeply about such things, 
explains the predicament facing any author in geometry: 


“For many years I have wanted to write the Great American Differential Geometry book. 
Today a dilemma confronts anyone intent on penetrating the mysteries of differential 
geometry. On the one hand, one can consult numerous classical treatments of the subject in 
an attempt to form some idea how the concepts within it developed. Unfortunately, a 
modern mathematical education tends to make classical mathematical works inaccessible. 
On the other hand, one can now find texts as modern in spirit, and as clean in exposition, as 
Bourbaki’s algebra. But a thorough study of these books usually leaves one unprepared to 
consult classical works, and entirely ignorant of the relationship between elegant modern 
constructions and their classical counterparts. Most students eventually find that this 
ignorance of the roots of the subject has its price—no one denies that modern definitions are 
clear, elegant, and precise; it is just that it’s impossible to comprehend how anyone ever 
thought of them. And even after one does master a modern treatment of differential 
geometry, other modern treatments often appear to be about totally different subjects.” 
({12], Volume I, p.1.) 
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On this pessimistic note then, let us turn to a brief examination of some of the existing works 
of this genre in order to put the volume under review in perspective. 

The Spivak opus [12] is a five-volume work totalling 2643 typed pages (which would come to 
about 1000 printed pages). Their great length notwithstanding, these books most emphatically are 
introductory texts. As an example, Spivak’s attitude towards curvature is that... there is no point 
in introducing the curvature tensor without explaining how it was invented and what it has to do 
with curvature” (p.i. in Volume I of [12]). The background he assumed of the reader is minimal: 
advanced calculus, linear algebra and metric spaces. But because of this he had to pay a price: out 
of the 635 typed pages of Volume I, 546 are devoted to such preparatory material as differentiable 
manifolds, tensor fields, Lie groups, algebraic topology... . I point this out not as a criticism, far 
from it, for I know of no other author willing to perform this valuable service for the beginner and 
who still goes on to write about substantive geometry. In addition, and I believe this is Spivak’s 
great contribution to book-writing, the entire Volume II (403 typed pages) is devoted to discussing 
not only the contributions of Gauss and Riemann, but also how they tie in with such latter-day 
constructs as moving frames and principal fibre bundles. This material is available nowhere else. 
The remaining three volumes (1605 typed pages) are concerned with more advanced topics which 
culminate in characteristic classes and the Gauss-Bonnet theorem in n-dimensions. There is the 
same clarity and insight all the way to the end. So with all these going for it, why then is Spivak 
[12] not THE book in differential geometry? The reason is that Spivak, being ever so solicitous of 
the reader’s learning process, would not allow himself to cram the maximal amount of material 
into a fixed number of pages purely for the sake of cramming. Thus, long as it is, a few topics of 
vital importance have been left out of [12], and among these are: homogeneous and symmetric 
spaces, and Hermitian and Kahler manifolds. 

By contrast, these and more can be found in the two-volume work of Kobayashi-Nomizu [8]. 
This is the work that convincingly demonstrates what efficiency and expertise can accomplish in 
777 printed pages. Minor carping from some quarters aside, this is THE standard reference work 
of the subject. By the same token, anyone who thrusts these two volumes on a beginner for use as 
an introductory text would be guilty of committing an act of inhumanity against a fellow being. In 
the terse 62-page Chapter 1, for example, the prerequisites for reading this work are relentlessly 
itemized (with some proofs): differentiable manifolds, tensor fields, Lie groups and fibre bundles. 
Of course the official line is that anyone not acquainted with this material can pick it up from 
there, but let no one be so deceived! Moreover, given the reference nature of the work, there is no 
room in this kind of tightly organized writing for such activities as giving motivation or tracing the 
historical evolution of a concept. (After all, does anyone expect the Oxford English Dictionary to 
be a treatise on etymology?) But when all is said and done, this work is the one place where one 
can find a logical and systematic development of differential geometry ab initio (a la Bourbaki) 
which does lead to the frontier of research (at the time of its writing) in a great many areas. For 
that alone we have to be thankful. 

At this point, I hope the reader may be more receptive to the idea of seeking instead a 
readable, compact introduction to a specific area in differential geometry. Since Riemannian 
geometry is at the core of differential geometry no matter how the latter is defined, it comes as no 
surprise that indeed several attempts have been made to write textbooks on precisely this topic. 
For the purpose of this review, it suffices to narrow our scope even further. We will be discussing 
synthetic Riemannian geometry, which may be loosely defined as the area where theorems are 
proved by predominantly geometric arguments, with “bare hands” so to speak, and where hard 
analysis intrudes but slightly. A typical theorem here is the sphere theorem (Rauch, Berger, 
Klingenberg), which states that a compact simply-connected Riemannian manifold whose sec- 
tional curvature K satisfies 1/4 < K < 1 is homeomorphic to the sphere. The proof involves 
nothing more than calculus and point-set topology, plus a generous supply of ingenuity and 
geometric insight. 

Now it so happens that there are already two classic expositions of synthetic Riemannian 
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geometry: Gromoll-Klingenberg-Meyer [3] and Cheeger-Ebin [1]. I shall refer to these as GKM 
and CE respectively. Starting with minimal prerequisites (for GKM, the same as Spivak; for CE, 
the first half of GKM, say), they lead the reader straight to the deepest available theorems in 
synthetic Riemannian geometry at the time of their respective publications: for GKM, this would 
mean the sphere theorem, and for CE, the differentiable version of the sphere theorem plus the 
structure theorems of noncompact manifolds of nonnegative curvature and compact manifolds of 
nonpositive curvature. By design, neither book deals with such topics as connections on fibre 
bundles, differential forms and characteristic classes, or Kahler metrics. In GKM, historical 
references are sprinkled throughout the text but are nowhere pursued as vigorously as in Spivak; 
CE is even more reticent in such matters. However, the expository style of both books possesses 
that rare combination of clarity, incisiveness, and power. Within the confines of their avowed 
goals, both are unqualified successes. 

I may add that the simple beautiful German of GKM is much easier to read than quite a few 
English mathematical texts that have come my way. Thus the linguistic problem is hardly a serious 
one. 

And finally, we come to the volume under review (all 396 densely-printed pages of it). It 
represents the author’s second thoughts on GKM, so to speak. What he succeeded in doing is to 
subsume all of GKM as a proper subset of the first two chapters of this volume (exactly 
two-thirds in length), and to devote the third and final chapter to a systematic account of the 
structure of the geodesic flow. The prerequisites are the same as Spivak or GKM (with occasional 
exceptions). Unlike any of the above-mentioned works, this book begins with the definition of 
manifolds modelled on Hilbert space. This makes possible a lengthy discourse on the Hilbert 
manifold of curves, a topic that is appearing in an elementary text on Riemannian geometry for 
the first time. 

The style of this book is noticeably different from that of GKM: more chatty, the notation at 
times more formidable, and, surprisingly, in some places overtaxing the reader’s perseverance and 
good will. But whatever the loss vis-a-vis the virtues of GKM (inevitable in any such expansion, be 
it a restaurant or a book), there is ample compensation in being able to include some delectable 
topics from the theory of geodesic flows. Here one finds a proof of the theorem of Lyusternik- 
Schnirelman (a Riemannian manifold homeomorphic to the 2-sphere must possess three simple 
closed geodesics), an account of Anosov’s work on manifolds of negative curvature, and a long 
detailed discussion of closed geodesics on compact surfaces in general and on ellipsoids in 
particular. There is also a proof of a beautiful theorem of E. Hopf: Any metric without conjugate 
points on the 2-torus must be flat. A generalization of this to higher dimensions would be most 
interesting. 

In a general review of this nature, any technical discussion would be out of place. Yet I 
consider the omission of the definitions of the Ricci and scalar curvatures in such an introductory 
text to be serious enough to warrant a breach of protocol on my part. This is serious because, not 
only are these curvatures among the most basic objects in Riemannian geometry, but also very 
great progress has just been made in our understanding of (specifically) these two curvatures in 
the past few years (see, for example, [4]—[6], [9]-[11] and [13]). There is no excuse for not letting 
the reader in on something of such great current interest. The omission of Ricci curvature hampers 
the author’s exposition as well: the Bonnet-Myers theorem (which is incidentally left out of the 
Index but which appears on p. 205) is now stated in terms of the sectional curvature rather than 
the Ricci curvature. One objects to this not because of the obvious loss of generality but rather on 
conceptual grounds. Perhaps this omission also accounts for the author’s neglect to mention on p. 
229 that the Maximal Diameter Rigidity Theorem of Toponogov has been generalized by S. Y. 
Cheng [2] to a statement concerning Ricci curvature; elsewhere, the author is very conscientious 
about pointing out the latest advances concerning some of the theorems in the book. (On this 
same subject, mention should be made of some curious omissions and errors in the references to 
the literature on pp. 239-40 and on p. 254.) 
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To return to the first topic of this review, it is unlikely, as of 1984, to see a textbook 
introducing the student to all or even most of the topics of current interest in differential 
geometry. The field has simply grown too big (see the introduction to Volume III of [12], which 
was written back in 1975!). Even if one starts where [8] leaves off, would it be reasonable to expect 
an author to write about minimal submanifolds, harmonic maps, spectra of compact manifolds, 
Yang-Mills fields, function theory on non-compact manifolds, holomorphic mappings, holomor- 
phic vector bundles over complex projective spaces, applications of the Monge-Ampére equations, 
etc., all within the guidelines spelled out at the beginning of this review? The best one can hope for 
is perhaps a systematic and concise account of a given topic. In this sense, the present volume, 
together with the author’s other book [7], fill a gap in the area of geodesic flows and closed 
geodesics. In the author’s words: “I have no other excuse to proffer for the selection of contents, 
except that I am convinced that my choice represents a lasting contribution to the field, and that 
future fruitful developments seem most likely’ (p. vii). What is arguably the most urgent need at 
present is an introduction to the recent advances in harmonic maps, minimal submanifolds, 
Yang-Mills fields and the geometric applications of the Monge-Ampére equation; these seemingly 
disparate topics are tied together by one common thread, that of nonlinear elliptic equations, 
which is itself in need of a good introductory text. Is there no one to take up the challenge? 
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Charles Babbage: Pioneer of the Computer. By Anthony Hyman. Princeton University Press, 
Princeton, NJ, 1982. xiv + 287pp.+ illustrations, $25.00. 


JOHN T. O’DONNELL 
Department of Computer Science, Indiana University, Bloomington, IN 47405 


The modern era of computing began in the 1930’s with the development of theoretical models 
of computation by Turing, Church, Kleene and others. World War II provided the impetus to 
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construct practical machines which led to the modern stored-program digital computers. The 
Difference Engines and Analytical Engines designed by Charles Babbage a century before stand 
as a fascinating prologue to the modern era: they embody many of the concepts of modern 
computers, but they were largely forgotten and had little influence. 

The union of scientific and mathematical theory with engineering practice was the-central 
theme in Babbage’s life. Although his mathematical training and early research concerned calculus 
and the theory of functions, he also spent great time and effort on learning engineering and 
industrial techniques, even going so far as learning to use power machine tools. For many years 
Babbage made it a practice to visit industries of all kinds in order to study both their engineering 
techniques and the economics of their operation, becoming England’s leading expert on the 
application of science to manufacturing. This work culminated in Babbage’s most influential 
book, On the Economy of Machinery and Manufactures (1832). Babbage recognized the social 
problems caused by the industrial revolution, but believed that the continued advance of science 
and engineering would enhance the quality of life, and that the real interests of management and 
labor coincided. A firm supporter of capitalism, and, after the revolutions of 1848, a foe of 
socialism, Babbage’s economic writings influenced John Stuart Mill and Karl Marx. 

Babbage’s concern with practical applications of mathematics led directly to the Difference 
Engine. Appalled by the profusion of errors in mathematical tables compiled by hand, Babbage 
decided to build a reliable machine to calculate and automatically typeset tables using the method 
of finite differences. Although he stressed the importance of tables in navigation and the dire 
consequences of errors, Babbage was unable to convince politicians and bureaucrats that his idea 
was useful.* Over several years the government provided 17,000 pounds for the project, and 
Babbage also spent much of his own money, but more was needed and he completed only a small 
working prototype (which still works today). Hyman argues persuasively that only the obstacles of 
a dishonest chief engineer and government incomprehension prevented Babbage from successfully 
completing the Difference Engine. 

In the process of simplifying and generalizing the Difference Engine, Babbage realized the 
importance of allowing the results of calculations to become inputs to further calculations. He 
described this form of self reference as “the engine eating its own tail’. Of course the finite 
difference method does this, but always in a fixed pattern. A computation requiring a different 
pattern of calculations requires construction of a new machine, but a better solution is the 
construction of a machine whose pattern can be changed. This was exactly the problem faced by 
the French loom designers, leading to the famous Jacquard loom which allowed the weaver to 
specify a pattern by punching a set of holes in cards. Babbage adapted the Jacquard cards to his 
Analytical Engine. 

Since the pattern of calculations is flexible, as in the Jacquard loom, the Analytical Engine 
must have a general ability to use arbitrary values as inputs to calculations, and to use the results 
in arbitrary ways. Therefore the Analytical Engine contained separate sections for representing 
numbers (the Store) and calculating (the Mill, in an analogy with the active part of a cotton mill). 
Using modern terminology, each number in the Store had a unique address. The Mill contained 
two number cylinders to hold the inputs to a calculation, and another to receive the result; these 
are called registers today. Arbitrary operations consisted of a simple sequence of steps: (1) move 
the first operand, specified by an address on a “variable card”’, into a Mill register; (2) move the 
second operand, specified by another variable card; (3) perform the arithmetic operation specified 
by an “operation card”; (4) move the result into the Store location specified by another variable 
card. This procedure is similar to the operation of “three address computers”, except that the 
Analytical Engine had no real instructions containing an operation code and operands; the 


*An example of the extreme conservatism of the nineteenth century scientific bureaucrats is the belief attributed 
by the Prime Minister to the Master of Trinity College, Cambridge, “that a century should pass before new 
discoveries in science are admitted into the course of academical instruction” [Hyman, p. 149]. 
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operations and variable addresses came from separate card readers. 

The “engine eating its own tail” suggested another advance, one of the most important of all: 
using the results of a calculation to affect the motions of the card readers, thereby controlling 
what the Engine will do next. Babbage hinged the cards together, making it possible for the card 
readers to back up to previous cards as well as going on to the next card. Some of the Engine’s 
instructions could move cards through the readers without executing any operations punched on 
them, giving it the ability to perform “conditional go to statements”. The Analytical Engine card 
readers are often compared with Hollerith cards used until recently in computers, but in fact they 
are more analogous to paper or magnetic tapes, which allow similar control. 

The key operation in the Analytical Engine (as in computers) was the movement of data from 
place to place. One of Babbage’s insights was to separate the mechanism that allowed data 
movement within the machine from the mechanism that controlled which movements actually 
occurred. The control mechanism contained a studded barrel with one position to control each 
piece of data movement hardware. An operation was then specified by putting studs in the barrel 
corresponding to the correct data movements. This approach simplified the machine and also 
made it possible to change the sequence of data movements executed for any instruction merely 
by inserting or removing studs. The alternative would be a total redesign of the control 
mechanism. This technique was rediscovered in the 1950’s and is called microprogramming. 

As Hyman points out, it is difficult to determine how close Babbage came to inventing some of 
the concepts of the modern computer. A computer design consists of a hierarchy of levels: logical 
organization; register transfer model; microcode host; logic gates; electronic implementation of 
gates. In contrast, Babbage’s writings concentrate on the lowest level—gears, cams, linkages, 
torque, etc.—with little comprehensive documentation of higher-level issues. This appears to 
violate the currently popular idea of top-down design, but top-down design requires some 
understanding of the nature of the lower levels in order to design the higher levels. Babbage had 
no fixed low-level computer technology to work from. 

Even some of the detailed mechanical descriptions contain hints of fundamentally important 
ideas. One example occurs in Henry Babbage’s (son of Charles Babbage) description of the 
implementation of the “chain” [3, pp. 334-336], which corresponds roughly to the modern And 
gate. The chain transmitted a control signal to a destination only when all of a set of conditions 
were Satisfied. Henry Babbage wrote of “another important principle largely adopted, viz., to 
break up every train of motion as far as possible into short courses, the last step of each furnishing 
a mere guide for a fresh start in the mechanism from the driving power’. This might appear just to 
be a mechanical way to control losses through friction and to prevent the mechanism from 
jamming. However, it is similar to the form of logic signal amplification used in modern Very 
Large Scale Integrated Circuits, in which logic values (represented by a quantity of charge) must 
periodically be “amplified” by controlling the creation of a new charge from the power supply. 
Mead and Conway argue that such issues are fundamentally important to computation, and 
propose the term “physics of computational systems’, in contrast to the conventional “ mathe- 
matics of computation” [2, Chapt. 9]. 

The central theme in modern Computer Science is the use of abstraction and notation to deal 
with complexity. Both computers and programs may contain hundreds of thousands of compo- 
nents which individually have simple behaviors, but the analysis of their aggregate behavior can be 
extremely difficult. Therefore a crucial prerequisite to progress is the development of techniques 
for dealing with this complexity. Structured programming, hierarchical decomposition and micro- 
programming are examples of techniques which lessen the difficulty of designing hardware and 
software systems. Babbage did not organize the design of his Engines in levels of abstraction in 
the modern sense, but he recognized the problem: “The complicated relation which then arose 
amongst the various parts of the machinery would have baffled the most tenacious memory. I 
overcame that difficulty by improving and extending a language of signs, the Mechanical 


1985] REVIEWS 525 


Notation. ... By such means I succeeded in mastering trains of investigation so vast in extent that 
no length of years ever allotted to one individual could otherwise have enabled me to control.’ [1; 
quoted in Hyman, p. 165]. Babbage considered the Mechanical Notation one of his most 
important contributions, and computer scientists today would agree. 

There are three striking differences between the Analytical Engines and computers: the 
implementation technology, the number system, and the representation of programs. The first two 
are actually insignificant. Babbage chose mechanical devices as his implementation technology for 
the same reason that modern designers use solid state electronics: in order to reach the best 
compromise among ease of manufacture, speed, reliability and cost. It was convenient to represent 
numbers on geared wheels in base 10, but it is more convenient to represent numbers with 
electrical charge or voltage in base 2. The binary number system is not essential to computers: if 
in the future the best available technology has three states, we will see base 3 computers. In fact, 
Babbage considered several other number systems. However, it is not clear whether Babbage 
considered representing programs in the Store instead of punched cards. The stored program 
concept is central to some theoretical results (such as the Halting Problem), and it was considered 
a major practical advance in the early years of electronic computers because it allowed the 
flexibility of self-modifying programs. For example, a program to increment all the elements of an 
array would contain in a loop a Load instruction addressing the first word of the array, an 
Increment instruction, a Store instruction addressing the first word, and then instructions to 
increment the address fields of the Load and Store instructions. Subsequent executions of the loop 
would then address consecutive elements of the array. The invention of index registers made this 
trick unnecessary, and advocates of structured programming consider self-modifying programs 
very bad. A new set of practical advantages to the stored program computer became evident in the 
1950’s: it made possible compilers and operating systems, which must treat programs as data. 

Babbage wrote an extremely entertaining autobiography available in a modern edition [1] 
which partially describes the Engines, introduces his other research interests, and contains a vast 
store of anecdotes (he was much in demand by Society, and a good story teller). Hyman’s 
biography gives a more systematic and complete description of Babbage’s life and influence, and I 
highly recommend it. It is objective and scholarly but includes some trenchant comments (e.g., 
“Dionysus Lardner...ballooned across the engineering landscape of the time sustained by an 
inexhaustible supply of hot air.” [p. 147]). Most importantly for the reader, Hyman relates 
Babbage’s life to the contemporary scientific, political and social events. However, the book does 
not explain how the Engines worked. It contains several reproductions of technical drawings of 
the Engines, but not enough detail to follow their operation. Reprints of papers giving more 
technical details on the operation of the Engines are available in [3], and [4] reprints a wide range 
of papers on early computers. 

Babbage met frustration in the construction of the Difference Engine, but he had confidence 
that both science and his ideas would triumph. According to Philip and Emily Morrison, 
“Babbage once said that he would gladly give up the remainder of his life if he could be allowed 
to live three days five hundred years hence and be provided with a scientific guide to explain the 
discoveries made since his death.” [3, p. xxxi]. One hundred years later, computer science has 
justified his work, and its stream of discoveries shows no sign of slowing. 
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LETTERS TO THE EDITOR 


Material for this department should be prepared exactly the same way as submitted manuscripts (see the inside 
front cover) and sent to Professor P. R. Halmos, Department of Mathematics, University of Santa Clara, Santa 
Clara, CA 95053. 


Editor: 


A. Wilansky (this MONTHLY, 91 (1984) 531) asks whether there is a normed Q-algebra which is 
not inverse closed in its completion. In fact a normed algebra is a Q-algebra if and only if it is 
inverse closed in its completion. Since the details are too technical for this letter column, 
interested readers may obtain a preprint from the undersigned. 


Theodore W. Palmer 
Department of Mathematics 
University of Oregon 
Fugene, OR 97403 


Editor: 


Recent articles in the MONTHLY by Renaud (March 1983, pp. 202—203) and Galvin (May 1984, 
pp. 308-309) address the question of generating integer matrices with known integer eigenvalues 
and eigenvectors. A very simple method for this, which has been used for many years (see, ¢.g., 
J. M. Ortega, Generation of Test Matrices by Similarity Transformations, Comm. ACM 7, 1964, 
pp. 377-78) by numerical analysts to generate test. matrices is the following: 


If u and v are column vectors with u’v # —1, then 
(i+w) =T- Bw", B=(14+07u)™. 
If D is a given integer diagonal matrix and u and v are integer vectors with v’u = 0, then 
= (I+ uw") D(1+w") * = D+ u(Dv)' — Duv’ — v'Duw" 


is an integer matrix with the same spectrum as D and with eigenvectors that are the columns of 
I + uv’. It is relatively easy to generate, either by hand for small matrices or computer program 
for larger ones. 

Of course, the treatments by Renaud and Galvin have other strong points. However, for the 
purpose of generating a matrix with prescribed eigenvalues and known eigenvectors, the simplicity 
of the above approach has much to recommend it. 


James M. Ortega 

Department of Applied Mathematics 
University of Virginia 
Charlottesville, VA 22903 


153. MISCELLANEA 


A mathematician doesn’t look like anything! Which means, he will always look so generally 
intelligent that there is no single definite thing behind it all! 


—Robert Musil, The Man Without Qualities, 
Coward-McCann, New York, 1953, p. 70. 
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HERMITIAN MATRIX INEQUALITIES AND A CONJECTURE 


N. N. CHAN 
Chinese University of Hong Kong, Shatin, New Territories, Hong Kong 


MAN KAM KWONG * 
Department of Mathematics, Northern Illinois University, DeKalb, TL 60115 


1. Introduction. In many ways Hermitian matrices resemble real numbers. Indeed, all eigenval- 
ues of a Hermitian matrix are real and the matrix is diagonalizable. This similitude may lead an 
unwary mind to wrong conclusions. This is especially true in the study of inequalities involving 
Hermitian matrices. 

In the sequel we use capital letters A, B,..., X, etc., to denote n X n Hermitian matrices where 
n is Some integer greater than 1; A = A* where A* denotes the conjugate of the transpose of A. 
We use wu and v to denote complex column vectors in C” furnished with the usual inner product 
(u,v). We define: 


1. A > (>)0if all eigenvalues of A are nonnegative (positive) or equivalently if (u, Au) > (> )0 
for all nonzero vectors u € C”. 

2. A>(>)B if A-BSe(>)O, or equivalently if (u, Au) > (>) (u, Bu) for all nonzero 
vectors u€ C”. 


Unfortunately this ordering is only a partial one. Thus it is not true that if A is not greater than 
or equal to B then A must be smaller than B. Another source of trouble is the fact that matrix 
multiplication is not compatible with the ordering (unless all matrices involved are mutually 
commutative). Most inequalities involving multiplications of real numbers cease to hold when the 
real numbers are replaced by Hermitian matrices. Two simple examples are: 


1. It is not true that 4 > 0 and B > 0 imply that 4B + BA > 0. Simply take A =(? °) and 
_(1 1 

B=(1 1). 
2. It is not true that A > B > 0 implies that A? > B’*. Take A =(? | and B =(! °)), 


0 0 


To illustrate the delicacy of Hermitian inequalities further, we note two more examples: 


3. Let A and B be nonnegative Hermitian such that A + B> 0, and let X be an nxn 
Hermitian matrix that satisfies the inequality 


(1.1) (4+ B)X+X(4 +B) > AB+ BA. 


In the scalar case (n = 1), it is obvious that the real number X must be positive. However, that is 
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not always true for n > 2. A counterexample is 


4=(3 0), B=(3 3), x=({ _t). 


Then (A + B)X + X(A + B)=AB+ BA but X # 0. 
However, if in addition A > 0 and instead of (1.1) X satisfies 


(1.2) (4+ B)X+ X(A+B)>A'B+ BA", 
then X > 0. In fact if we have equality in (1.2), then X is given by 
X=A-!}-(A+B)">0. 
Lemma 1 of Section 2 can be used to complete the proof for the general case. More generally, it 
can be proved that if X satisfies 
(1.3) (4+ B)X+ X(4 + B) > AYB + BA’ 
with y € (—1,0], then X > 0. 


4. Let A > 0, P > 0 and let ¢ be a real number in (— 2, oo). In the scalar case it is trivial that 
the solution of the following equation is nonnegative: 


(1.4) A*X + XA? + tAXA = P>O. 


Can this be generalized to matrices? Equation (1.4) extends Lyapunov’s Theorem (Lemma 1 of 
Section 2). Recently in [10] it was shown that the same result holds for 2 X 2 matrices. But for 
3 x 3 matrices the conclusion is true only for t € (—2,8]. If ¢ > 8, there is always a pair (A, P) 
for which the solution X is not nonnegative definite. For each dimension n, there is a cutoff value 
ty such that the result holds for t © (—2,t,) but not for ¢ > ¢). Itis also known that t) > 2 for all 
n. The question of the exact value of ty for n > 4 is open. 


Those few results concerning inequalities that continue to hold are usually hard to prove. The 
following is a natural question to ask: 


What functions preserve the ordering of Hermitian matrices? In other words, what must f 
satisfy so that 


A> B= f(A) > f(B)? 


Let us recall how functions of a Hermitian matrix are defined. If D is a diagonal matrix with 
diagonal elements {d,,d,,...,d,}, a<d,< b, and f: (a,b) > (— 0,0) is any real-valued 
function, then f(D) is defined to be the diagonal matrix with diagonal elements 
{ f(d,), f(d,),..-,f(d,,)}. In general, if A is not diagonal, there exist a unitary matrix U and a 
diagonal matrix D such that Ad = U*DU. We then define f(A) = U*f(D)U. 

The question was studied and completely resolved by C. Loewner in his 1934 paper [12]. It 
turns out that the condition on f depends on the size of the matrices in question. More precisely, 
let 


P.(a,b) = { f: (a,b) > (—00, 00): f(A) > f( B) for all pairs of n X n Hermitian matrices 
A, B with eigenvalues in (a,b) such that A > B}. 
Then 
Pi(a,b) 2 P,(a,b)2 °°: 2P,(a,b)2---. 


The class of functions P,(a, b) can be characterized by the positivity of a certain matrix formed 
with divided differences of the function with respect to n arbitrary distinct points taken in (a, b). 
It is necessary that f © P,(a,b) possess continuous derivatives up to order 2n — 3 and that 
f@"-») be convex. 
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Loewner went on to show that the set P(a,b) = N°, P,(a,b) is miraculously tied up with 
the theory of analytic functions. Indeed P(a, b) is precisely the set of real-valued functions on 
(a, b) that are continuable analytically to the upper half plane in which the functions take values 
with positive imaginary parts (the so-called Pick functions). 

For an exposition of this beautiful theory, we refer the readers to the monograph by Donoghue 
[6], and the article by Ando [1]. 

From an advanced point of view, Loewner’s result is elegant, but the proof is by no means 
short or elementary. For instance, it is not at all obvious that f(t) = Vt belongs to P(0, 00). 
Perhaps this is the reason why the fact “t” € P(O,0o),0 < r< 1” has been rediscovered more 
than once. We shall come back to this in the next section. 

It is an interesting exercise to try to discover as many functions in P(a,b) as possible by 
entirely elementary means. In Section 2 we look at this problem. In Sections 3 and 4 we discuss a 
conjecture and some results that are inspired by an attempt to look for further inequalities. 


2. An inequality-generating theorem. We shall establish the following general result. 


THEOREM 1. Let A, B,C, D be n X n Hermitian matrices. Suppose that A commutes with C and 
that B commutes with D. If A> B>0 and C > D> 0, then for any positive r and s such that 
r+s <1 we have 


A’'C’ > B'D’. 
The proof of Theorem 1 depends on the following very special case. (Take r = 1/2, s = 0, and 
C = D=T, the identity matrix). 


THEOREM 2. Jf A > B > 0, then A'/* > B/*. 


As pointed out in the introduction, Theorem 2 is an obvious corollary of Loewner’s deep result. 
R. Bellman [3] proved Theorem 2 analytically using ideas in the theory of differential equations. 
Y. H. Au-Yeung [2] gave a simple algebraic proof of the following generalization. 


THEOREM 3. Jf A > B > 0, then A’ > B’ for allr & [0,1]. 


Special cases of Theorem 1 were used by Au-Yeung in his proof of Theorem 3 but Theorem 1 
has not been stated in such generality. 

Theorem 3 was in fact “discovered” earlier in 1951 by E. Heinz [7], not just for matrices but for 
general Hermitian operators in a Hilbert space. In the following year T. Kato [8] gave a shorter 
proof. These authors seem not to have been aware of Loewner’s result at the time the papers were 
written. The various proofs are, however, different and are worth recording. In [9] the second 
author gave yet another proof that resulted in a stronger form of Theorem 3. 

Recently the operator version of Theorem 2 found an interesting application in linear neutron 
transport theory; see [11]. An important normequivalence result can now be given a simple 
elementary proof. This is an impetus for further investigation of related inequalities. Still more 
recently Theorem 2 was used in a study of the oscillation of second order systems of differential 
equations of the form X’’(t) + Q(t) X(t) = 0, where Q(t) is an n X n Hermitian matrix for all ¢ 
and X(t) is an n-vector, see [5]. 

The following proof of Theorem 2 is a simplification of a proof that appeared in Marshall and 
Olkin [13]. Alternative proofs by other authors can also be found in [13]. 


Proof of Theorem 2. If suffices to establish the conclusion under the additional condition that 
A > 0, since the general case follows by continuity. Let 


C=A'?, D=B'”? and X=C-D. 
Then 
0<C*—- D*=C?-(X-C) =CX4+ XC — X?. 


Theorem 1 is thus a consequence of the following well-known result due to Lyapunov. 
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LEMMA 1. Suppose C and P are square matrices of the same size with C > 0. The equation 
(2.1) CX + XC =P 
has a unique solution X. Moreover if P > 0, then so is X. 


Proof. For the sake of completeness we give the proof. That (2.1) has a unique solution X can 
be checked directly after we notice that we can assume without loss of generality that C is 
diagonal. In particular, by taking adjoints of both sides of (2.1), we conclude that X must be 
Hermitian whenever P is. Suppose finally that P > 0 and wu is an eigenvector of X corresponding 
to the eigenvalue a. Then 


0 <(u, Pu) = (u,CXu) +(u, XCu) = 2a(u, Cu). 
It follows that a > 0 and hence X is nonnegative. 


An alternative proof is to make use of the explicit formula X¥ = [&° e “'Pe~“' dt which can be 
found in Bellman [4, p. 179]. 
The following lemmas can be found in [2] and [9]. 


LEMMA 2. Suppose C > 0. Then A > B implies that CAC > CBC. If furthermore C > 0, then 
the converse also holds. 


Proof. It is not difficult to prove the lemma using the definition of ordering via the inner 
product. 


Lemma 3. If A > B> 0, then B™' > A7?. 


Proof. Let I denote the identity matrix. First notice that X > J implies that X~' < J. This 
can be verified either directly from the definition of X~! or from Lemma 2 with A = X, B= 7, 
and C = X-!/* Now Lemma 2 gives B~'”7AB~'/* > I from the hypothesis that 4 > B. Taking 
the inverse we get B'/74~'B!”* < J. Then the conclusion follows after using Lemma 2 one more 
time. 


LEMMA 4. Suppose thatG > F > 0 andX>0,Y>0. If YFY > XGX, then Y > X. 


Proof. By Lemma 2, YGY > YFY, which together with the second hypothesis gives YGY > 
XGX. It follows from Lemma 2 that 


G/YGYG'? > GxGxXG'?_ or (G7YG'7)" > (E72 XE”)’. 
By Theorem 2, G'’*YG'”* > G'/”* XG'”*. Lemma 2 then gives the conclusion. 
We are now ready to give a proof of Theorem 1. 


Proof of Theorem 1. As in the proof of Theorem 2, we may assume without loss of generality 
that B > 0. By Lemma 3, B-! > 47}. 

Using Lemma 4 with X = (BD)'”*, Y=(AC)'”, F=A7! and G= B™!, we obtain the 
conclusion of the Theorem for the special case r= 1/2 and s=1 — r. 

Replacing 4 and B by (AC)!” and (BD)'””, respectively, in the special case r = 1/2 of the 
Theorem, we obtain the special case r = 1/4, s = 3/4. 

Replacing C and D by (AC)'” and (BD)'”’, respectively, in the special case r = 1/2, we 
obtain the special case r = 3/4, s = 1/4. Repeating this process, we see that the Theorem holds 
when r is any dyadic fraction, namely, when r is of the form k/2” (n = 1,2,...;k =1,2,..., 
2” — 1) and s = 1 — r. Since such fractions are dense in [0,1], the Theorem holds for all general 
r © (0,1) and s = 1 — r by continuity. For s < 1 — r, the required inequality can be written as 


(A'C’)"* > (B’D") 


We can thus complete the proof of Theorem 1 by using the result established above and Theorem 
3, which as has been pointed out is a special case of Theorem 1 when C = D=J,r+s=1. 
An immediate consequence of Theorem 1 is the following property of the Pick functions. Let 


r+s 


, where r’ =r/(r+s),s’=s/(r+s) and r’'4+s’=1. 
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P, (0, co) be the subfamily 
{ f © P(0, 00): f: (0,00) — [0, 00)} of P(0, 00). 


THEOREM 4. For any r,s > 0 such that r+s <1 and any f,g © P,(0,00), we have f’g* € 
Py (0, 00). More generally, let r, > 0 (i= 1,2,...,n) be such that Lv_,r, <1. Then V1", ff" € 
P,(0, 00) for all f; © Py(0, 00) GG = 1,2,..., 7). 


In other words, the set of functions {In f: f © Py) (0, 00)} is convex. 

Let us show how further examples of functions in P)(0,00) can be generated. We assume 
throughout the hypotheses A > B > 0; in a few cases the stronger condition B > 0 is needed. 
Notice that in any case we need only establish the result under the stronger condition B > 0 as 
the general case follows by continuity. Also notice that Theorem 4 can be used in the obvious way 
to yield generalizations. 

Let r,s > 0,7 +s <1 and a, > 0. Then a simple consequence of Theorem 4 is 


(1) (al + A)'(BI+ A) >(al+ B)'(BI +B)’. 

For alla>y>0,0<r<l 

(2) (yi + A)(al + A)" >(yI + B)(al +B)’, 

or equivalently for all B > 0, B-' > y 

(3) (y+ A)(I + BA)" > (xT + B)(1 + BB). 

Indeed by Lemma 3, A~! < B™!. Thus (1 + aAd7') < (1+ aB"'). Lemma 3 again yields 
(1+ aA~')~! > (1 + aB™!)7! which is exactly (2) with r= 1, y = 0. Inequality (2) with y = 0 
now follows if in Theorem 1 we let C = A(al + A)7! and D = B(al + B)~'. To See that (2) 
holds in the general case y # 0 we replace A by yJ + A and a bya — y. 


Integrating a known inequality with respect to some parameter leads to another inequality. The 
following are three examples. 


(4) A[In(7 + A)]-" > BlIn(7 + B)]’ 


for all r € [0,1]. We require B > 0 to guarantee that [In(J + B)]~” is defined. However, in the 
singular case we can still define B{In(J + B)|~” by a limiting process, since lim, _, 9 ¢{In(1 + t)]7’ 
exists as can be seen using L’Hospital’s rule. By Theorem 3, 


(1+ A)’ >(7+B)’,  re[0,1]. 


Integrating this inequality with respect to r over [0,1] gives (4) with r = 1. The general case then 
follows upon applying Theorem 1. 

In (2) we let y = 0 and r = 1 and integrate the resulting inequality with respect to a over [0,1] 
and finally invoke Theorem 1 to get 


(5) A[in(7 + 471)]" > B[In(7 + Bo )]”. 


On the other hand, if in (3) we let r = 1, restrict y to be in [0,1], and integrate with respect to 
B over [0,1], we obtain 


(6) (1+ yA~1)In(7 + A) > (2+ yBu!)In(7 + B). 
In particular we have 
(7) In(7 + A) > In(J + B). 


We let the readers investigate for themselves what will result from integrating, for instance, 
rA’ > rB’, r & [0,1], or inequality (2) with y = 0, r = 1 and a replaced by a’, a € [0, b]. 
In (7) we can replace A by nA and B by nB to deduce 


inf +4) > nf 2 +B) 
n n 
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Letting n — oo we get 
(8) InA > InB. 
Here f(t) = Int is an example of a function belonging to P(0, co) but not to P)(0, oo). Inequality 
(8) can also be derived from Theorem 3 via differentiation. By Theorem 3, 
A’—I>B'—-T. 
Dividing by r and taking limits as r — 0, we have 
dA’ 
dr 


dB’ 
r=0 dr 


r=0 
which is (8). 
The findings above can be summarized as follows: 
Examples of functions in P)(0, oo) include 


(a+ ¢)’(B +t) a,B>0 rs>O,r+s<1 
(y + t)/(a + ty)’ a>y>0 r € [0,1] 

(y+ t)/( + Bt)’ Bois>y>0O  reé[0,1] 
t/In’ (1 + t) r & [0,1] 

tin’(1 + 1/t) r & [0,1] 

(1 + y/t)lnQ + 1) y € [0,1] 

(tInt—t+1)In-7t 

vt tan“ (b/ yt) b € (0,0); 


Int belongs to P(0,0c). Both P)(0,0co) and P(0,0o) are closed cones (hence the integration 
technique for generating new examples works) and furthermore P)(0, co) satisfies Theorem 4. It 
should be stated again that all these can be directly deduced from Loewner’s original result. 


3. A conjecture. Taking square roots of course restores the inequality that is destroyed after 
squaring. It is interesting to ask if the following sequence of operations will preserve the 
inequality: 


1. Squaring the two sides of the inequality 4 > B > 0 to get (A’, B’). 
2. Multiplying both on the right and on the left by some C > 0 to get (CAC, CB°C). 
3. Taking square roots. 


The question is whether 
(CA?C)'”* > (CB2c)'”’. 


Lemma 2 seems to indicate that the second operation should not have worsened the situation. 
Thus ones tempted to answer yes. 

However, after some trial and error using the MAT-LAB package on the VAX at Argonne 
National Laboratory, the following counterexample was discovered. Take 


{5 2 _(1 0 
4=(3 a) B (4 too) 


_{1 0 
c=(1 5): 


It is easy to check that A — B > 0. The computer output shows (originally with 16 decimal 


and 
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places) 


4.116681... 1.679158... 
_ 2ny\l/2 _ 2ny\l/2 _ 
E = (CA’C) (CBC) 1.679158... 0.134061... ]° 


The eigenvalues of E are — 0.4794... and 4.730... . Thus (CA47C)'”* # (CB’C))”. 

The same experiment seems to indicate that with the special choice C = B or A, the inequality 
persists. We therefore propose the following conjecture. We have found a proof for the case of the 
lowest dimension n = 2. It is proved by sheer brute force although the proof is by no means 
straightforward. 


CONJECTURE. Jf A > B > 0, then 


(3.1) (BA*B)'”* > B?, 
and 
(3.2) A? > ( AB?A)'””. 


Notice that the second inequality is a consequence of the first one. We may assume without 
loss of generality that B>O so that Bu! exists. By hypothesis, B~' > 4~! > 0. The first 
inequality then gives (4~'B~*A~')'”* > A~?. Taking inverses now gives (3.2). 

It is tempting to guess that the similar inequality 


(AB?4)'”” > B? 


also follows from the hypotheses of the conjecture. If this were true, then by transitivity (using 
(3.2) at least for n = 2), it would follow that A? > B*, which we know is false. 
Repeated use of (3.1) shows that 


(3.3) (B242B?)'”* > B’, 

and more generally 

(3.4) (B"-142Bm-ly'/* s Bm 

for m = 2*, where k = 1,2,... . We know that (3.4) is true at least in the two-dimensional case. It 


is only natural to ask if (3.4) is true for all m © (1, 00) and for all dimension n. 


4. Some evidence for the conjecture. Let us see how Conjecture 1 leads to another proposition. 
Let B be any positive Hermitian matrix and P be any nonnegative matrix. Define 


y(t) =| B(B+ Py Bl”. 
By Conjecture 1, Y(t) > Y(O) for ¢ > O. Thus 
(4.1) Y’(0) > 0. 


By differentiating the equation Y7(t) = B(B + ¢tP)*B and then letting t= 0, we see that 
X = Y’(0) satisfies the matrix equation 


(4.2) B?X + XB* = B’PB + BPB’. 


Notice that Q = BPB is also nonnegative. Thus the validity of the following is a consequence of 
that of Conjecture 1. 


THEOREM 5. Let B be a positive Hermitian matrix and O a nonnegative Hermitian matrix. The 
solution X of the following matrix equation is always nonnegative: 


(4.3) B?X + XB? = BO + OB. 


If the right-hand side is nonnegative, then, by Lemma 1, the conclusion holds. However, as 
noted before BO + QB is in general not nonnegative. 
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Proof of Theorem 5. Let Y be the solution of 
(4.4) B’Y + YB*=@Q. 


By Lemma 1 of Section 2, Y > 0. Thus it follows from Lemma 2 that BYB > 0. 


Direct computation shows that BY + YB satisfies equation (4.3). Hence by uniqueness 
(Lemma 1) 


X= BY+ YB. 
Now 


BX + XB = B(BY + YB) +(BY+ YB)B 


=~ Q+2BYB 
> 0. 
Thus by Lemma 1 again X > 0. 
COROLLARY 6. Let b,,...,5, ben positive numbers. The determinant 
bb ( b? + b? 
(4.5) jer li +H) 
(b + b*) Iy=l,...n 


is nonnegative. 


Proof. Apply Theorem 5 with B = diag(b,,...,b,) and O = B'”? RB'”? where all the entries 
of R are 1—direct computation shows that the solution of (4.3) is given by (4.5). 


It was originally planned to derive Theorem 5 as a consequence of Corollary 6 which can be 
proved directly in case n = 2 and 3. 
It is interesting to point out that the following generalization of Theorem 5 is still true. 


THEOREM 7. Let s be any real number in [1, 00) and B,Q be as in Theorem 5. Then the solution 
of the matrix equation 


(4.6) B’Z + ZB’ = BO + OB 
is nonnegative. 


Direct computation easily confirms the case n = 2. Lemma 1 can be thought of as the limiting 
case s > 00 (with B’ = C and B = C’’s = identity). Notice that the Theorem is false for s < 1. 
Theorem 7 and other generalizations will be given in a forthcoming paper. 

A most natural question to ask is: How can one characterize all functions f such that the 
solution of the matrix equation f(B)Z + Zf(B) = BO + OB is nonnegative? 

To conclude the paper we mention another tantalizing conjecture. 


PROBLEM. Let A, B and C be nonnegative Hermitian matrices such that A < C,B < C. Is it true 
that 


(A? + B?)'” <2? 


There are counterexamples to the more general conjecture: A < C,B< D_ implies 
(A? 4 B*y 7? < (C? 4 D?) 72, 
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MONOSTATIC SIMPLEXES* 


R. J. MacG. DAWSON 
Department of Mathematics, Corpus Christi College, Cambridge, England CB2 1RH 


It is a fact, learned by many of us at an early age, that certain objects are liable to fall over if 
stood the wrong way up. Flat-faced objects are usually considered to be safer, but it is easy to 
design a polyhedron that is unstable on some of its faces. Some years ago, it was shown that a 
polyhedron may be constructed which is monostatic (unstable on all but one of its faces) [1]. 

In the same article, it was stated that no tetrahedron can be monostatic, although the published 
proof was incomplete, making no explicit use of the position of the center of mass of a 
tetrahedron. (That this is essential may be seen from the construction, due to Conway, of a 
non-uniform tetrahedron with a center of mass very near one corner which is stable only on one 
face. James Tappin has suggested to the author the picturesque term vanka-vstanka for a 
non-uniform polytope stable on only one face, from the Russian word for a doll that rights itself 
when knocked over.) A complete proof, not published, was given by Conway [1], [2], who also 
asked various related questions. This paper presents a partial answer to one of these: in how few 
dimensions does a monostatic simplex exist? While I cannot give a sharp answer to this, it will be 
shown that no such simplex exists in six or fewer dimensions, and that there is an example in ten. 
Additional references and open problems are given in [4]. 


1. Nonexistence of Monostatic Simplexes in Low Dimensions. The nonexistence result is 
essentially a more quantitative version of the following 


THEOREM (Conway). No tetrahedron is monostatic. 


Proof. Obviously, a simplex cannot tip about an edge unless the dihedral angle at that edge is 
obtuse. As the altitude, and hence the height of the barycenter, is inversely proportional to the 
area of the base for any given tetrahedron, a tetrahedron can only tip from a smaller face to a 
larger one. 

Suppose some tetrahedron to be monostatic, and let A and B be the largest and second-largest 
faces respectively. Either the tetrahedron rolls from another face, C, onto B and thence onto A, or 


R. J. MacG. Dawson: I was born in 1961, in Halifax, Nova Scotia, into an academic. but definitely non-mathe- 
matical family. My B. Sc. (mathematics and physics), is from the University of King’s College, in Halifax, although 
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buy him or her a drink.) I am currently working in geometry under Dr. Hallard Croft, in the hope of eventually 
getting my Ph.D. My outside interests include cycling, pastel drawing, playing the recorder, and hiking; and I have 
recently bought a motorbike and am learning to nde it. 
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else it rolls from B to A and also from C to A. In either case, one of the two largest faces has two 
obtuse dihedral angles, and one of them is on an edge shared with the other of the two largest 
faces. 

The projection of the remaining face, D, onto the face with two obtuse dihedral angles must be 
as large as the sum of the projections of the other three faces. But this makes the area of D larger 
than that of the face we are projecting onto, contradicting our assumption that A and B are the 
two largest faces. 


Fic. 1. If the dihedral angles shown are obtuse, face A cannot be larger than face D. 


In more than three dimensions, it will be convenient to use a representation for the simplexes 
based on a theorem of Minkowski. A set of vectors in R” is said to be equilibrated if the following 
hold: 


(1a) yi x, = 0, 
(1b) {x,} spans R”, 
(1c) x, #ax, foranyi#j,a>0. 


For any face of a polytope, the face vector is the vector with length equal to the area of the face 
and directed normal to the face. 


THEOREM (Minkowski). Every equilibrated set of vectors in IR" is the set of face vectors for a 
unique convex polytope. 
The set of face vectors of any convex polytope is equilibrated. 


(For proof, see [5]; there is a discussion of this result in [3].) 


We will refer to the set of face vectors as the Minkowski representation of the polytope. In this 
representation, there is a particularly simple criterion to determine whether a simplex will tip 
between two given faces. 


PROJECTION CRITERION. Let, for each face F, of a simplex, x, be the face vector, and let 0,, be 
the angle between x; and x,. The simplex can tip from F, to F, if and only if 


(2) IIx, | < I| x, ||cos 6. 
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Proof. Let v, be the vertex opposite F,. The barycenter B is given vectorially by: 


_ 2M 


(3) Brel 


For any vector x, let p(x) be the projection of x into the plane (F, 0 F)~ . Note that p,(v,) is 
the altitude of F, from (F, N F); hence 
Ix, || _ llpiCy,) I 


(4) =: 
Ix lla) I 
Project again into the line (F|N F)~ OF; 


(5) p(B) = P20) Fal) 


n+1 


The simplex tips from F, to F, if and only if 
P2(B) < 0, 
= 1P1(¥, Il < Ilpi(¥) Ileos 6;, , 
(2) & ||x,|| < [|x,l|cos,,. 


LEMMA. Suppose F.,, F,_,,...,f are faces of a simplex which tips from F to F,_, for 
i=1,2,...,m. Then 


oT mt+1 
(6) |%nlleos %,, > —(cos[ —"—}] — Ixolh 
Proof. By the projection criterion (2), 
III 


<cos@_,; 
lx, nl 
SO 
Xnll — 
ml < TT] cos6,,_,, 
IXoll = jaa - 
x, lIcos 8 m 
(7) [1>mlIEOS om > min| 0, cos 6,, | | cos 9,;_1 |. 
lIXoll i=1 
By the triangle inequality, 
m 
» 6; 5 2 9om> 
i=l 
and hence 
m 
(8) (a — In) + » 9-1 ZT. 
i=1 


Minimizing the right hand side of (7) under the constraint (8), we see that the left member of (7) is 
greater than the minimum of —I1”,cosa; taken over sets of a; whose sum is 7, and this 
immediately gives (6). 


THEOREM. No simplex in R® is monostatic. 


Proof. Label the faces in decreasing order of size (i.e., with Fy the largest face and with K the 
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smallest). We consider two cases, depending on whether the simplex tips onto Fy from one face or 
from more than one face. 


Case 1. F, is the only face from which the simplex tips onto Fo. 


If we place the simplex on F,, i > 1, it reaches F, after at most (i — 1) falls. Thus, by the 
lemma, 


IIx, ||cos 4), ( 7) 
9 ————= > —cos'| — }. 
0) I Xa 
Using this and the other inequalities 


(10) 


X,|lcos 6 
IXolleos Bor 


3 


1P.Sul 
(11) || Xz||cos 6,. + ||x¢||cos A; 6 || X||cos 8,4 — ||X¢l| 
all 
IP.Sal Ix; || 
|x| 
IP.Sul 


> cos’6,, — cos86,, 
> 4; 
we can derive the following lower bound on the projection of the sum of the face vectors onto x,: 


6 
Ix,llcos,, 7 gm gm gt 
12 —_———— > — — cos’ > — cos"—= — cos’ = 
(12) 2X I XII 4 3 4 : 
= (77 — 5V5) /64 
> 0, 
contradicting our assumption that the vectors form an equilibrated set. 


Case 2. There are two or more penultimate faces. 


We deal similarly with this case, using the bounds 
|X; [Cos O, + || X¢lleos Ae 


(11’) OS 


1 
Ixol 4 for F, and F, 


|| x |leos 4, 


(13) Ixol 


and (6) for any other face, given that the simplex rolls from F to Fy in at most (i — 1) rolls. 
Thus: 


> 0 for any penultimate face other than F,, 


6 
I|X,||cos , _ 3 37 4% 57 
(14) X ixoll > | ~ C08’ 3 — cos" — cos’; 
= (13 — 575) /64 
>0. 


2. Existence of a Monostatic Simplex in R!°. Suppose a set of vectors in R™ to be equi- 
librated; if we embed the R” in a higher-dimensional R”, (1a) and (1c) will remain satisfied. If 
the set has more than n elements, we may also perturb it slightly so that it spans R” without 
losing the other properties. If there are m+ 1 vectors, the perturbed vectors must be the 
Minkowski representation of a simplex; and if, in the original set of vectors, for every x, but one 
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there exists an x, satisfying (2), then for small enough perturbations the resulting simplex will be 
monostatic, as (2) will continue to hold in each case. 
Let 6, i = 1,2,3,4,5, be defined as follows: 
_ 8la _ d4a _ 367 _ 24a _ lon 
2277 2 227? 3 227? 4 227” 8 227” 
and define, for any fixed ¢ > 0, {X9,x,,|i = 1,2,3,4,5}: 


O, 


(15a) arg(x,) = —arg(x_,) = ¥ 6 aralso) =0 


(15b) Px, l] = [x |] = | [1 cos6 — &, 
j= 


(15¢) Ixol= — Z is + Ix_,IDeos © “| 


It follows from (15c) that the vectors are equilibrated. From (15a) and (15b), we see that for 
i = 1, 2,3 or 4 and any small positive e the projection criterion (2) is satisfied for a fall from F ,, 
to F, or from F_;,,,) to F_,. It remains only to verify (2) for a fall from F,, to fo. As 
|X .1|| = cos @, — e, it is sufficient to show that, for small enough g«, ||xo|| > 1. In fact, if we 
calculate the right hand side of (15c) numerically as ¢ approaches zero in (15b), we obtain a value 
of 1.21953 + , and hence (2) is satisfied for small enough e in (15b). The relative sizes and angles 
of these vectors are shown in Fig. 2. (The 3:2 ratio between successive angles was found 
heuristically; it does not in fact give the set of 6, which maximizes ||x,||, but comes tantalizingly 
close! It would be interesting to know why this happens.) 


Xx} 


X94 


X3 


% [4 a 
WTS g < ws 
0, \8, x 


—4 


x_3 


X_y 


Fic. 2. Plane vectors approximating the face vectors of a ten-dimensional monostatic simplex. 
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THE ISOPERIMETRIC POINT AND THE POINT(S) 
OF EQUAL DETOUR IN A TRIANGLE 


G. R. VELDKAMP 
Biltsteijn 2, 3732 GT De Bilt, The Netherlands 


This paper takes its being from an asterisk occurring in Problem E 3020, proposed by Clark 
Kimberling, University of Evansville. It runs as follows: 


Suppose ABC is a nonisosceles triangle. Find three hyperbolas concurrent in a point P such 
that triangles APB, APC, and BPC all have the same perimeter. (*) How does this common 
perimeter compare with that of ABC? 


As usual in this Journal the asterisk indicates that at the time of publication neither the 
proposer nor the editors of the problem section were in possession of a solution. 

It is not difficult to describe unambiguously the three hyperbolas referred to in the problem, 
but one discovers at once that these curves are not very informative as to a full understanding of 
the problem. Several questions arising in this connection have been completely treated in the 
present paper. 

For readers not very well acquainted with some geometric terms occurring in the paper 
References [1] and [2] might come in useful. 


1. The problem. A finite point X in the plane of the triangle ABC will be called an 
isoperimetric point of this triangle if the triangles XBC, XCA and XAB all have the same 
perimeter 20. This paper deals with the following questions: 

Has any triangle an isoperimetric point? If not, find a necessary and sufficient condition for a 
triangle to have such a point (§§7, 8). Is this point related to well-known remarkable points of the 
triangle? How could one construct it by ruler and compasses? Does there exist an upper and lower 
bound for the ratio o/s, where 2s stands for the perimeter of AA BC? (§10). It will be shown that 
answers to these questions can be obtained by analytic as well as synthetic means. 


2. Notations and some frequently used formulas. The sideline of AA, A, A, opposite A, will be 
denoted by /,, the side (and its length) lying on /, by a, and the angle opposite a, by a,. 
Furthermore, J and r stand for the center and the radius respectively of the inscribed circle y; 
B, is the point of contact of y) with a,. The radius of the escribed circle of AA, A.A; having its 
center on A,J will be noted by r,. For briefness’ sake we shall use the abbreviations sin 5a, = 5,, 
cos}a, =c, and tanja, =t, (k =1,2,3). Finally, 2s, F and R denote in this order the 
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perimeter, the area and the radius of the circumcircle of AA, A,A,. The following formulas and 
the formulas obtained by cyclic permutation shall be used frequently: 


(2.1) SyC3 + $3Cy = Cy3CoC3 — S753 = 8159nCy + 8303 = C,( CnC, + 5453); 
(2.2) r= F/s;r, = F/(s — a,);R = a,a,a0,/4F. 
Furthermore: 

(2.3) S = 4Re,c,¢,;r = 4Rs,555337, = St, (k = 1,2,3). 


Using the suitable formulas from the collection, we find: 

s(t, +t, +t) = 4RaoG(t, +t, + t,) = 4R( 506, + 590,¢, + 53¢,c,) = 4R(1 + 5,555); 
hence: 
(2.4) nt+nHn+n=4R+r. 
We observe finally that 
(2.5) tot; +t + ht, =1. 
Except for §8, we assume throughout that AA, A, A, is nonisosceles. 

3. Three hyperbolas. Let P be a point in the plane of AA, A,A;. We put PA, = p, (k = 1,2,3). 
By definition P is an isoperimetric point of AA, A, A, if and only if 
(3.1) PP, + a + P3 = Py + 4. + py = py + 43 + py = 20. 
This is equivalent to 
(3.2) P3~ Pi =4,-— 4, and p, — py = a, — ay. 
Let Aj be the mirror image of A, into the perpendicular bisector of a, (k = 1, 2,3). The first of 
the equations (3.2) expresses the fact that P is a point of the branch passing through A% of the 
hyperbola h, going through A, and having A, and A, as its real foci. The second equation (3.2) 
states that P is on the branch through A* of the hyperbola h, going through A, with A, and A, 
as its real foci. It follows from (3.2) that p, — p, = a; — a,. This means that any common point 
of the aforementioned branches also lies on the branch going through A* of the hyperbola h, 
with A, and A, as its foci and passing through 4,. 


Our conclusion is: an isoperimetric point of a triangle is a common point of three well-defined 
branches of three uniquely determined hyperbolas. 


4. The equations of A, and h,. We shall use homogeneous triangular point coordinates 
X1,%X7,X3 With respect to the basic points A,(1,0,0), A,(0,1,0), A;(0,0,1) and the unit point 
7(1,1,1). 

Let X, be the orthogonal projection of X(x,,x.,x,) onto /, and d, the directed distance 
from X to /,; then 


d, = fx,(k = 1,2,3;f # 0) 
and 
2F = a,d, + a,d, + ad; = f( a,x, + a,x, + a;x3). 
Hence, 


2F 


(4.1) ~ A,X, + A,X + a3X3 


We have in the quadrangle A, X, XX, (Fig. 1) the relation: XA, sina, = X, X3 and therefore 


2. ——2 
XA jsin’a, = X,X3 = di + d} + 2d,d,cosa, = f?( x? + x3 + 2x,x,cos a,). 
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A, 
x; 
Xy 
A, 
A, xX, 
Fic. 1 
Consequently, 

oe 1/2 
(4.2) XA, sina, = fi x3 + xj + 2x,x3c08 a, ) ; 


The locus of the points with zero distance to A, is the pair of isotropic lines through A,. These 
lines are therefore represented by x3 + x} + 2x,x,cosa, = 0. Since A, is a focus of h,, the said 
lines are tangents of h,. If u,x, + u.x, + u,x3 = 0 represents the join of the points of contact 
(i.e., the directrix of h, corresponding to A,), the equation of h, takes the form 
(4.3) (uy, x, + Ux, + u,x,)° — A( x5 + x} + 2x,x,c08 a,) = 0. 
Because h, passes through A, and has the internal bisector x, — x, = 0 of 2A,A,A3 aS a 
tangent, we must have A = u3 and 
(4.4) u, — u,cosa, + u, = 0. 
The directrices of h, are perpendicular to the principal axis x, = 0, that is, parallel to the altitude 
x, COS a, — x,coSa, = 0 through 4,. Since the line at infinity is represented by 

x, sina, + x,sina, + x,sina, = 0, 


we must therefore have 


COS a, 0 — COS 3 
sing, sina, sina,| = 0, 
uy ) U3 
or 
(4.5) U,COS a, — u, + u,cosa, = 0. 


It follows from (4.4) and (4.5) that u,, u, and u, are proportional to sin’a,, cosa; — cosa, and 
COS a, COS a, — 1, respectively. From these facts results the following equation for h,: 


{ x, sin’a, + x,(cosa, — cosa,) + x3(cose, cosa; — 1)}" 
—(cos a, — cosa,)( x? + x? + 2x.x; cos a,) = 0. 


We know that A, is also a focus of h,. This permits us to obtain the equation of h, simply by 
interchanging the indices 2 and 3 in the equation for h,. This yields 
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{ x, sina, + x, (cos a,cos a, — 1) + x3(cosa, — cos o,)}° 
—(cos a, — cos a, )*( x3 + x3 + 2x,x,cos a,) = 0 


as the equation for h,. 


5. Pairs of lines in the three pencils determined by 4,,/, and h, taken two by two. Elimination 
of x3 + x$ + 2x,x,3cos a, between the equation of h, and h, yields 


2 ; 2 
(cos a, — cosa,)°{ x,sin’a, + x,(cosa; — cos a,) + x3;(cos.a,cosa, — 1)} 
(5.1) 2 : 2 2 
= (cosa, — cos a,)°{ x,sin’a, + x, (cos a,cos a, — 1) + x3(cos a, — cosa,)}. 
This equation obviously represents two straight lines constituting a degenerate conic of the pencil 
II,, determined by the pair {h,,h,}. An easy computation reveals that (5.1) is the equation of 


the pair of lines {/,,,}, where / is represented by 


(5.2) c?(c3 — ch) x + (3 — c?) x. + 2 (c? — 3) x, = 0, 
and n,, by 
(5.3) s?(c2 + c? — 2c?) x, + 83(c? — c2) x, + 83(c? — c2) x, = 0. 


The equation of / is invariant under cyclic permutation of the indices 1, 2,3, which cannot be said 
of equation (5.3). We infer that the pencils II,, and II,, determined by the pairs {h,,h,} and 
{h,,h,} contain the degenerate conics {/,n3,} and {/,n,,}, respectively, where n,, and n, are 
represented by 


(5.4) s?(c3 — ch) x + 93( ch + 7? — 2c3) xy + 53?(c3 — c?) x, =0 
and 

(5.5) s?(c? — c2) x, + s3(c} — c?) xy + 82(c? + cf — 22) x, = 0. 
This proves 


THEOREM 1. The hyperbolas h,,h, and h, have two points in common lying on 1. 


We observe that in the equation of / the cofactors of x,, x, and x; add up to zero and that the 
same is true for the cofactors of c7x,, c}x, and cjx;. The meaning of this is that the point 
7(1,1,1) and the point G(c3c3, c3c7,c?7c3) are on /. It is easily seen that G is the point of 
concurrence of the lines A, B,, A,B, and A,B,; it is generally known as the Gergonne point of 
A A,A,A,. Hence the 


COROLLARY. The line 1 of Theorem 1 is the join of the center of the inscribed circle to the 
Gergonne point of the triangle. 


In the equations (5.3), (5.4) and (5.5), the sum of the cofactors of s7x,, six, and s?x, 
vanishes. This means that n,,, 13, and n,, have the point N(s3s}, 52s}, 57s?) in common. It is 


the isotomic conjugate of G and generally known as Nagel’s point. 


6. The coordinates of the points common to h,, 4, and h,. To find these coordinates, we 
determine the intersections of GJ with h,. We shall use for our aim a somewhat modified 
equation of h, which can be obtained by the following argument. We observe that 


B, A, —_ B,A, = 5 — a, —(s —_ a») = ay —_ a, = A,A, —_ A,A,. 


Therefore B,(c3,c?,0) is on h; and the same is true for the isotomic conjugate Bj(s3, 57,0). 
Hence h, is a member of the pencil of conics passing through A,, B,, Bj and having x, — x, = 0 
as their tangent at the point A,. This pencil is represented by 


550 G. R. VELDKAMP [October 


(6.1) (cP x, ~ c5x_)( Spx, — $3x>) +Ax3(x%, — x2) = 0. 
Since JB, L A,A,, the line 7B, touches h, at B,. Hence A,B, is the polar line of J with respect 
to h,. The equation of A,B, is c?x, — c}x, = 0, whereas the polar line of 7(1,1,1) with respect 
to (6.1) is given by 
(c? — c2)(s?x, — 53x.) +(s? — s3)(e?x, — efx) +A( — x2) = 0. 

Hence 

(¢? — c3)(s2e5 — sic?) = A(¢? — ci) 
and X = cs — c?. The equation of h, is therefore 
(6.2) (c?x, — chx,)(s?x, — s3x,) +(c3 — c?) x3(%, — x2) = 0. 
By means of a parameter p any point on GJ may be represented by 

(cle? + py cde? + py cdc} + m). 
22,2 


Substituting these coordinates into (6.2) we find that this point is on A, if and only if p* = c?csc?. 
This proves 


THEOREM 2. The common points of h,, h, and h, are 


(6.3) O( cx¢3( cc; + cy), €3€,(¢3¢, + Cy), €1€4( C1, + c3)) 
and 
(6.4) P( 03 (C263 — cy), €3€,(¢3¢, — Cy) 5 C4 Cq (C1 C2 — c3)). 


With I = (C1 €9C3, €1C9C3, C1 CnC, we have in symbolic notation 
O=G+I/1,P=G-—I. 
Hence the 


COROLLARY. The two common points of the three hyperbolas are harmonic conjugates with respect 
to G and I. 


7. The position of P and Q on h,. To decide which of the points P and Q is on which of the 
branches of h,, we introduce the point Z, with coordinates 
( c3c3 + EC)C7C3, csc; + EC1C7C3, cecs + €C,C7C3 ) ; 


where e” = 1. Hence Z, coincides with P or Q according as e = —1 or e = 1. We put moreover 
QA, = q, and Z,A, = z, (k = 1,2, 3) and we proceed to compute z, by means of (4.1) and (4.2). 
To find the value of f. of f corresponding to Z, we have to evaluate 


a, (2c? + ec,¢)¢,) + ay( C3? + ec,,¢,) + ay (c2e} + ec, CC; ) 
= 4Re, 0503 (5,C)€3 + $€3C, + 53C,C,) + 2esc,cyC3. 
Using the formulas from §2, we find that this is equal to c,c,c,(4R + r+ 2s). Hence 


(7.1) f.= 


IF _ 32 R*s, 5554 
C1C,0,(4R+r+2es) 4Rt+rt+2es” 


It may be observed that our result shows that Q is always a finite point, whereas P is at infinity if 
and only if 4R + r = 2s. We suppose further on that 4R + r # 2s. By means of (4.2) we obtain 


2 2 
_ 2,2 2,2 
25,¢,2, =|f, I{ ( cei + @C,C,¢,) +(cPe3 + ec,cy¢;) 


+2(2¢? — 1) (3c? + €C,CC; )( c7c3 + eee) ) 
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This leaves us with 
(7.2) z= si Is; te, (c3 + cf + ecc9¢;), 
and by cyclic permutation 
(7.3) z= a Isylen(c} +c? + 2ecyeye,) and z, = a Is3 'e3(c? + cf + 2ecycx¢3). 
It follows that 

Z,-2,= sli Isp "sz 1{(sy¢, — $1€,)( c} + 2ec,c,¢3) + c4¢)(52¢, — 51%) }, 
or 

Z,—- 2 = sli Is; 85 '(85c, — 8,¢,)(1 + 8,858, + 2ec,C2¢;). 


Using (7.1) we arrive at 


4Rs3;(s5c, — 5,C))(4R + r+ 2es) 4R+r+ 2es 


(7.4) 0 4 > 22> aR +r + 2es = (42 — WAR T+ Des] 


Taking « = 1 we get: g, — q, = a, — a,. This means: Q is on the branch of h, passing through A,. 
If we take ¢ = —1 we are dealing with P and we have to distinguish two cases: 

i. 4R+r> 2s. Then p, — p» =a, —a,, which means that P is on the branch passing 
through 4,. 


ii. 2s > 4R + r. In this case P lies on the branch of h, passing through A¥. We therefore find 
as a final result of this paragraph: 


THEOREM 3. A triangle has an isoperimetric point if and only if 2s > 4R + r. 


It is known from elementary geometry that a triangle is acute-, right- or obtuse-angled 
according ass >2R+r,s=2R+rors<2R+ r. Hence the 


COROLLARY. A triangle without an isoperimetric point is always obtuse-angled. 


Fig. 2 gives an idea of the situation for a triangle with an isoperimetric point P. 


8. Transformation of the condition 2s > 4R + r. It will be clear that it is not possible to 
decide at first sight whether a given triangle fulfills the condition of Theorem 3. In this paragraph 
we replace this condition therefore by an equivalent one that is somewhat more transparent. 

We have 4R+r=r,+%5+n,=S(t, + t, + £,); the inequality of Theorem 3 is therefore 
equivalent to ¢,; + t, + t, < 2. Let us suppose a, > a, > a,,1.€., 4, > t, > t,. From (2.5) we get 


t,=(1-4t)/(t, +b) 
and therefore 
t+ti+h=(7+4h4+44+1)/(t, +h). 
The system of the preceding inequalities is therefore equivalent to 
(8.1) (,+t,-1)° -—4t <0OAH4+24t,-1<0At>4,>0. 


We consider (¢,,¢,) as a point T with rectangular coordinates ¢t, and ¢t,. The equation 
(x + y — 1)* — xy = O represents an ellipse ¢ touching the coordinate-axes at A(1,0) and B(0,1) 
(Fig. 3) and meeting the line x = y at C(1,1) and D(4,+). The equation y* + 2xy —1 <0 is 
that of a hyperbola h with asymptotes y = 0 and 2x + y = 0; it meets e at B and E(4/3,1/3) 
and the line x =y at F(V3 /3,V3 /3) and F’(— y3 /3, — V3 /3). The inequalities (8.1) are 
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geometrically equivalent to the assertion that F(¢,, t,) lies either in the interior of the curvilinear 
triangle DFB shown in Fig. 3 or on the line x = y between D and F. We introduce the pencil of 
conics given by xy + A(x + y) —1=0, where AX is a parameter. Its members are equilateral 
hyperbolas having only two real points in common, namely the points at infinity of the 
coordinate-axes. Only those parts of these hyperbolas are of interest for our problem, which are 
lying in the first quadrant. Clearly these parts meet the line x = y between D and F if and only if 
they go through a point F(¢,, ¢,) in the interior of the curvilinear triangle DFB. A member of the 
pencil on hand goes through F if and only if 


A=(1- 4b) +h) =b. 
If (u, uw) is its intersection with x = y, we have therefore 
where } < u < 1¥3. Since (1 — u?)/2u is monotonic decreasing on this interval, we infer 
173 <1, < 4; hence 7/3 < a; < 2 arctan 4/3. The first inequality is trivial because «, was 


assumed to be the largest angle of AA, A,A,. The second half of the inequality provides us with a 
rather practical condition for a triangle to have an isoperimetric point, namely 


THEOREM 4. A necessary and sufficient condition for a triangle to have an isoperimetric point is 
that its largest angle not exceed 2 arcsin 4/5 = 106°15’37”. 


It may be observed that the equilateral hyperbola h, of the aforementioned pencil passing 
through D corresponds to an isosceles triangle with 2 arcsin 4/5 as its vertical angle; it does not 
have an isoperimetric point. 


9. An expression for o in terms of R, r and s. If P is the isoperimetric point of AA, A,A;, we 
find by means of (7.2) and (7.3) that 


1 = 
Py, + Po + P3 = 5 IF 515253) { s583¢,( ¢% + C3 _ 2¢1€5¢3) + $3810 ( ¢3 + cy _ 2¢1C¢3) 


(9.1) + $1 55€3( ce + 05 — 2c,c2¢3) } , 
where 

2F 
(9.2) lf| = 


€,€,0;(2s — 4R—r)- 
Formula (9.1) can be simplified to 
1 1 
Pi + Py + P3 = 5 IF 515253) €1€703(1 + 45,5553 — 2¢,€C3). 


Using (9.2) we get, in view of F = rs = 4Rss,58553, 
(4R + 4r — 2s)s 


Py + Pr + P3 = 9s—-4R—pr 
From (3.1) we derive p, + p, + p3; + s = 30. Hence 
rs 
(9.3) rr oe 


10. Inequalities for the ratio o /s. It will be clear from the above that o/s has no finite upper 
bound. The inequalities 
Po + P3> 4,Pp3 +p, > a4, and p, + py > a; 


are valid for any triangle and any position of P. Hence p, + p, + p; > s. Combining this with 
30 = p, + Po + p3 + S, we find o/s > 2/3. This inequality can be improved by observing that 
the minimum value of o/s corresponds to the maximum of (2s —4R-—r)/r. Using the 
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inequality 
s —2R < (3¥3 —4)r, 
where equality holds if and only if the triangle is equilateral [3], [4], we get: 
2s-—4R-—r< (6y3 — 9)r. 
Hence: 


THEOREM 5. /f a triangle has an isoperimetric point, the inequality 
1 
(10.1) o/s > 5(3 + 2/3 ) 


holds with equality if and only if the triangle is equilateral. 


Fic. 4 


11. A construction for the isoperimetric point. We assume a, > a, and we suppose that 
AA, A.A, has an isoperimetric point P. It lies on GJ such that 


PA, — PA, = A,A, — A;A,. 


According to our assumption, this means that P is the center of a circle 6 passing through A, and 
having internal contact with the circle y with center A, and radius c ‘= A,A, — A,A,. The set of 
all circles with centers on GJ and passing through A, is obviously a pencil with the perpendicular 
n from A, onto GI as its radical axis (Fig. 4). Let e be a member of this pencil meeting y at S, 
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and S§,. The radical axis of e and y is S,S,; that of e and the circle 6 we are looking for is n, 
whereas that of 6 and y is a common tangent to these circles. Since these three radical axes are 
concurrent, this common tangent passes through the intersection S of n and S,S,. The points of 
contact C, and C, of y with its tangents through S coincide with the points of contact of y with 
the two members of the pencil which are tangent to y. At Fig. 4 the point C, corresponds to the 
member with external contact and A,C, meets G/ therefore at the point which in the preceding | 
section has been denoted by Q. The intersection of A,C, and G/ is the isoperimetric point P of 
the triangle. 


12. The geometric point of view. Let I, be a circle concentric with y, with radius exceeding r. 
The six intersections of T, with the side lines of AA,A,A, are denoted by C,, in such a way that 
the segments C,,C,3, Cy,C,, and C,,C,, are of the same length 2p and have B,, B, and B, as 
their midpoints, respectively, and that, moreover, C,,C,,||B,B, for any permutation of the indices 
(Fig. 5). 


FIG. 5 


Let C,,C,,C, and D,, D,, D, be in this order the intersections 
Cyan 1 Cy 3.Cg3 5, Co3@y3 1 Cy Cay, C31 Ca) OV CnC 
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and 
CC ny, () C3C3 » C33C32 () Cy Cy, C31 Cy3 () C32C)3. 


Then AC,C,C, and AD,D,D, are homothetic with AB,B,B,. The triangles D,C,,C,, and 
B, B, B, are homothetic too. Since C,, B, and C;,, B, intersect at A,, the line D, B, passes through 
A,. Moreover, AC,C,3;C;, is homothetic with AB, B, B,. The lines C,,B, and C,,B,; have A, in 
common; hence B,C, passes through A,. It follows from these facts that A, and B, are on C,D,. 
Likewise A, and B, are on C,D, and A, and B, on C,D,. Since AC,C,C, and AD, D, D, are 
homothetic, C, D,, C,D, and C,D, are concurrent. It follows that A,B,, A,B, and A,B, are 
concurrent. Their common point G is known as the Gergonne point of AA, A,A,. Obviously G is 
the center of homothecy for each of the 3 pairs of triangles taken from the triplet 
{A B, B, B,, AC;C,C,, AD, D, D;}. 

We observe that 4,C,, = A,C,, =|p — (s — a,)|. The circle with center A, passing through C,, 
and C,, will be denoted by y, (i = 1,2,3). The radical axes of the pairs {I’, y, } and {T, y, } meet 
at C,. Hence C, is on the radical axis of y, and y;, this line being therefore the perpendicular 
from C, onto /,. This means that the radical axis r,; of y, and y3; and the line /B, are 
corresponding lines under the homothecy between AC,C,C; and AB, B,B,. The same holds for 
r,, and JB,, r,, and JB,. The radical center of y,, y, and y; (i.e., the common point C of 13, 7, 
and r,,) therefore corresponds to J. So we have 


THEOREM 6. The radical center C of the triplet {y,,Y, 73} coincides with the center of the 
circumcircle T° of C,C,C, and lies on GI. 


If 6, is the circle with center A; passing through C, and C,,, we can prove in a similar way 


THEOREM 7. The radical center D of the triplet {8,,6,, 6} is the center of the circumcircle A of 
AD, D, D; and lies on GI. 


13. The powers of C and D with respect to the corresponding triplets. The dilatation factors to 
obtain the triangles A,C,,Cy,, C,,C,C,, and C,,C,,C, from AB, B, B, are in this order: 


—({s—a +S —a +S —a 
pr(s—a)_ op yy epts~a _ op yy Pt sma py 


AC,C,C; is therefore obtained from AB, B, B, by the factor 


+r, + 4R + 
Pp, PP, Py Pn tnt), _ PpARt) 
S—-a, S-a, SsS-aA; PF F 

The radius R,. of I’ is therefore 
4R + 
(13.1) p, = Pak +) 


5 


Using B, Bg = 2(s — a,)s,, we obtain furthermore 


(13.2) CiC, = GC, + CuiG -| Poe, 2) Be. __? 


and 


CC -( +1] B, B, = 2re(— 


a3 


The power of C; with respect to y, is therefore 


(13.3) C3At — pp = CC X GCy = ll fn 4 i). 
3 
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where p, stands for the radius of y,. 
Observing that 2C;,C,C = 4a, and applying the law of cosines in ACC, A,, we obtain for the 
power m, of C with respect to y,, in view of (13.2) and (13.3), 


m. = CA, — p? = R2+ GA, -2R,- CA, cosZA,C,C — p? 


= R?-2R,- CA, cosZA,CG,C + el or + 1 


cz \S — a 


- R-Fr + PS(—P— 41), 
Re} Rc? \ 5 — a3 


Using (13.1), we get m, = Ap? + 2Bp + C with 


4a AR Rr (AR r= 487 (ARP) 
s? Rc3(s — a3) s? 5 
It follows that 
(13.4) s°m. = {((4R+r4+2s)pt+r}{(4R+r—2s)pt+rs}. 


This result implies that there exists a value of p such that m, vanishes if and only if 2s > 4R + r, 
namely: 
_ rs 
"2s —4R—47 

If p =o, the circles y,,y, 7, have one point in common: the isoperimetric point P of the 
triangle. In this case we find from (13.1) for the dilatation factor from AB, B, B; into. AC,C,C, the 
value 2s/(2s — 4R — r). The radius of the circle I is therefore in this special case equal to 20 and 
its center coincides with P. 


and the 


4R+ 
The dilatation factor to transform AB,B,B, into AD, D,D, is PRY? —1 


radius R, of A is therefore given by 


p(4R + r) _ 
S 


(13.5) Rp = 


For the power of D with respect to the triplet {6,,6,, 6, } one has 
(13.6) s*my = {((4R+7r4+ 2s) p— rs} {(4R+r—2s)p—rs}. 


It follows from the above that 
ac = (PCRS) +1} Gi and GD - (PGR) -1\ G7. 


As a consequence, the centers of T and A lie symmetric with respect to I. 


14. Additional remarks. 
1. We have for the point Q: 
4, ~ 42 = 4, — a), 92-93 = 43;-— 4, and 4g, - q =a, — ay. 
This is equivalent to 
(14.1) 92+ 93-4 = 43+ 4 —4,= 4, + GQ. — a; 


The shortest way from a point A to a point B is along the straight line AB. If X is a point not 
between A and B, we make a detour of magnitude AX + XB — AB if we walk from A to B via 
X. The equalities (14.1) are obviously the expression for the following property of the point Q: the 
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detour OA, + QA, — A,A, in travelling from A, to A, via Q is the same for all pairs {i, j} taken 
from {1,2,3}. Therefore Q could be named a point of equal detour for AA, A,A,. Denoting this 
detour by 27, we find 


3T = + G7 + G3 — 5S. 
It is easily found (see §7 and §9) that 
(4R + 4r+2s)s 


n+ 2+ ~ “GRY Qs 
and therefore 
rs 
(14.2) "= GRart ds: 


It can be shown that 4R + r + 2s > (6V3 + 9)r and therefore 
t/S < 5 (213 — 3) 


with equality if and only if the triangle is equilateral. 
Contrary to the isoperimetric point, a point of equal detour exists for any triangle. 


2. We see from (13.6) that mp) = 01if p = 7. Therefore by taking p = 7 the triplet { 6,, 6,, 6, } 
has a point in common, namely, the point Q of equal detour. It is the center of the corresponding 
circle A with radius 27. Furthermore (13.6) shows that Q is the only point of equal detour if the 
triangle has an isoperimetric point, because in this case (4R + r — 2s)p — rs < 0 for all p > 0. 

If, however, no isoperimetric point exists, that is, if 2s < 4R + r or—what is the same—the 
largest angle exceeds 2 arcsin4/5, there are two points of equal detour. They are: Q and the point 
P* corresponding to the value 
eo IS 

— 4R+r—2s 
of p, this value now being apart from 7 a zero of the second member of (13.6). The corresponding 


triplet {5,,6,,6,} has P* as the common point, lying on the branch of h, passing through A, 
(k = 1,2,3). The radius of A is in this case 27*. 


T 
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154. MISCELLANEA 


We are so accustomed to treating continuous processes as limits of discrete processes that we 
may tend to forget that this was once a startlingly original idea, put forth as a hypothesis. It is 
called Boscovich’s Hypothesis. [1758] 


—Richard Bellman, Adaptive Control Processes: A Guided Tour, 
Princeton University Press, 1961, p. 80. 


One of the members 
(See p. 578.) 
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B 


This theorem gives some insight into the p-adic expansion. It follows easily from this theorem 
that the p-adic expansion for a is 


(Notice that no terms | enter into the coefficient of x”.) ™ 


ia a 
a= > { » }mod p} p”. 
n=0 P 
From this expression one can obtain formulas for the coefficients of the p-adic expansion for 
a + B in terms of the coefficients of a and B. If a=X%,a,p', B=X~ bp’, then a+ B 
ioe) 
= °c, p' where 
1=0 
n-1 n—-1 
a, + b, 
Cc, =a, + b, + y (5 y mod p. 
s=0 P i=s+l1 Pp 1 


The details for the proof of this formula are left to the interested reader. 


COROLLARY 1. [fn =a) + a,pt+-:: +a,p* is a natural number, then 
S fly _ 
(2”") = (arr {4 moa p. 
MN i=0 a, 


Proof. From the uniqueness of p-adic expansions and the geometric series for 1/(1 — p), it 
follows that 
1 p-li 2 
an y) (l+pt+p + ) 


1 
is the p-adic expansion of — 4. Evaluating | 2 by means of the theorem and combining it with 
n 


—4 — { _ n nal n | 
( . = (=1)"2-2"( 
we obtain the stated result. 


According to Euler’s Theorem, for an odd prime p, 


the formula 


2 L 
=) = 22?—Ymod p. 
(5) 


2 
Using the next corollary we can obtain the evaluation of [*). 


COROLLARY 2. For an odd prime p, 22(?~) = (—1)"~/8mod p. 
Proof. If p = 1mod4, let n = (p — 1)/4 and apply the previous corollary. If p = —1mod4, 
let n = (p + 1)/4 and apply the previous corollary. @ 


Reference 


1. N. Koblitz, P-adic Numbers, P-adic Analysis and Zeta-Functions. Springer-Verlag, 1977. 


ANSWER TO PHOTO ON PAGE 559 


The picture shows Grace Chisholm Young (1868-1944) who with her husband William Henry 
Young (1862-1943) produced fundamental research in general topology, measure theory, and 
functional analysis—and also six children, who include Professor L. C. Young of the University of 
Wisconsin. The picture was taken before her marriage, in fact probably in the mid 1890s when she 
took her doctorate at Géttingen, the first in the new program. 
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L. C. LARSON 
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The following results of the forty-fifth Wiliam Lowell Putnam Mathematical Competition, 
held on December 1, 1984, have been determined in accordance with the governing regulations. 
This annual contest is supported by the William Lowell Putnam Prize Fund for the Promotion of 
Scholarship, left by Mrs. Putnam in memory of her husband, and is held under the auspices of the 
Mathematical Association of America. 

The first prize, five thousand dollars, was awarded to both the Department of Mathematics of 
the University of California, Davis, California, and the Department of Mathematics of Washing- 
ton University, St. Louis, Missouri. The teams from these institutions tied. The members of the 
team from the University of California, Davis, were: John B. Boyland, Robert J. Filippini, and 
Michael P. Quinn. The members of the team from Washington University, St. Louis, were: 
William H. Paulsen, Richard A. Stong, Dougin A. Walker. Each member of these teams was 
awarded a prize of two hundred fifty dollars. 

The third prize, one thousand five hundred dollars, was awarded to the Department of 
Mathematics of Harvard University, Cambridge, Massachusetts. The members of its team were: 
Benji N. Fisher, Howard M. Pollack, and John M. Sullivan; each was awarded a prize of one 
hundred fifty dollars. 

The fourth prize, one thousand dollars, was awarded to the Department of Mathematics of 
Princeton University, Princeton, New Jersey. The members of the team were: Douglas R. 
Davidson, Gregg N. Patruno, and James C. Yeh; each was awarded a prize of one hundred 
dollars. 

The fifth prize, five hundred dollars, was awarded to the Department of Mathematics of Yale 
University, New Haven, Connecticut. The members of its team were: Thomas O. Andrews, 
Nathaniel E. Glasser, and Stephen E. Mark; each was awarded a prize of fifty dollars. 

The five highest-ranking individual contestants, in alphabetical order, were Noam D. Elkies, 
Columbia University; Benji N. Fisher, Harvard University; Daniel W. Johnson, Rose-Hulman 
Institute of Technology; Michael Reid, Harvard University; and Richard A. Stong, Washington 
University, St. Louis. Each of these students was designated a Putnam Fellow by the Mathemati- 
cal Association of America and awarded a prize of five hundred dollars by the Putnam Prize 
Fund. 

The nexts five highest ranking individuals, in alphabetical order, were Leland F. Brown, 
California Institute of Technology; Douglas R. Davidson, Princeton University; Zachary M. 
Franco, Harvard University; John M. Sullivan, Harvard University; and David S. Yuen, 
University of Chicago. Each of these students was awarded a prize of two hundred fifty dollars. 

The following teams, named in alphabetical order, received honorable mention: California 
Institute of Technology, with team members Everett W. Howe, Jung C. Im, and Eric H. 
Kawamoto; University of California, Berkeley, with team members Michael J. McGrath, Jonathan 
E. Shapiro, and Christopher S. Welty; University of Chicago, with team members Eric K. Lossin, 
Keith A. Ramsay, and David S. Yuen; Colorado State University, with team members Curtis D. 
Bennett, Jorg A. Brown, and Mark S. Vincent; and Rose-Hulman Institute of Technology, with 
team members Todd G. Fine, Erich J. Friedman, and Daniel W. Johnson. 

Honorable mention was achieved by the following thirty-five individuals, named in alphabeti- 
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cal order: Michael A. Abramson, Princeton University; Wayne E. Aitken, Brigham Young 
University; Thomas O. Andrews, Yale University; Mark E. Banilower, University of Penn- 
sylvania; Arthur B. Baragar, University of Alberta; William J. Bruno, Massachusetts Institute of 
Technology; Francois Destrempes, Université de Montreal; Art Duval, California Institute of 
Technology; Duane J. Einfeld, Dordt College; Robert J. Filippini, University of California, 
Davis; Leonid Fridman, Harvard University; Nathaniel E. Glasser, Yale University; Michelangelo 
Grigni, Duke University; George E. Homsy, University of California, Berkeley; Liaw Huang, 
Rutgers University, New Brunswick; Douglas S. Jungreis, Harvard University; Joe J. Kilian, 
Massachusetts Institute of Technology; Tsz Mei Ko, The Cooper Union; Rama R. Kocherlakota, 
Princeton University; Chun-Nip Lee, Massachusetts Institute of Technology; Tak Kwan Lee, 
University of California, San Diego; Daniel E. Loeb, California Institute of Technology; Michael 
J. McGrath, University of California, Berkeley; Mark E. Meyer, Indiana University; Lee A. 
Newberg, Massachusetts Institute of Technology; Jeremy D. Primer, Princeton University; 
Michael P. Quinn, University of California, Davis; Keith A. Ramsay, University of Chicago; 
Daniel N. Ropp, Washington University, St. Louis; Alistair M. Rucklidge, University of Toronto; 
David S. Salopek, University of Alberta; Kenneth W. Shirriff, University of Waterloo; Charles 
S. A. Timar, University of Waterloo; Luis G. Valdez-Sanchez, University of Texas, El Paso; and 
Dougin A. Walker, Washington University, St. Louis. 

The other individuals who achieved ranks among the top 102, in alphabetical order of their 
schools, were: Brigham Young University, Christopher P. Grant; California Institute of Technol- 
ogy, Eric K. Babson, William D. Banks, Kent J. Cantwell, Karen L. Condie, William D. Cutrell, 
Jung C. Im, James T. Liu, Julian West, Tad P. White; University of California, Davis, John B. 
Boyland; University of California, Santa Barbara, Emerson S. Fang; Calvin College, Randy V. 
Gritter; Carnegie-Mellon University, Steven G. DesJardins, Jamshid Mahdavi; University of 
Chicago, Geoffrey R. Harris, Rainer Hollerbach, Thomas R. Lippincott, Susan Tolman; Colorado 
State University, Curtis D. Bennett, Jorg A. Brown; Concordia University (G. S. W.), Chinh Mai; 
Cornell University, Denise E. Freed; Harvard University, Glenn D. Ellison; Haverford College, 
Kian-Tat Lim; University of Illinois, Urbana, Mark A. Thompson; University of Kansas, Glenn G. 
Chappell, Ryan D. Moats; Université Laval, Mario M. B. Bergeron; University of Louisville, 
Dung T. Nguyen; Massachusetts Institute of Technology, Avrim L. Blum; Michigan State 
University, Frank Sottile; University of Michigan, Ann Arbor, Steve Newman; University of 
Minnesota, Minneapolis, Jay A. Jorgenson; University of Missouri, Rolla, Ervan E. Darnell; 
University of Nebraska, Lincoln, Bartley E. Goddard; North Dakota State University, Jim B. 
Becker; University of Notre Dame, James R. Roche; Princeton University, Gregg N. Patruno, 
James C. Yeh; Rice University, Garrett T. Biehle; Rutgers University, New Brunswick, Scott E. 
Axelrod; Simon Fraser University, Stuart G. Cowan; Swarthmore College, John H. Palmieri; 
University of Toronto, Gary F. Baumgartner, William J. Rucklidge; University of Victoria, Philip 
H. Spencer; University of Washington, Charles N. Curtis; Washington University, St. Louis, 
William H. Paulsen; University of Waterloo, Yong Yao Du, Alexander T. Kachura; Wichita State 
University, Paul D. Sinclair; Williams College, Martin V. Hildebrand; University of Wisconsin, 
Eau Claire, Kenneth J. Dykema; University of Wisconsin, Milwaukee, Mark W. Hopkins; and 
Yale University, Ramzi R. Khuri, David R. Steinsaltz. 

There were 2149 individual contestants from 350 colleges and universities in Canada and the 
United States in the competition of December 1, 1984. Teams were entered by 264 institutions. 

The Questions Committee for the forty-fifth competition consisted of Melvin Hochster 
(Chairman), Bruce Reznick, and Richard P. Stanley; they composed the problems listed below 
and were most prominent among those suggesting solutions. 


PROBLEMS 
Problem A-1 


Let A be a solid a X b X c rectangular brick in three dimensions, where a,b,c > 0. Let B be the set of all 
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points which are a distance at most one from some point of A (in particular, B contains A). Express the volume of 
B as a polynomial in a, b, and c. 


Problem A-2 
Express D%_, (6% /(3* +1 — 2**1)(3* — 2*)) as a rational number. 
Problem A-3 


Let n be a positive integer. Let a,b, x be real numbers, with a # b, and let M, denote the 2n X 2n matrix 


whose (/, j) entry m,, is given by 
x fis, 
m,={a ifi#jy and i +7 is even, 
b ifi# jy and i+) is odd. 
x b a b 
Thus, for example, M, = ° ; ° A . Express lim, _, ,det M,,/(x — a)?"~? as a polynomial in a, b, and n, 
b a b x 
where det M, denotes the determinant of M, 


Problem A-4 


A convex pentagon P = ABCDE, with vertices labeled consecutively, is inscribed in a circle of radius 1. Find the 
maximum area of P subject to the condition that the chords AC and BD be perpendicular. 


Problem A-5 


Let R be the region consisting of all triples (x, y, z) of nonnegative real numbers satisfying x + y + z < 1. Let 
w= 1-—x-—y-— z. Express the value of the triple integral 


If x y?z8 w4 dx dy dz 
R 


in the form a!tb!c!d!/n!, where a, b, c, d, and n are positive integers. 


Problem A-6. Let 1 be a positive integer, and let f(”) denote the last nonzero digit in the decimal expansion of n!. 
For instance, f(5) = 2. 

(a) Show that if a,,a5,...,a, are distinct nonnegative integers, then f(5“ + 542 + --- +5) depends only on 
the sum ad, + a, +++: +a,. 

(b) Assuming part (a), we can define 


g(s) =f(5% + 5% 4+ +++ +5%), 
where s = a, + a, + +++ +a,. Find the least positive integer p for which 
g(s)=g(s+p), foralls >1, 


or else show that no such p exists. 


Problem B-1 
Let n bea positive integer, and define 
f(n) =1!4 224+ +--+ +nt. 
Find polynomials P(x) and Q(x) such that 
f(n+ 2) = P(n)f(n+1)+ Q(n)f(n), 
for all n > 1. 


Problem B-2. Find the minimum value of 


for0<u<y2 andu>0O. 
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Problem B-3 


Prove or disprove the following statement: If F is a finite set with two or more elements, then there exists a 
binary operation * on F such that for all x, y,z in F, 


(i) x*z=y*z implies x = y (right cancellation holds), 


and 
(ii) x *(y * Zz) # (x * y)*z (no case of associativity holds). 


Problem B-4 


Find, with proof, all real-valued functions y = g(x) defined and continuous on [0, 00), positive on (0, 00), such 
that for all x > 0 the y-coordinate of the centroid of the region 


R, = {(s,t)|O<s<x, O0<t<g(s)} 
is the same as the average value of g on [0, x]. 
Problem B-5 


For each nonnegative integer k, let d(k) denote the number of 1’s in the binary expansion of k (for example, 
d(O) = 0 and d(5) = 2). Let m be a positive integer. Express 


27" —1 
y(n en 
k=0 


in the form (—1)”a/(™(g(m))!, where a is an integer and f and g are polynomials. 
Problem B-6 


A sequence of convex polygons { P,,}, n > 0, is defined inductively as follows. Py is an equilateral triangle with 
sides of length 1. Once P,, has been determined, its sides are trisected; the vertices of P,,, are the interior trisection 
points of the sides of P,. Thus, P,,, is obtained by cutting corners off P,, and P,, has 3 - 2” sides. (P, is a regular 
hexagon with sides of length 1/3.) 

Express lim, _,,, Area(P,,) in the form va /b, where a and 5 are positive integers. 


SOLUTIONS 


In the 12-tuples (11), #9,..., 9, N_,) following each problem number below, n,; for 10 > i > 0 
is the number of students among the top 205 contestants achieving i points for the problem and 
n_, is the number of those not submitting solutions. 


A-1. (146, 3, 33, 0, 0, 0, 0, 0, 21, 1, 1, 0) 
The set B can be partitioned into the following sets: 


(i) A itself, of volume abc; 

(11) two a X b X 1 bricks, two a X c X 1 bricks, and two b X ¢ X 1 bricks, of total volume 
2ab + 2ac + 2be; 

(ii1) fowr quarter-cylinders of length a and radius 1, four quarter-cylinders of length b and 
radius 1, and four quarter-cylinders of length c and radius 1, of total volume (a + b+ c)a; 


(iv) eight spherical sectors, each consisting of one-eighth of a sphere of radius 1, of total volume 
4a /3. 


Hence the volume of B is 


abe + 2(ab + ac + be) + (a+ b+0) +. 


A-2. (73, 3, 12, 1, 0, 11, 3, 17, 2, 1, 16, 66) 


Let S(n) denote the nth partial sum of the given series. Then 
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n 3k Bk +1 ntl 


S(n) = 3k — 9k 7 gk — gktl =3- gut — gnti? 


and the series converges to lim, _, ,,S(”) = 2. 
A-3. (38, 1, 7, 0, 0, 0, 0, 1, 7, 0, 55, 96) 

Let N = M,],_,,. N has rank 2, so that 0 is an eigenvalue of multiplicity 2m — 2. Let e denote 
the 2n X 1 column vector of 1’s. Notice that Ne = n(a + b)e, and therefore n(a + b) is an 
eigenvalue. The trace of N is 2na, and therefore the remaining eigenvalue is 2na — n(a + b) = 
n(a — b). [Note: This corresponds to the eigenvector f, where f,, = (—1)'*', i= 1,...,271] 

The preceding analysis implies that the characteristic equation of N is 

det(N — AIT) = ""7(A — n(a+ b))(A — n(a— b)). 
Let A = a — x. Then 
det M, = det(N —(a — x)I) =(a— x)" “(a—x-—n(at b))(a—x-—n(a-—b)). 
It follows that 


det M,, 
yon? 


lim 
x>a(x—-—a 
A-4. (7, 3, 4, 3, 0, 0, 0, 6, 3, 4, 82, 93) 
Let 6=ArcAB, a=ArcDE, and B=ArcEA. Then ArcCD=a—-—8@ and Arc BC = 
a—a-— B. 


The area of P, in terms of the five triangles from the center of the circle is 


= lim (a —x-—n(a+t+b))\(a-x- n(a— b)) = n*(a* — B*). 


1, 1, 1. 1. 1. 
3 sind + 5 sin( 7 — 8) +3 sina + 5 sin B + 5 sin( 7 — a — B). 


This is maximized when # = 7/2 and a = B = 7/3. Thus, the maximum area is 


1 1 1 y3 1 y3 1 y3 3 
Pita l+g ata a tag Tht gs. 
A-5, (52, 4, 4, 1, 0, 1, 0, 0, 5, 3, 39, 96) 


N | 


For t > 0, let R, be the region consisting of all triples (x, y,z) of nonnegative real numbers 
satisfying x +y+z<t. Let 


I(t) = [ff y(t —~x—y-—z)° dxdydz 
R, 


and make the change of variables x = tu, y = tv, z = tw. We see that I(t) = I(1)t”’. 
Let J = fo°l(t)e ‘dt. Then 


J = fo 1a) Pe~' dt = 1(1) F(26) = 1(1)25!. 
0 
It is also the case that 
J= * of x4 y978(¢ — x — y — 2) dx dydzdt. 
JA yr" y—z) dxdy 
Let s=t—x-—y-—z. Then 


J =f ff fo tee ren exty 284 dx dy dz ds = T(2) T(10) (9) (5) = 11918141. 
0 “0 “0 “0 


The integral we desire is [(1) = J/25! = 1!9!8!41/25!. 
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A-6. (3, 1, 0, 1, 1, 0, 0, 1, 2, 0, 23, 173) 
(a) All congruences are modulo 10. 
LEMMA. f(5n) = 2”f(n). 
Proof. We have 
"1 (5i + 1)(5i + 2)(5i + 3)(5i + 4) 


( *) (Sn)! =10"n! [ | 
1=0 2 

If i is even, then 

1 1 

3 (Si + I)(Si + 2)(5i + 3)(5i + 4) = 5 (1 2-3-4) = 2, 
and if i is odd, then 

1 

5 (5i + 1)(5i + 2)(Si + 3)(S/ + 4) =5(6-7-8-9) =2. 
Thus the entire product above is congruent to 2”. From (*) it is clear that the largest power of 10 
dividing (5n)! is the same as the largest power of 10 dividing 10””!, and the proof follows. 

We now show by induction on 5” + --- +5 that 
f(5% 4fo.e. + 54) = to Fag 


(which depends only on a, + --: +a, as desired). 
This is true for 5% + --- +5¢* = 1, since f(5°) = 2° = 1. 


CASE 1. All a, > 0. By the lemma and induction, 
f(5% foes 4 54) = qa h + vee +5 (5-1 fou. 4+ 54-1) 


= 2k. gm -Dt- (4D (since 2° = 2 for i > 0) 


= 24 to tag 


CASE 2. Some a, = 0, say a, = 0. Now 
(1+ 5m)! =(1 + 5m)(5m)!, 


so f(1 + 5m) = (1 + 5m)f(5m). But f(5m) is even for m > 1 since (5m)! is divisible by a higher 
power of 2 than of 5. But 


(1 + 5m) -(2/) = 2), 

so f(1 + 5m) = f(5m). Letting m = 54271 + --- +571, the proof follows by induction. 

(b) The least p > 1 for which 2°*? = 2? for all s > 1 is p = 4. 
B-1. (179, 9, 6, 0, 0, 0, 0, 1, 0, 0, 4, 6) 

We have 

f(nt+ 2) —f(n4+1) =(n4 2)! =(n4 2)(n 4 1)! =(n 4 2)[ f(n 4+ 1) —f(n)]. 

It follows that we can take P(x) = x + 3 and O(x) = —x — 2. 
B-2. (62, 13, 19, 6, 0, 0, 0, 0, 15, 15, 34, 41) 


The problem asks for the minimum distance between the quarter of the circle x? + y? = 2 in 
the open first quadrant and the half of the hyperbola xy = 9 in that quadrant. Since the tangents 
to the respective curves at (1,1) and (3,3) separate the curves and are both perpendicular to 
x = y, the minimum distance is 8. 
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B-3. (93, 16, 1, 1, 0, 0, 0, 1, 2, 5, 33, 53) 
The statement is true. Let ¢ be any byection on F with no fixed points, and set x * y = $(x). 
B-4. (2, 11, 2, 8, 0, 1, 0, 25, 4, 7, 48, 97) 


Such a function must satisfy 


[se dt Loe 
2 ely at 
falda *% 
0 
or equivalently, 
x] 5 _i1 x 2 
i 58 (1) ae =] fa) a , 
Let z(x) = fig(t) dt. Then z’(x) = g(x) and we have 
x] 2 _ z° 
i 5(2) d=—, x>0. 


Differentiating, we have 


9) 2 ? xX > 0, 
x 

x*(z’)” — 4xzz’ + 222 =0, x > 0, 

(xz’ — r,z)( xz’ — 4z) = 0, x > 0, 


where r, = 2 + V2 and r, =2 - y2. 
Now x, z’, and z are continuous and z > 0, so the last equation implies | 
r=r, or r =1r,. Separating variables, we have z’/z = r/x and it follows tl 


Inz=rilnx+ G, 
or equivalently, z = C,x’, C, > 0. Differentiating, we have z’ = g(x) = Cx’"/, C > 0. But g is 
continuous on [0, 00) and therefore we cannot have r = r, (because r, — 1 = 1 — V2 < 0). Thus 
g(x) = Cxtt V2 C> 0, 
and one can check that such g(x) do satisfy all the conditions of the problem. 
B-5. (5, 1, 2, 0, 0, 3, 0, 4, 9, 11, 17, 153) 


Define 


D(x) =(1 —x)(1 — x?)(1 — x4) --- (1 — x?” ”). 
Since binary expansions are unique, each monomial x* (0 < k < 2” — 1) appears exactly once in 
the expansion of D(x), with coefficient (— 1)“. That is, 


2"—-1 


D(x) = Y (-1)4 x. 


k=0 


Applying the operator [x d 


4 to D(x) m times, we obtain 


on 


(x=) PO) = TF (1) mek, 


k=0 
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so that 
(x£)" D()| y(n en. 
* x=1  K=0 
Define F(x) = D(x + 1), so that 


(2) oo] [een] 


x= 


But 
F(x) = Th -G+0" | = TI [-2x + 0()], (x > 0), 


= (—1)"2(m—D/2 ym 4 O(x™*}), 
and by observing that [(x + 1)d/dx]x” = nx” + nx"—', we see that 


IC» + "(ax + O(x™*")) =m!A + O(x). 


So 


x= 


(x + 4)" r9| ; =(—1)"2""—-D2m! + O(x)] 9 =(-D 72" PA m!. 


B-6. (7, 3, 3, 0, 0, 0, 0, 0, 6, 1, 48, 137) 


Suppose that # and v are consecutive edges in P,. Then u/3, (u+v)/3, and v/3 are 
consecutive edges in P,,,. Further, 


Lym By _ 

2 || 3 3 18 
is removed at this corner in making P,,,. But at the next step, the amount from these three 
consecutive edges is 


lu X d|| 


1 ut+vD 1l|at+ié ¥ o 

2 9 2\| 9 9 

Thus, the amount removed in the (k + 1)st snip is 2/9 times the amount removed in the kth. 
Note that one-third of the original area is removed at the first step. Thus, the amount removed 


altogether is 


1 


u _ti uXovD 


s[l +(2/9) +(2/9) +--+ | = > . = = 


of the original area. Since the original area is 73 /4, we have 


4 3 ¥3 


jim Area F, = a 

The curve in this problem has been studied extensively by Georges de Rham. (See “Un peu de 
mathématiques a propos d’une courbe plane,” Elem. Math., 2 (1947), 73-76, 89-97; “Sur une 
courbe plane,” J. Math. Pures Appl., 35 (1956), 25-42; and “Sur les courbes limites de polygones 
obtenus par trisection,” Enseign. Math., 5 (1959), 29-43.) Among de Rham’s results are the 
following. The limiting curve is C’ with zero curvature almost everywhere, but every subarc 
contains points where the curvature is infinite. Consequently, the curve is nowhere analytic. De 
Rham parametrizes pieces of the curve so that the tangent vector is intimately related to the 
Minkowski ?-function. If the construction is repeated, but with each edge divided in the ratio 
(1/4, 1/2, 1/4) rather than (1/3, 1/3, 1/3), then the resulting limit curve is analytic, consisting 
of piecewise parabolic arcs. 
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HOW DOES A COMPLETE GRAPH SPLIT INTO BIPARTITE GRAPHS AND HOW ARE 
NEIGHBORLY CUBES ARRANGED? 


JOSEPH ZAKS 
Department of Mathematics, University of Haifa, Haifa 31999, Israel 


The complete graph K, has n vertices and every pair of vertices is joined by an edge. The 
complete bipartite graph K,, ,, has n vertices of one type and m vertices of another type, and it 
has mn edges, joining every vertex of one type to every vertex of the other type. 

We are interested in splitting the edge-set of K,, into parts, so that each part will consist of the 
full edge-set of some complete bipartite subgraph of K,,. The equation 


(1) K,, = ))x,,K,., 


will mean that K,, is decomposed into x,, copies of complete bipartite subgraphs K, ,, 
i,j >1,i+j <n and x,, are nonnegative integers. 

A theorem of Graham & Pollak [1] (for shorter proofs, see [5] and [3]) states that K, can be 
decomposed into not fewer than n — 1 complete bipartite subgraphs; this theorem was extended 
recently by Reznick, Tiwari & West [4]. We restate this result. 


where 


THEOREM 1. K, = Lx,,K, , implies that 
(2) Lx, >n-1. 
By counting the total number of edges, we easily derive (see also [6]) 


THEOREM 2. K,, = Lx,,K,, implies that 


(3) Lax, = (5). 


We raise the following open question. 


PROBLEM. Given n and a set of nonnegative integers {x, |i, 7 >1, i+ j <n}, satisfying the 
equation (3) and the inequality (2), when is there a decomposition of K,, of the form (1)? 


The conditions (2) and (3) on n and the x,, are far from being sufficient: If K,,, ,, is imbedded 
in K,,,, the remaining graph is the disjoint union of two copies of K,,; therefore K,,, has no 


decomposition of the form K,, ,, + K,, + +++, whereat+b>m+1. 
A more interesting example is the following. 


CLAIM. Ky, # 6K, + 2K, (remark that the right-hand side has at least as many components 
as required and as many edges as needed.). 


Proof. All the vertices of K, and of K,, have even valence (number of edges having that 
vertex as an end-vertex); if Ky = 6K,, + 2K, , then the pairs of vertices of the two copies of 
K, 4, which are the only vertices of odd valence in the decomposition, must coincide; therefore 
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2K, is actually equal toa K,,. Hence K, = 6K,, + K,., which contradicts Theorem 1. 
More generally, 
Ky6m+9 F (16m + 6) Ky gma. + 2Ko.6m43 forall m > 0. 
Let us consider next the d-cube 
Q= {(x,...,x,)}-l<x,<1 foralli,l<i<d} 


in the Euclidean d-space E%. A translation of the dcube Q is a set of the form v + Q = 
{u+yly © Q}. Let F= {Q,,Q),...} be a collection of translations of the d-cube Q; F is 
called neighborly (see [6] and the many references there) if every two members of F intersect in a 
(d — 1)-dimensional set; 1.e., if every two members of F are non-overlapping, yet they share a 
(d — 1)-dimensional part of a facet of each. | 

There is a connexion between the decompositions of K,, into complete bipartite graphs and 
neighborly families of translations of the d-cube in E“%. 


THEOREM 3. There is a 1-1 correspondence between the decompositions of K,, into d complete 
bipartite graphs and the combinatorial type of neighborly families of n translations of the d-cube in 
E* 


Proof. If QO, = (y1,---,¥7) + Q and Q, = (2,...,2Z7) + Q are two translations of Q which 
are neighborly, then Q, M Q, is a (d — 1)-dimensional set which is convex; therefore Q, N Q, 
lies in a hyperplane which is parallel to one of the major hyperplanes in E“, given by the equation 
x, = 0, for some i,1 <i < d; the following then holds 


(4) Iz, -y,=2, and |z,-y|<2 forall j#i, l<j<d. 


In fact, condition (4) for some i is a necessary and sufficient condition for two translations 
(Vi5-++5¥y) + Q and (2,...,27) + Q to be neighborly. 

Suppose F = {Q,,...,Q,,} is a neighborly family of translations of the d-cube Q; Q, = 
(Vais-++s¥a) + Q for all i,1 <i<n. Consider the n X d matrix Y =(y,,), formed by the 
centres of the cubes. By the neighborliness of F and condition (4) we get 


(5) For every two rows i and k of Y, there is a unique column ; of Y such that 
IY, — Yel = 2 and |y,, — Yee] < 2 for all t#j,1<t<d. 


Construct a graph G, which might look as if it has multiple edges, as follows: G has the 
numbers 1,2,...,” for its n vertices, in 1-1 correspondence with the rows of Y; two vertices i and 
k of G are joined by an edge whenever there is a column j in Y for which |y,, — y,,| = 2. It 
follows from (5) that there are no multiple edges in G (because of the uniqueness of /), and in fact 
G is the complete graph K,, (because every i and k are connected by an edge). 

Consider an arbitrary column of Y, say the jth column, and the edges which it alone 
contributes to K,,. If max, ,|y,, — y,,| < 2, then the jth column does not contribute any edges, 
and the corresponding complete bipartite subgraph is taken to be empty. If 


max, 4|¥,; a Vx, | = 2, 
we let 
A,={tly,, = min,y,} and B = { tly, = max,y,}. 


,» one edge for every pair of indices (a, b) where 
a © A, and b € B;; in fact, all of these edges form a complete bipartite subgraph of K,,, which we 
denote by K A,B," 
It follows that K,, = 421K 4B is the decomposition of K,, which is associated with F. 
Suppose, on the other hand, that a decomposition of K,, into d complete bipartite subgraphs is 
given; to simplify the notation, let the vertex set of K,, be {1,2,...,n} and let K, = L7_,K A,B, 


The jth column contributes |A,| -|B;| edges to K 


n°? 
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where 4,,B Cc {1,2,...,n} and A, B = ©. Form the n X d matrix Z = (z,;) by defining 
0 ifie€4, 

Zi; = 2 fie B, 
1 iff €A,UB. 


The corresponding neighborly family of n translations of the d-cube Q in E% is obtained by 
using the rows of Z for centres; i.e., the ith cube in the family is given by (z,,, Z,.,.--,Z,¢) + Q; 
1 <i <n. The neighborliness follows easily from (4). 

This completes the proof of Theorem 3. 
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To illustrate the correspondence mentioned in Theorem 3, we have in Fig. 1 the four 
translations of a 3-cube with centres (0,0, 0), (0,2, 0), (0,1,2) and (2,1,1) in a neighborly family, 
related to the decomposition K, = K,, + K,,+ K,3; Fig. 2 shows it for K, = K,, + K,.+ 
K, 5, with centres (1, 0, 0), (1, 2, 0), (0, 1, 2) and (2, 1, 2). 

It follows that a complete characterization of the decompositions of K, into d complete 
bipartite subgraphs is equivalent to a complete characterization of the combinatorial types of 
neighborly families of n translations of the d-cube in E%. 

J. Kasem [2], a student of M. Perles, has recently been successful in finding small collections of 
neighborly translations of the d-cube in E“, which are maximal in the sense that they cannot be 
enlarged; he did it by treating some specific types of decompositions of K, into complete 
bipartite subgraphs. 

We close with the following: 


COROLLARY. The maximum number of neighborly translations of the d-cube in E% is d + 1, for 
all d. 


Proof. By Theorems 1 and 3, if there is a neighborly family of translations of the d-cube in 
E*%, then d > n — 1; hence n < d+ 1; thus at most d + 1 translations of the d-cube in E% can be 
neighborly. 


The construction of a neighborly family of d + 1 translations of the d-cube in E% can be easily 
done inductively, or via the decomposition of K,,, into any d complete bipartite graphs, for 
example K,,, = L%,K, ,. 
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SOLUTION OF POLYNOMIALS BY REAL RADICALS* 


J. M. Isaacs 
Department of Mathematics, University of Wisconsin, Madison, WI 53706 


Let F be a field and let f € F[x] be a polynomial. Recall that f is said to be solvable by 
radicals if f splits over some extension field EF D F for which there exists a chain of fields 


F=KhCKC:::CKR=E 


such that F = F_,[a,] for1 <i < k, where a, has some power lying in F\_,. Such an extension 
field F is called a repeated radical extension of F. 

By the work of Galois, we know that (in characteristic zero) every polynomial of degree < 4 is 
solvable by radicals, and that over suitable fields (the rational numbers Q, for instance), there 
exist, for every integer n > 5, some irreducible polynomials which are solvable by radicals and 
some which are not. 

Let us now limit the discussion to the case F = @. What is the situation with respect to 
solvability by radicals if we require that we work entirely inside the real numbers R? We shall say 
that a € R is a real radical element if it hes in some repeated radical extension E of Q@ with 
E CR. In other words, a is constructible from the rationals by some combination of multiplica- 
tions, additions and extractions of real nth roots. 

A polynomial f € Q[x] is solvable by real radicals if all of its (complex) roots are real radical 
elements. As we shall see, a polynomial which is solvable by radicals is rarely solvable by real 
radicals even given that all of its roots are real. More surprisingly, we have the following: 


THEOREM. Suppose f © Q[x] is an irreducible polynomial which splits over R. (In other words, 
its roots are all real.) If f has any root which is a real radical element, then the degree of f is a power 
of 2 and the Galois group of f over @ is a 2-group. 


The fact that irreducible cubics are never solvable by real radicals is classical. This phenome- 
non is sometimes referred to as the “casus irreducibilis”. (See the discussion of this in [2].) 


*Research partially supported by an N.S.F. grant. 
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A striking consequence of the theorem is the following: 


COROLLARY. Let f be as in the theorem. Then all of the roots of f lie in a repeated square-root 
extension of Q and thus they are constructible with compass and straightedge. 


Proof. The Galois group, because it is a 2-group, necessarily has a chain of subgroups, each of 
index 2 in the preceding, which begins with the whole group and ends with the trivial subgroup. 
By the fundamental theorem of Galois theory, the corresponding fields form a tower of quadratic 
(and hence square-root) extensions, starting with @ and ending with a splitting field for f. @ 


The key step in the proof of the theorem is the following result in which we use the notation 
|: F| to denote the degree of the field extension F C L. 


PROPOSITION. Let FC E be subfields of R and assume that E = Fla] with a" € F. If 
FCLC E with L Galois over F, then |L: F| < 2. 


The proofs of the proposition and the theorem make heavy use of the theorem on “natural 
irrationalities” (Theorem 29 of [1]). We state this here for the convenience of the reader after a 
brief review of the relevant notation. If E and L are subfields of some common field, their 
compositum «FE, L is the (unique) smallest subfield containing both FE and L. It is most simply 
defined as the intersection of all subfields which contain the two given fields. For fields F C E, we 
write Gal(E/F) to denote the associated Galois group. 


THEOREM (Natural irrationalities). Let F, E, and L be subfields of some field (1 and suppose 
E D> F is a Galois extension. Let K = (E,L), the compositum. Then K is Galois over L and 
Gal( K/L) = Gal( E/E  L). In particular, |K: L| =|E: EO L|. (See Fig. 1.) @ 


K 
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COROLLARY 1. In the above situation, assume |L: F| < 00. Then |K: EJ) =|L: EO Li. 
Proof. Divide both sides of the equation |K: L| = |E£: EM L| into the number |K: EO L|. @ 


A somewhat more interesting corollary permits us to draw another line in the natural 
irrationalities lattice diagram. 


COROLLARY 2. Let F, L, E and K be as before with E Galois over F and K = (E, L). Suppose 
LOMCK. ThnM=(L,EQM). 


The point here is that by drawing the line segment down from M, parallel to KE, we define a 
new node, identified with the field EM M. The resulting diagram, if interpreted as a lattice 
diagram, would imply M = (L, EMM). Corollary 2 asserts that this is a valid inference. Of 
course, the diagram is not a proof. (See Fig. 2.) 
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FIG. 2 


Proof of Corollary 2. Let M, = (L, EM). Itis clear that M, C M, EQN M, = EM and 
(M,,E) = K=(M,E). Applying natural irrationalities with M in place of L yields that 
|K: M| =|E: EO M|and similarly, |K: M,| = |E: EO M,|. We conclude that |K: M| = |K: M,|, 
and since M, C M, we have M, = M as desired. & 


We will also need the following lemma which is probably well known. 


LEMMA. Let FC E be fields with E = F[a] where a" € F, and suppose F contains a primitive 
nth root of unity. Let |E: F| = m. Then 


a” € F. 


Proof. Let f € F[x] be the minimal polynomial of a over F so that deg(f) =m. Write 
a” =a © F. Then f divides x” — a in F[x] and so all the roots of f in a splitting field are nth 
roots of a. Each root, therefore, has the form a6 where 6 is some nth root of unity. It follows 
that the constant term of f is equal to +a”e, where ¢e is the product of all the 6’s and so is an 
nth root of unity. Thus e € F and we conclude that a” € F as required. 


Proof of the proposition. Adjoin a complex primitive nth root of unity to each of F and E to 
obtain the fields F* and E*, respectively. Note that E* = (F*,E) and let K = (F*,L) and 
M = EC K so that we have the lattice diagram as shown. (See Fig. 3.) 

Since F* is a splitting field for the polynomial x” — 1 over F, we see that F* D F is Galois 
and in fact Gal( F* /F) is abelian. By Corollary 2, M = (L, F* \ M) and we observe that L is 
Galois over F by hypothesis and F* 1 M is Galois over F by the fundamental theorem of Galois 
theory, since Gal( F* /F) is abelian and so Gal( F* /F* \ M) is a normal subgroup. 

Since each of the fields L and F* \ M is Galois over F, it follows that any F-automorphism 
of any field which contains (L, F* 1M) leaves each of them (setwise) invariant. Since the 
compositum (which is equal to M) is uniquely determined, it follows that it too is invariant under 
all these automorphisms and therefore M is normal, and hence Galois over F. 

By natural irrationalities applied to the situation (F*, M) = K with F* Galois over F, we 
conclude that K is Galois over M. We can therefore apply natural irrrationalities to the situation 
(K, E) = E* and we conclude from Corollary 1 that |E*: K| =|E: M|. 

Now write m = |E*: K|. Since E* = K[a], a” € K and K contains a primitive nth root of 
unity, we conclude from the lemma that a” © K. Since, of course, a” © E, we see that a” © M. 
We claim, in fact, that M = Fla’. 

To see this, write M, = F[a™”] and note that E = M,[a] and a is a root of the polynomial 
x™ — a” € M,[x]. It follows that |E: M,| < m. However, M, C M and |E: M| = |E*: K| =m, 
and it follows that M, = M, as claimed. 

Write B = a”. The situation now is that FC MCR, M= F[B], B" © F and M is Galois 
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over F. Let o € Gal(M/F). Then 

(B")" = (B")" = 
and so B° = Bd, where 5 € M is some nth root of unity. Since M C R, however, 6 = +1 and 
thus B° = +8 for every element o € Gal(M/F). Since M = F[B], o is determined by its action 


on £ and it follows that |Gal(M/F)| < 2. 
We now have 
|\L: F| <|M: F| =|Gal(M/F)| < 2._ 

Proof of the theorem. Suppose f © Q[x] is irreducible and let S C R be a splitting field for f. 
Let G = Gal(S/Q). Our goal is to show that G is a 2-group. Since deg f divides |S: Q| = |G|, this 
will yield that deg f is a power of 2, as required. 

Suppose that G is not a 2-group and choose an odd prime divisor p of |G|. Let N C G be the 
subgroup generated by all subgroups P C G with |P| = p. Since this collection of subgroups is 
permuted by conjugation by elements of G, it follows that N is normal in G. Also, by Cauchy’s 
theorem, there do exist subgroups P C G of order p, and hence N > 1. 

We are given the real radical element a with f(a) = 0, and so a € S and we write T = Q[a]. 
Let H = Gal(S/T), so that H C G is the subgroup corresponding to 7 via the fundamental 
theorem of Galois theory. We claim that N cannot be contained in H. To see this, let 
L = Fix(N), which is the field corresponding to N. Then L is normal over Q since N<G. Also, 
L < § because N > 1. If N C H, then L D T and so a € L. Because f is irreducible and L is 
normal over @, we conclude that f splits over L and this is a contradiction since L < S. 

We have established that N ¢ H and thus P ¢ H for some subgroup P C G of order p. Let 
F = Fix(P). Since P ¢ H, we have F p T and so a €& F. 

By our assumption on a, there exists a tower of fields 


Q=@%cCc9C::' C9, CR 
with a€ Q,, and QO, = Q,_,[y,] for some element y, with a power in Q,_,. Construct a new 
tower of fields starting with F by setting Fj = F and F = F__,[y,]. Then Q, C F CR for all i 


and, in particular, a € F. Since a € fj, we can choose an integer r > 0 so that a € F_, and 
ace F. 
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Now |S: F| = |P| = p, a prime, and thus F is a maximal subfield of S. Also, 
FCF_,NASCENSCS. 


Since a € F._, OS and a € F.™M S, we conclude that F= F_, 1S and F.1 S =S so that 
SCF. 

Write E = (S,F_,). Then F_, C EC FE. Also, S is Galois over @ and so natural irrational- 
ities applies and we conclude that |F: F._,|=|S: F| = p and E is Galois over F._,. However, 
F._,[y,] = -& CR and some power of y, lies in F._,. Our proposition now applies and yields 
p =|E: F._,| < 2. This contradiction proves the theorem. (See Fig. 4.) ml 


F 


r 


F 
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We close with the observation that solvable polynomials with real roots but which are not 
solvable by real radicals seem to abound. For example, for any prime p, the polynomial 


f(x) =x° — 2px + p 
has this property. (Note that it is irreducible by the Eisenstein criterion and it has three real roots 


since f(0) > 0 and f(1) < 0.) It is amusing to solve this polynomial by Cardan’s method to see 
where nonreal numbers come in. 
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A THEOREM IN COMBINATORICS AND WILSON’S THEOREM 


HARAGAURI N. GUPTA 
Department of Mathematics and Statistics, University of Regina, Regina, Canada S4S 0A 2 


The purpose of this note is to state and prove a theorem in combinatorics and deduce from it 
the famous Wilson’s theorem as a special case. 


THEOREM 1. Let B(n,r), 1 <r<n, denote the number of distributions of n distinct objects 
among r ordered boxes, none remaining empty. Then, 

(i) ifn is prime, then n divides B(n,r) for all r > 2, 

(ii) ifn is prime, then n divides B(n — 1,r)+(—-1))’, forallr, l<r<n-—1. 


Proof. We shall not require the explicit formula for B(n,7r), but will make do with the 
following identities I and II, both of which can be established solely by combinatorial arguments: 
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taining social science applications and review exercises. JNC 


Calculus, T*(13: 2). Mathematics with Applications to Business, Economics, and Social Sciences. 
Richard Bouldin. Saunders College, 1985, xvii + 765 pp. [ISBN: 0-03-062164-X] Written in the same 
style as Bouldin’s Calculus (see previous review), this text contains all the material from Calculus 
except trigonometric functions. Also includes chapters on systems of equations and matrices, linear 
programming, mathematics of money, sets and counting, probability. JNC 
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Real Analysis, P. Lecture Notes in Mathematics-1089: Measure Theory, Oberwolfach 1983. Ed: D. 
Kolzow, D. Maharam-Stone. Springer-Verlag, 1984, xiii + 327 pp, $10 (P). [ISBN: 0-387-13874-9] 
Proceedings of a conference held at Oberwolfach, West Germany. Thirty papers and a problem section 


are included. PZ 


Complex Analysis, $(18), P. Lectures on the Theory of Functions of Several Complex Variables. B. 
Malgrange. Springer-Verlag, 1984, 128 pp, $7.10 (P). [ISBN: 0-387-12875-1] A re-issue of the 
author’s well-known Tata Institute lecture notes from 1958. A few exercises are included. The 
bibliography is limited and, naturally, dated. PZ 


Differential Equations, S*(16-18), P. Seminar on Nonlinear Partial Differential Equations. Ed: S$.S. 
Chern. Math. Sci. Res. Inst. Pub., V. 2. Springer-Verlag, 1984, 373 pp, $24. [ISBN: 0-387-96079- 
1] Eighteen well written research-expository essays whose topics vary widely in scope and level of 
technicality, for instance: P.D. Lax on shock waves; R. Melrose on propagation of singularities; 
J.C. Polking on removable singularities; and A. Weinstein on plasma physics. Every reader with some 


affinity for the subject should find in this collection a few fascinating papers. YN 


Differential Equations, P. Singularities and Groups in Bifurcation Theory, Volume I. Martin Golu- 
bitsky, David G. Schaeffer. Appl. Math. Sci., No. 51. Springer-Verlag, 1985, xvii + 463 pp, $38. 
[ISBN: 0-387-90999-0] Applies techniques from singularity theory, especially unfolding theory and 
classification theory, to bifurcation problems. Suitable for applied scientists as well as for 
mathematicians. Attractively illustrated; many examples and exercises; case studies. Second volume 
will concern bifurcation problems in several state variables. DA 


Differential Equations, T*(14-15: 1, 2). Elementary Differential Equations with Applications. C.H. 
Edwards, Jr., David E. Penney. Prentice-Hall, 1985, xvii + 664 pp, $32.95. [ISBN: 0-13-254129-7-01] 
Very attractive, carefully-written but readable, student-friendly treatment of standard fare. Flex- 
ibility permits use with students with different backgrounds and in courses with different emphases. 
Linear algebra needed for systems is complete and self-contained. BASIC programs for numerical 
methods require little prior programming experience. Numerous concrete realistic worked-out exam- 
ples. Exercises and problems range in difficulty from routine to challenging. Good illustrations. 
JK 


Differential Equations, T(16: 1), S, L*. Linear Turning Point Theory. Wolfgang Wasow. Appl. Math. 
Sci., No. 54. Springer-Verlag, 1985, ix + 246 pp, $38. [ISBN: 0-387-96046-5] "Linear turning 
points" are singularities, in the complex plane, of linear analytic ordinary differential equations. 
Begins with an historical account of the subject. Then demonstrates the classical and modern 
methods for studying the behavior of solutions around singularities: asymptotic power series and 
expansions, parameter shearing, doubly asymptotic expansions, etc. Ends with an appendix on holo- 
morphic similarities of holomorphic matrices to Jordan’s and Arnold’s forms. Without exercises, but 
well written. YN 


Partial Differential Equations, P. Navier-Stokes Equations: Theory and Numerical Analysis, Third 
(Revised) Edition. Roger Temam. Stud. in Math. & Its Applic., V. 2. Elsevier Sci, 1984, xii + 526 
pp, $35 (P); $84.50. [ISBN: 0-444-85308-1; 0-444-85307-3] First treats the linearized stationary 
case, then the nonlinear stationary case, finally the full nonlinear time-dependent case. Contains 
an updating of the bibliography of earlier editions and a description of some directions in which 
there has been recent progress. (First Edition, TR, December 1978; Revised Edition, TR, November 
1981) DA ras 


Functional Analysis, T(15-17: 2), L. Applied Functional Analysis. D.H. Griffel. Ser. in Math. & 
Its Applic. Halsted Pr, 1981, 386 pp, $27.95 (P). [ISBN: 0-470-27196-5] A comprehensive view of 
functional analysis, covering and proving most of the well-known results in distribution theory and 
Green’s functions, Banach spaces and fixed point theorems, and operators in Hilbert space. Defini- 
tions are well illustrated with examples, theorems are carefully motivated, and proofs are detailed 
and readable. MU 


Analysis, T(18: 2), P, L. Topology and Analysis: The Atiyah-Singer Index Formula and Gauge-Theoretic 
Physics. B. Booss, D.D. Bleecker. Transl: D.D. Bleecker, A. Mader. Universitext. Springer- 
Verlag, 1985, xvi + 451 pp, $34 (P). [ISBN: 0-387-96112-7] A well-motivated introduction to the 
Atiyah-Singer Index theorem replete with historical comments, alternative routes (cobordism, imbed- 
ding, heat equation), helpful sketches, extensive references, and wide-ranging connections to many 
parts of mathematics and physics. Includes necessary introductions to operators, manifolds, K- 
theory, and differential equations. A well done thorough survey of the "deepest theorem" in modern 
analysis. LAS 


Analysis, T(17-18: 1, 2), S, P. A Treatise on Generating Functions. H.M. Srivastava, H.L. Manocha. 
Ser. in Math. & Its Applic. Halsted Pr, 1984, 569 pp, $89.95. [ISBN: 0-470-20010-3] A systemic 
introduction to "methods of obtaining linear, bilinear, bilateral or mixed multilateral generating 
functions for a fairly wide variety of sequences of special functions." FLW 


Algebraic Geometry, T(18: 2), P*. Geometry of Algebraic Curves, Volume I. E. Arbarello, et al. 
Grund. der math. Wissenschaften, B. 267. Springer-Verlag, 1985, xvi + 386 pp, $34. [ISBN: 0-387- 
90997-4] Contains much advanced material on algebraic curves on compact Riemann surfaces. The 
material goes much beyond the standard results like the Riemann-Roch theorem. Some topics are 
varieties of special divisors and the geometry of the theta divisor. LN 
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Differential Geometry, P. Differential Geometry and Complex Analysis. Ed: I. Chavel, H.M. Farkas. 
Springer-Verlag, 1985, xiii + 222 pp, $32. [ISBN: 0-387-13543-X] A collection of papers in differen- 
tial geometry and complex analysis, in memory of H.E. Rauch. Three historical papers survey and 
assess Rauch’s contributions to differential geometry, function theory, and theta functions. PZ 


Optimization, $(17-18), P. Minimization Methods for Non-Differentiable Functions. N.Z. Shor. 
Transl: K.C. Kiwiel, A. Ruszczyfski. Ser. in Comput. Math., V. 3. Springer-Verlag, 1985, viii + 
162 pp, $30. [ISBN: 0-387-12763-1] Theoretical treatment of methods for minimization of non-smooth 
functions, using generalized gradient sets, accelerated subgradient methods based on space dila- 
tions. Applications to important mathematical programming problems. RM 


Optimization, T(14-16), S. Linear Programming with BASIC and FORTRAN. Carvel S. Wolfe. Reston, 
1985, x + 293 pp, (P). [ISBN: 0-8359-4082-9] Readable yet complete introduction to linear and 
integer programming. Simplex method, duality, parametric and cutting plane techniques, matrix 
games, transportation problems. Nice examples, many exercises. Discusses subroutines for pivoting. 
Updated version of 1973 FORTRAN edition. (TR, August-September 1973) RM 


Control Theory, T*(16: 1), S*, P*, L*. Linear Multivariable Control: A Geometric Approach, Third 
Edition. W. Murray Wonham. Applic. of Math., No. 10. Springer-Verlag, 1985, xvi + 334 pp, $42. 
[ISBN: 0-387-96071-6] Mathematical treatment of controllability, observability, tracking, distur- 
bance decoupling, and quadratic optimization, preceded by a review of prerequisites in abstract 
linear algebra. Uses linear algebra abundantly, and successfully, to avoid cumbersome matrix compu- 
tations. Written carefully enough to be understandable by senior undergraduates in mathematics or 
engineering; includes exercises. Appropriate choice for an applied course that nicely pulls 
together linear algebra, ordinary differential equations, holomorphic functions, and Laplace 
transforms. YN 


Probability, P. Lecture Notes in Mathematics-l1095: Stochastic Analysis and Applications. Ed: A. 
Truman, D. Williams. Springer-Verlag, 1984, v + 199 pp, $11 (P). [ISBN: 0-387-13891-9] Proceedings 
of the International Conference held in Swansea, Wales, in April 1983. Contains 13 papers, most 
from the conference, the rest more recent research results by the Swansea school. Applications are 


mainly in areas related to mathematical physics. RSK 


Probability, T(18: 1, 2), S, P. Markov Chains, Revised Edition. D. Revuz. Math. Lib., V. ll. 
Elsevier Sci, 1984, xi + 374 pp, $57.75. [ISBN: 0-444-86400-8] This new edition contains material on 
zero~one laws and sub-additive ergodic theory. Assumes knowledge of measure theoretic probability, 
point set topology, topological groups, and Fourier transforms (First Edition, TR, February 1976). 
FLW 


Statistics, T(18: 1), P. Muitivariate Calculation: Use of the Continuous Groups. Roger H. Farrell. 
Ser. in Stat. Springer-Verlag, 1985, xvi + 376 pp, $42. [ISBN: 0-387-96049-X] Expansion of the 
author’s 1976 paperback, #520 in the Springer-Verlag Lecture Notes in Mathematics series, entitled 
Techniques of Multivariate Calculation (TR, May 1977). Integrated treatment of various mathematical 
techniques for finding probability density functions, together with required background material on 
topics such as null sets, exterior algebra, symmetric functions, and consequences of the uniqueness 
theorem for Haar measures. RSK 


Statistics, P. Multivariate Analysis: Methods and Applications. William R. Dillon, Matthew Gold- 
stein. Wiley, 1984, xii + 587 pp, $37.95. [ISBN: 0-471-08317-8] In the Wiley Series in Probability 
and Mathematical Statistics. Non-theoretical but thorough presentation, illustrated with real data, 
designed for users who want more than a "cookbook." Includes chapters on cross-classified categori- 
cal data, latent structure analysis, and linear structural relations, in addition to the more stan- 
dard topics. RSK 


Statistics, T(13-14: 1, 2), S. Statistics & Probability & Their Applications. Patrick Brockett, 
Arnold Levine. Saunders College, 1984, xvi + 576 pp. [ISBN: 0-03-053406-2] Presupposes only high 
school mathematics. The usual topics plus more emphasis on data collection, and survey question- 
naire design. FLW 
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Ed: Theo K. Dijkstra. Springer-Verlag, 1984, v + 129 pp, $9.50 (P). [ISBN: 0-387-13893-5] Proceed- 
ings of a Workshop held in Groningen, The Netherlands, December 15-16, 1983. Contains eight papers 
covering a wide range of topics. RSK 


Statistics, $. An Introduction to Linear Regression and Correlation, Second Edition. Allen L. 
Edwards. WH Freeman, 1984, xv + 206 pp, $19.95 (P). [ISBN: 0-7167-1594-5] Claims to require only a 
“working knowledge of elementary algebra." Discusses equations for lines, bivariate regression, 
several sorts of correlation coefficients and tests for them, multiple regression, matrix algebra, 
dummy variables, and orthogonal polynomials. The examples are from psychology. FLW 


Statistics, $(15-18), P*, L. W.G. Cochran’s Impact on Statistics. Ed: Poduri $.R.S. Rao, Joseph 
Sedransk. Wiley, 1984, xvii + 431 pp, $41.95. [ISBN: 0-471-09912-0] In the Wiley Series in Proba- 
bility and Mathematical Statistics. Interesting collection of 20 articles, written by former stu- 
dents, colleagues and friends, examining Cochran’s great influence on both theoretical and applied 
Statistics. Topics discussed are sampling and observational studies, public health and biometrics, 
categorical data, analysis of variance and design of experiments, variance components, some philo- 
sophical issues, and theoretical developments. Valuable reading for those interested in the history 
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of mathematical statistics. RSK 


Statistics, P*. Jack Carl Kiefer, Collected Papers. Ed: Lawrence D. Brown, et al. Springer-Verlag, 
1985. I: Statistical Inference and Probability (1951-1963), xxxi + 502 pp, $42 [ISBN: 0-387-96003- 
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Ill: Design of Experiments, xxiv + 718 pp, $39. [ISBN: 0-387-96004-X] Includes all of the more than 
100 papers Kiefer (1924-1981) wrote, together with brief commentaries on many of them (particularly 
in the area of design of experiments). Also contains some short articles on his life and work, and 
a complete bibliography. RSK 


Computer Literacy, S, P, L. Starting with UNIX. P.J. Brown. Addison-Wesley, 1984, xiv + 221 pp, 
$12.95 (P). [ISBN: 0-201-10924-7] An excellent, well-written introduction for beginners, moving from 
UNIX principles (files, directories, pipes, shell) to specific commands and applications: shell, 
editing ("ed", "grep"), mail, errors, document preparation, programming (including an introduction 
to C). LAS 


Computer Literacy, $(13). The Secret Guide to Computers. Russ Walter. Birkhauser Boston, 1984, 349 
pp, $14.95 (P). [ISBN: 0-8176-3190-9] A chaotic compendium of detail for computer-phobic users of 
personal computers, emphasizing common problems, simple BASIC, and popular jargon. Very well writ- 
ten, but the unimaginative type script pages are very difficult to scan and read. LAS 


Computer Programming, $(13-18), P. Mastering Apple BASIC: II Plus, Ile, Iic. John J. DiElsi, Elaine 
S. Grossman, John P. Tucciarone. Apple Programming Ser. Holt, Rinehart & Winston, 1984, xiii + 454 
pp. [ISBN: 0-03-071173-8] Text is concise and to the point. Contains many programs to illustrate 
various techniques. Includes several sorting routines and iteration structures. A good all-purpose 
reference. TR 


Computer Programming, $(16-17), P*, L. Foundations of Logic Programming. J.W. Lloyd. Symbolic Com- 
putation. Springer-Verlag, 1984, x + 124 pp, $17. [ISBN: 0-387-13299-6] The mathematical theory for 
using logic as a programming language. Introduces logic programming. Establishes results on fixed 
points, resolution, completeness, independence, negation and concurrency. Presumes logic, PROLOG, 
induction and metric spaces. RWN 


Computer Science, $(16-17), P. The Evolution of Programs. Nachum Dershowitz. Progress in Comp. 
Sci., No. 5. Birkhauser Boston, 1983, 357 pp, $16.95 (P); $24.95. [ISBN: 0-8176-3156-9] Formal 
methods for manipulating programs for debugging, modification to meet new specifications, abstrac-— 
tion to produce program schemata. Synthesis rules for generating code from specifications, annota- 
tion rules to generate assertions and facilitate analysis. Goal to include semi-automated tools in 
programming environments. RM 


Computer Science, &, P. Fundamentals of Human-Computer Interaction. Ed: Andrew Monk. Comp. & Peo- 
ple Ser. Academic Pr, 1984, xvii + 293 pp, $26.50. [ISBN: 0-12-504580-8] Recently there has been a 
great deal of research in studying the interface between a human and a computer system. The point 
of this research has been to learn how to maximize the productivity of the user while minimizing the 
likelihood of making errors. The areas of research have included sereen characteristics, use of 
speech and graphics interfaces, design of command languages, and physical workstation layout. This 
text summarizes the current research work and results which have been obtained in this field. MS 
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pp, $30. [ISBN: 0-262-08143-1] ACM Distinguished Doctoral Dissertation. Deals with measurement of 
visual motion and perceptual studies of motion. FA 
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Theoretical Computer Science. Ed: Mathai Joseph, Rudrapatna Shyamasundar. Springer-Verlag, 1984, 
viii + 468 pp, $19.50 (P). [ISBN: 0-387-13883-8] Proceedings of a December 1984 conference, the 
fourth in a series on the same topic, held at Bangalore, India. LAS 


Computer Science, S(18). Algorithm-Structured Computer Arrays and Networks: Architectures and 
Processes for Images, Percepts, Models, Information. Leonard Uhr. Comp. Sci. & Appl. Math. 
Academic Pr, 1984, xxiii + 414 pp, $34.50. [ISBN: 0-12-706960-7] A wide-ranging survey of algo- 
rithmic and architectural issues involved in the design of multi-computer systems, networks, paral- 
lel processors, arrays, and VLSI. Emphasizes applications that are intrinsically two- and three- 
dimensional such as image processing, meteorological research, and modelling the human brain. LAS 


Computer Science, T(15-17: 1), S, L. Introductory Theory of Computer Science. E.V. Krishnamurthy. 
Springer-Verlag, 1984, xii + 211 pp, $15 (P). [ISBN: 0-387-91255-X] Thorough yet informal introduc- 
tion to theoretical computer science with emphasis on relationships between theory and practice. 
Algorithms, machines, recursive functions and computability, grammars and production rules, program 
correctness, complexity. Too few exercises. RM 


Computer Science, $(18). Eurographics Tutorials “83. Ed: Paul J.W. ten Hagen. Springer-Verlag, 
1984, 425 pp, $34.50. [ISBN: 0-387-13644-4] Introduces computer graphics, covering interactive tech- 
niques and information on graphical techniques. Describes a graphics standard (GKS, Graphical Ker- 
nel System) as well as experiences in its implementation. Three-dimensional models are covered 
using GKS. FA 
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Applications (Artificial Intelligence), P*. Catalogue of Artificial Intelligence Tools. Ed: Alan 
Bundy. Symbolic Computation. Springer-Verlag, 1984, xxv + 150 pp, $17 (P). [ISBN; 0-387-13938-9] 
A listing of 264 artificial intelligence portable software packages. Most entries include a "tele- 
graphic" review, references and information on availability. Well indexed and cross-referenced. A 
more thorough edition is accessible on-line in the UK. RWN 
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Intelligence. Ed: Patrick H. Winston, Karen A. Prendergast. MIT Pr, 1984, 324 pp, $15.95. [ISBN: 
0~262-23117-4] Edited transcripts of lectures and discussions at a 1983 MIT conference on the com- 
mercial potential of artificial intelligence. Academic researchers, financial supporters, indus- 
trial R&D and problem solvers discuss expert systems, robotics, intelligent programming environments 
and industrial applications. RM 


Applications (Biology), $(16-18), P. An Introduction to the Application of Nonnegative Matrices to 
Biological Systems. Joan M. Geramita, Norman J. Pullman. Papers in Pure & Appl. Math., No. 68. 
Queen’s U, 1984, vi + 96 pp, (P). An informal presentation (without proofs) of some of the linear 
algebra and matrix techniques used in applications to biological systems. The discussion is con- 
fined to nonnegative transition matrices, considering the alternatives of nilpotence or the 
existence of positive characteristic values. There are separate chapters for steady matrices and 
cyclic matrices together with a subsequent chapter on stochastic matrices. Annotated bibliography, 
citation index. JS 


Applications (Engineering), P, L. Computational Methods for Fluid Flow. Roger Peyret, Thomas D, 
Taylor. Ser. in Comp. Physics. Springer-Verlag, 1983, x + 358 pp, $22 (P). [ISBN: 0-387-13851-X] 
Paperback edition of the 1983 original edition (TR, March 1983). A state-of-the-art survey, in 
three parts: fundamental numerical methods, incompressible flows, and compressible flows. LAS 


Applications (Physics), P. J.C. Maxwell, The Sesquicentennial Symposium: New Vistas in Mathematics, 
Science and Technology. M.S. Berger. Elsevier Sci, 1984, xiv + 279 pp, $55. [ISBN: 0-444-86707-4] 
Papers on nonlinear phenomena, cosmology, geometry, and Maxwells equations by such experts as Free- 
man Dyson, Roger Penrose, Chen Ning Yang, and Irving Segal, presented at an October 1981 symposium 
at the University of Massachusetts at Amherst in honor of the 150th anniversary of the birth of 
James Clark Maxwell. LAS 


Applications (Physics), T(17-18: 1, 2), P*. Non-Linear Elastic Deformations. R.W. Ogden. Ser. in 
Math. & Its Applic. Halsted Pr, 1984, xv + 532 pp, $95. [ISBN: 0-470-27508-1] Research monograph 
with problems. Suitable for a graduate course. Applications of the theory of non-linear elasticity 
to the solution of boundary-value problems and the analysis of the mechanical properties of solid 
materials capable of large elastic deformations in a purely isothermal setting without reference to 
thermodynamics. Some previously unpublished results and new approaches to old problems. References 
throughout. JK 


Applications (Physics), P. University of Strathclyde Seminars in Applied Mathematical Analysis: 
Classical Scattering. Ed: G.F. Roach. Math. Ser., No. 9. Shiva (US Distr: Birkhauser Boston), 
1984, vi + 160 pp, £8.50 (P). [ISBN: 0-906812-99-2] Proceedings of a September 1984 international 
conference held at the University of Strathclyde, Scotland. 15 papers on mathematical physics of 
scattering phenomena are included. PZ 


Harald Fritzsch. Transl: Jean Steinberg. Basic Books, 1984, x + 307 pp, $19.95. [ISBN: 0-465- 
01446-1] A popular yet authoritative account of the origin and progress of the universe, set in a 
serious context of human values. Physicist Fritzsch explains the structure of matter as a way of 
probing the "how" and the "why" of the Big Bang. A lucid, inviting exposition of contemporary 
cosmology. LAS 


Applications (Physics), T(18: 1), S, P. The Mathematical Theory of Turbulence. M.M. Stanibié. 
Universitext. Springer-Verlag, 1985, xvi + 429 pp, $29 (P). [ISBN: 0-387-96107~0] This text is com- 
posed of two parts: the first deals with the classical approach to turbulence in conjunction with 
the semi-empirical methods of Prandtl, T.I. Tayor and von Karm4an. A generalized law concerning eddy 
viscosity is developed in detail, the equations of the turbulent boundary layer are discussed, and a 
new approach’ to their solution is indicated. Part two deals with statistical theories of tur- 
bulence. MU 
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RB: Richard Brown, Carleton; JNC: Judith N. Cederberg, St. Olaf; CEC: Clifton E. Corzatt, St. Olaf; 
JD-B: John Dyer-Bennet, Carleton; JRG: Jennifer R. Galovich, St. Olaf; SG: Steven Galovich, Carle- 
ton; BH: Bruce Hanson, St. Olaf; PH: Paul Humke, St. Olaf; RSK: Richard S. Kleber, St. Olaf; JK: 
Joseph Konhauser, Macalester; LCL: Loren C. Larson, St. Olaf; RM: Richard Molnar, Macalester; RWN: 
Richard W. Nau, Carleton; LN: Linda Ness, Carleton; YN: Yves Nievergelt, St. Olaf; AO: Arnold 
Ostebee, St. Olaf; TR: Teresa Reardon, St. Olaf; AWR: A. Wayne Roberts, Macalester; MS: Michael 
Schneider, Macalester; JS: John Schue, Macalester; SS: Seymour Schuster, Carleton; JAS: J. Arthur 
Seebach, Jr., St. Olaf; KS: Kay Smith, St. Olaf; LAS: Lynn Arthur Steen, St. Olaf; MT: Michael 
Tveite, St. Olaf; MU: Milton Ulmer, Carleton; TAV: Theodore A. Vessey, St. Olaf; MW: Martha Wallace, 
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An asterisk (*) by the title of a paper indicates that copies of the paper are available from 
the author. Papers presented under special sponsorship as part of joint meetings are so noted in 
parentheses, 


Allegheny Mountain Section 


The spring meeting of the Allegheny Mountain Section was held at the campus of the University 
of Pittsburgh at Johnstown on April 26-27, 1985. 


Invited Addresses: 


"The Covering Radius of Codes," by Neil J.A. Sloan, AT&T Bell Laboratories. 

"Discrete Mathematics in the Curriculum," by Martha Siegel, Towson State University. 

"Principles of Microcomputer Graphics," by Roy E. Myers, Pennsylvania State University at New 
Kensington. 

"The Bieberbach Conjecture: Solution of a Seventy Year Old Problem," by William E. Kirwan, Univer- 
sity of Maryland. 


Short Presentations: 


"A Non-trivial Example of Uncorrelated Dependent Variables," by Douglas H. Frank, Indiana Univer- 
sity of Pennsylvania. 

"The Maximal Planar Subgraph of a Graph," by Frank X. Hiergeist, West Virginia University. 

"Convergence Is Good, Divergence Is Bad. Or Is It?" by Richard McDermot, Allegheny College. 

“Raise High the Root Beam Carpenter and Trigonometric Factoring-~An Application,” by James E. 
Hall, Westminster College. 

"Packing Points in an Equilateral Triangle," by Ron Harrell, Allegheny College. 


Student Papers: 


"Game Theory Models for Evolution," by Dennis McDermot, Allegheny College. 

"Solving Certain Systems of Differential Equations Using Symbolic Programming," by Kevin Treu, 
Allegheny College. 

"Nonstandard Analysis: How and Why," by Chris Oliver, Allegheny College. 

"A Discriminating Journey Through the Land of Curves of Constant Width," by Richard Fenrich, 
Allegheny College. 


Iowa Section 


The seventy-second annual meeting of the Iowa Section was held on April 12~13, 1985 at Drake 
University at Des Moines, Iowa. There were 72 registrants. 


Invited Addresses: 


"Here We Go Again: Another Challenge to the Simplex Method," by A.M. Fink, Iowa State University. 

"The Bieberbach Conjecture: Some History and Related Problems," by Elgin Johnston, Lowa State 
University. 

"Integral Equations of the First Kind and I1ll-posed Problems," by Charles Groetsch, University of 
Cincinnati. 


Short Presentations: 


"Bell’s Inequality and Einstein’s Conception of Reality," by Alan Macdonald, Luther College. 

"Numerical Solution of Second Kind Integral Equations," by Glenn Lueke, Lowa State University. 

"The Numerical Evaluation of Particular Solutions for Poisson’s Equation," by Kendall Atkinson, 
Undversity of Iowa. 

"Numerical Solution of the Heat Equation by the Method of Heat Potentials," by Daniel Willis, 
University of Iowa. 

"Weighted Finite Elements for the Stamp Problem," by A. Bogomolny, University of Iowa. 

"Affine Invariant Conditions for the Monotone Convergence of Newton’s Method and Applications," by 
Florian Potra, University of Iowa. 

"Endospectral Graphs," by Milan Randic and Alex Kleiner, Drake University. 

"Factoring the Characteristic Polynomial," by Bernie Baker, Alex Kleiner, and Milan Randic, Drake 
University. 

"Catalan Numbers and the Pascal Triangle," by Milan Randic, Bernie Baker, and Patsy Milton, Drake 
University. , 

"Preliminary Model to Improve Estimation of Prevalence of Malaria," by M. Anne Dow, Maharishi 
International University. 

"Universal Objects: A Unifying Structure in Mathematics," by Catherine Gorini Wadsworth, Maharishi 
International University. 

"Progress Report--High School Teacher Certification in Mathematics and Computer Science in Iowa," 
by Lynn Olson, Wartburg College; Marilyn Zweng, University of Iowa. 

"The MAA Team Project," by Thomas Iverson, Central College. 


C84 Center Section October 


Student Papers: 


"On Spectral Properties of Graphs," by Ruth Gornet, Drake University. 

"Is There a Time When One Is More Likely To Have a Heart Attack?" by Steven Reeves, Luther Col- 
lege. 

"On Factoring the Characteristic Polynomial," by Nick Street, Drake University. 

"Rubik’s Cube: Tactile Group Theory," by Thomas Rosholt, Luther College. 

"On the Characteristic Polynomials of Graphs with Bridges," by Scott Rothfus, Drake University. 

"An Algorithm for Generating Permutations with Applications to High School Courses in Computer 
Simulation or Finite Mathematics," by Janice Mansheim, Emmanuel Maou, and Phyllis Schindler, 
Iowa Wesleyan College. 


Panel Discussion: 


"Discrete vs. Continuous Mathematics in the Undergraduate Curriculum," by Donald Alton, University 
of Iowa; Larry Naylor, Drake University; Zorabi Honargohar, Morningside College; Lynn Olson, 
Wartburg College. 


Maryland-D.C.-Virginia Section 


The spring meeting of the Maryland-D.C.-Virginia Section was held at Hollins College in 
Roanoke, Virginia on April 25, 1985. 135 persons attended the meeting. 


Invited Addresses: 
"How to Factor a Number," by Carl Pomerance, University of Georgia. 
Short Presentations: 


"True BASIC: A New Programming Language for the IBM," by Howard Penn, U.S. Naval Academy. 

"A Property of the Unit Digits of the Fibonacci Sequence," by Herta Freitag, Hollins College. 

"An Illustration of Computer Learning," by Bob Lewand, Goucher College. 

"A Geometric Treatment of the Concept of Means," by William Sanders, James Madison University. 

"Partial Fractions: Classical, Efficient Method--Now Complete," by Donald R. Peeples, Mary Wash- 
ington College. 

"The Mathematical Competition of Modeling--One School’s Experience," by Alice T. Williams, South- 
ern Seminary Junior College. 

"My Ten Years Last Summer at IFRICS (Institute for Retraining in Computer Science)," by Frederic 
Gooding, Goucher College. 

"Results of the 1985 Math Competition in Modeling," by Ben Fusaro, Salisbury State. 

"Groups of Order q’,p," by James Alonso, University of the District of Columbia. 

"Estimating Tornada Windspeed Probabilities: Empirical, Theoretical, and Subjective Perspectives," 
by Bob Abbey, Office of Naval Research; Mary Kay Abbey, Montgomery College. 

"Geometry’s Cinderella," by C.J. Maloney, National Institute of Health. 

"On Fermat d-Pseudopriges ,",by Lawrence Somer, George Washington University. 

"Integer Solution of c” = a” + b” — 2ab(p/q) and Nice Law of Cosine Problems," by Wilbur Hilde- 
brand, Montgomery College. 

"Pell-like Equations," by William Wardlaw, U.S. Naval Academy. 

"Independent Weightings in the Ball Weighing/Fake Coin Problem," by Ray Hancock, Emory and Henry 
College. 

"A Non-elegant Way of Expressing the nth Power of the Golden Ratio," by Herta T. Freitag, Hollins 
College. 


Student Papers: 


"Probabilistic Analysis of a Numerical Method for Finding Zeros of a Function," by Arthur Benja- 
min, Johns Hopkins University. 

"The Experience and the Product of a COMAP Winner--Modelling in Animal Populations," by John Kent, 
Dan McCaffey, Mike Caulfield, and John August, Mount Saint Mary’s College. 

"Some Result$ on the Period and the Restricted Period of the Fibonacci Series Mod-P," by Anna 
Werner, American University. 

"The Independence of Certain Ascoli Theorems," by David A. Lamb, George Mason University. 

"Some New Results Regarding Primitive Roots Modulo a Prime," by Steve Bonner, American University. 


Workshop: 


"Writing Mathematics Programs for the IBM-PC," by David Schneider, University of Maryland. 


Southeastern Section 


The sixth-fourth annual spring meeting of the Southeastern Section met at Wake Forest Univer- 
sity, Winston-Salem, North Carolina on April 12-13, 1985. 326 persons attended the meeting. 
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Invited Lectures: 


"Solutions of an Inverse Problem for Fractals and Other Sets," by Michael Barnsley, Georgia Insti- 
tute of Technology. 

"Geometry--A Lot”’s New and a Lot”’s Renewed," by John Kenelly, Clemson University. 

"Computing and the Changing Nature of College Mathematics," by Donald L. Kreider, Dartmouth Col- 
lege. 


Short Presentations: 


"Conditionally Convergent Integrals of Convolution Products," by Jonathan Lewin, Kennesaw College. 

"Sequence of Iterates for Quasi-nonexpansive Mappings in Locally Convex Spaces," by K.L. Singh, 
Fayetteville State University. 

"On Certain Commutation Relations in a Universal Enveloping Algebra," by Kwok Chi Wong, Fayette- 
ville State University. 

"Fixed and Common Fixed Points in Convex Metric Spaces," by Larry Harris, Fayetteville State 
University. 

"A Byte of Simpson’s Apple," by Jerry Currence, University of South Carolina at Lancaster. 

"Software Tools for Education," by Chantal M. Shafroth and James M. Shoaf, North Carolina Central 
University. 

"Continuous Mappings in Computer Graphics," by Lyndell Kerley, East Tennessee State University. 

"Investigating Pseudoconics: An Opportunity for Students to Explore and Discover New Mathematics," 
by F. Alexander Norman, University of Georgia. 

"On Solving Radical Equations," by Bill Bompart, Augusta College. 

"Partial Fractions: Classical, Efficient Method Now Complete," by Donald R. Peeples, Mary Washing- 
ton College. 

"An Elementary Proof of Descartes” Rule of Signs," by Curtis Herink, Mercer University. 

"A Modest Experiment in Mathematics Education," by David A. Smith, Duke University and Benedict 
College. 

"A Sparse, Banded Matrix: What Is Its Null Space?" by Steve Howard, Wake Forest University. 

“Weakly Closed Functions and Hausdorff Spaces," by David A. Rose, Francis Marion College. 

“What Every Statistician Should (But Probably Doesn’t) Know About Levels of Measurement," by R.W. 
Hoyer, James Madison University. 

"Some Popular Explanation of Slow-fast Cycles, Non-standard Models, and Duck Hunting Theorems," by 
Vadim Komkov, Winthrop College. 

"Report on Usage of MAA Multi-media Applied Mathematics (TEAM)," by Gil Proctor, Clemson Univer- 
sity. 

“Constructing the Good Multiple Choice Item: Easier Said Than Done," by Linda Boyd, DeKalb Commun- 
ity College. 

"College Board Achievement Tests in Mathematics," by Rose C. Hamm, The College of Charleston. 

"What°s Happening in Mathematics Education," by Virginia Joyner, East Carolina University. 

"Mathematics Education in the USSR--A Very Recent Up-date (March 1985)," by James G. Ware, Univer- 
sity of Tennessee at Chattanooga. 

"Approximating Polygons for Space Filling Curves," by Hans Sagan, North Carolina State University. 

"On the Number of Honest Permutations," by Albert F. Gilman, III, and Joseph B. Klerlein, Western 
Carolina University. 

"An Application of the Calculus of Finite Differences to Pascal’s Triangle," by Benjamin G. Klein, 
Davidson College. 

"Error in Padé Approximation," by Edward J. Allen, University of North Carolina at Asheville. 

"Some Improper Integrals are Only Teasing," by John V. Baxley, Wake Forest University. 

"Differentiability: A Paradoxically Sufficient Condition," by Irl C. Bivens, Davidson College. 

"Basic Orlicz Space Theory," by Arnold S. Goldstein, Savannah, Georgia. 

"Divisibility by Primes," by Reginald Mazeres, Tennessee Technological University. 

"A Geometric Portrait for the Parametric Representation of the Pythagorean Triples," by Alvin Tir- 
man, East Tennessee State University. 

"Squareful Numbers in Arithmetical Progressions," by Richard L. Robinson, Wofford College. 

"The Remaining Continued Fractions in Ramanujan’s Second Notebook," by Robert L. Lamphere, Francis 
Marion College. 


Student Bapers: 
"On Cryptography and the Factorization of Large Integers," by Sebrina Bragdon, Spelman College. 
"The MOG: A House of Bricks," by Julie Morrisett, Davidson College. 
North Central Section 


The spring meeting of the North Central Section was held at the College of St. Benedict, St. 
Joseph, Minnesota on April 26-27, 1985. There were 104 registrants. 


Invited Addresses: 
"The Mathematical Sciences: Role and Expectations by the Scientific and Technical Communities," by 


Ettore Infante, University of Minnesota. 
"Renewing Collegiate Mathematics," by Lynn A. Steen, St. Olaf College. 
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Short Presentations: 


* "An Elementary Derivation of qT OO cos(x/2") = sin(x)/x," by James Harper, St. Cloud State 
University. 
* "Viscous Flow on a Rotating Disk," by Jeffrey Rosoff, Gustavus Adolphus College. 
"ICME-V," by Francis Hatfield, Mankato State University. 

* "Generalized Circular Functions," by Douglas Dunham, University of Minnesota at Duluth. 
"Teaching of Experiential Applied Mathematics," by Michael Tangredi, College of St. Benedict. 
"Simpson’s Paradox and the Open Mapping Theorem," by John Kemper, College of St. Thomas. 

* "Existence of Self-Reproducing Programs," by Gary Rockswold, Mankato State University. 
"Regarding the Bieberbach Conjecture," by Paul Zorn, St. Olaf College. 


Illinois Section 


The sixty-fourth annual meeting of the Illinois Section convened at the University of Illinois, 
Champaign-Urbana on May 3-4, 1985 with approximately sixty members in attendance. 


Invited Addresses: 


"The Algebraic Structure of Non-standard Models of Arithmetic," by Angus Macintyre, Yale Univer- 
sity. 

"Mathematics Curriculum of a Comprehensive Two-year College," by Dale Ewen, Parkland College. 

"The Teaching and Learning of Mathematics in U.S. High Schools From an International Point of 
View," by Ken Travers, University of Illinois. 

"PME, RCDPM, and ICME-International Mathematics Conferences in Australia, August 1984," by Carole 
Bauer, Triton College. 

"Computer Software for College Algebra and Calculus," by Larry Dornhoff, University of Illinois. 

"The Role of Discrete Mathematics in the First Two Years of the Undergraduate Program," by Al 
Otto, Illinois State University. 

"M,C. Escher as Pioneer Mathematician," by Doris Schattschneider, Moravian College. 

"The B.A. in Mathematics--What Should It Signify?" by Heini Halberstam, University of Illinois. 


Mini-Course: 
"Computer Graphics," by Joan Wyzkoski, Bradley University. 
Professor Jon Laible of Eastern Illinois University was presented the Distinguished Service 
Award of the Section. 
Kentucky Section 


The sixty~eighth annual meeting of the Kentucky Section was held on April 19-20, 1985 at 
Eastern Kentucky University at Richmond, Kentucky. There were 109 registrants. 


Invited Addresses: 


"Some Observations on Mathematics and Mathematicians," by Ivan Niven, University of Oregon. 
"Some Unexpected Results in Elementary Mathematics," by Ivan Niven, University of Oregon. 
"N. Karmarkar’s Linear Programming Algorithm," by Jim Burke, University of Oregon. 


Short Presentations: 


"The Cutting Edge: Boundaries of Manifolds," by Scott Metcalf, Eastern Kentucky University. 

"General Discussion on an Introductory Statistics Course: What Topics Should Be Included?" by 
Frank Dietrich and James McKenney, Northern Kentucky University. 

"A Unique Analysis Course for Secondary Teachers Getting a Master”’s Education," by Austin French, 
Georgetown College. 

"A Fast Algorithm for Approximating Polynomial Roots," by Chris Christensen, Northern Kentucky 
University. 


Short Course: 


"Computer Graphics," by Gail Wells, Northern Kentucky University. 


Qhio Section 
The University of Akron at Akron, Ohio was the setting for the annual spring meeting of the 
Ohio Section held on April 12-13, 1985. There were 146 registrants plus approximately 50 students 
in attendance. 


Invited Addresses: 


"Symbolic Computation on Modern Computers," by Paul Wang, Kent State University. 
"The Embeddability of Graphs," by Phil Huneke, Ohio State University. 
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"Engineering Needs and the College Mathematics Core," by Isaac Greber, Case Western Reserve 
University. 

"Three P’s for Teaching Mathematics," by Jim Leitzel, Ohio State University. 

"EMPT Update for Department Chairs," by Bert Waits, Ohio State University. 


Contributed Papers: 


"Status Report on the Revision of Teacher Certification Rules," by Alan Osborne, Ohio State 
University. 

"Transformations Helping in Geometry," by Kenneth Cummins, Kent State University. 

"Ratios of Generalized Fibonacci Sequences," by Tom Dence, Ashland College. 

"Electronics for Computer Science," by Dean Brown, Youngstown State University. 

"OAC Basketball Schedule Problem," by Neil Bernstein, Marietta College. 

"Contest Problems from the 1985 AHME," by Dick Horwath, John Carroll University. 

"Handicapped Matrices," by Russ Smucker, Muskingum College. 

"Parabolic Blending Yields Piecewise Cubics for Interpolating Curves," by Cliff Long, Bowling 
Green State University. 


Special Session: 


Panel on Scientific Computing, by Tom Price, University of Akron (Moderator). 

"A Computer Algorithm for the Tsirelson Space Norm," by Johnnie Baker, Kent State University. 

"Providing a Complex Number Environment for MACSYMA," by Oberta A. Slotterbeck, Hiram College. 

"Piecewise Linear Approximation of Implicitly Defined Manifolds," by P.H. Schmidt, University of 
Akron. 

"Turning the Corner on Parallel Processing," by Douglas R. Smith, Goodyear Aerospace Corporation. 


Student Papers: 


"Computer Generation of Finite Maximal Connected Topologies," by Tom Batchik, University of Akron. 

"Create Your Own Geometry," by David Cameron, Miami University. 

"Counting the Infinite,” by Sheldon Degenhardt, Wittenberg University. 

"Accelerating Archimedes” Method for Approximating Pi," by Raymond E. Flanery, Jr., Youngstown 
State University. 

"Interest Rate Determination: The Impact of Federal Deficits," by Timothy Fuerst, Ohio Northern 
University. 

"Algebraic Properties of Magic Squares," by Jim Gerbic, John Carroll University. 

"A First Look at C(X)," by David Goloff, University of Dayton. 

"Modeling: Vented Loudspeaker Design," by Mike Grinkemeyer, Miami University. 

"Motion of a Satellite," by Linda Hudson, University of Akron. 

"An Implementation of an Elementary Cryptographic Technique Using Pascal," by Jeffrey N. Jones, 
Baldwin Wallace College. 

"Column Nim," by Stephen Kurzhals, Xavier University. 

"Keeping the User Friendly," by Paul Mullins, Youngstown State University. 

"A Software Conversion Problem," by Mary Beth Pretorius, Wittenberg University. 

"Fourier Analysis and Its Role in Resonance Technologies," by Arun Ranchod, College of Wooster. 

"Conversion of ATS6/GEOS3 SST DATA to Free Air Gravity Anomalies Over the Southeast Pacific," by 
Terry J. Sabaka, University of Akron. 

"A Special Class of Quadratic Diophantine Equations," by John Smith, John Carroll University. 

"Polygonal Residue Patterns," by Karen Trageser, Miami University. 

"Figurate Numbers," by Theresa Wiencek, Youngstown State University. 


Nebraska Section 


Hastings College, Hastings, Nebraska was the location of the annual meeting of the Nebraska 
Section held on April 12-13, 1985. There were 29 registrants from 9 colleges in Nebraska and 
Southeastern South Dakota. 


Invited Addfesses: 
"Statistics and the Law," by Mary Gray, American University. 
Short Presentations: 


* "Report on the High School Mathematics Examination," by Richard Barlow, Kearney State College. 

"Triangles with Concurrent Assorted Medians, Angle Bisectors, and Altitudes," by Dale M. Mesner, 
University of Nebraska at Lincoln. 

"How Not to Tile Rooms," by Leo G. Chouinard, University of Nebraska at Lincoln. 

"Use of Spread Sheets as a Mathematical Tool," by Dale W. Behrens, Hastings College. 

"An Existence Theorem for Differential Equations," by Allan Peterson, University of Nebraska at 
Lincoln. 

"Some Good Classroom Problems," by Alexander Mehaffey, Jr., University of South Dakota. 

"Continuous Functions Which Are the Composition of Retractions for Finite T)-Spaces ," by Mel 
Thornton, University of Nebraska at Lincoln. 

"dy/dx = -x”/(xty): A Visual Odyssey," by Thomas Shores, University of Nebraska at Lincoln. 

"Report on TEAM Project," by Albert Zechmann, University of Nebraska at Lincoln. 
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Student Paper: 
"Mathematical Anxiety in Arab Students Inside Israel," by Nimer Baya’a, American University. 
Panel: 

"Changes in the Mathematics Curricula," by Roger Olson, Dana College; Randall Heckman, Kearney 
State College; Dick Vogt, Nebraska Wesleyan University; David Logan, University of Nebraska at 
Lincoln; John Konvalina, University of Nebraska at Omaha; Wallace A. Raab, University of South 
Dakota. 


Missouri Section 


The annual spring meeting of the Missouri Section was held April 12-13, 1985 at Central Mis~ 
souri State University, Warrensburg, Missouri. There were 77 registrants. 


Invited Addresses: 
"Mathematics--Structure and Use; Or, What Are We Learning This For?" by Raymond Freese, St. Louis 
University. 


"Bits and Pieces from the Classroom," by Leonard Gillman, University of Texas at Austin. 


Short Presentations: 


"Problem Solving and Problem Solvers," by Sherralyn Craven, Central Missouri State University. 

"Applying SAS to Solve Problems in Estimating Ordinary Least Squares Regression," by Baird Brock, 
Central Missouri State University. 

"A Two-Person Coin Flipping Problem," by Curtis Cooper and Robert E. Kennedy, Central Missouri 
State University. 

"An Investigation of the Natural Density of {x: x is a factor of £(x)}," by Robert E. Kennedy and 
Curtis Cooper, Central Missouri State University. 

"How to Make Mathematics Interesting in the Classroom," by Peter G. Casazza, University of Mis- 
souri at Columbia. 

"Square Free Number Cycles," by Dale Woods, Central State University (Oklahoma)/Northeast Missouri 
State University. 

"Non-commutative Lattices," by Jonathan Leech, Missouri Western State College. 

"Random Normed Structures," by Troy Hicks, University of Missouri at Rolla. 


Student Presentations: 


"A Mathematical Model of Human Thought," by Janet C. Tremain, University of Missouri at Columbia. 
"A Graphical “Machine” for the Hyperbolic Functions," by David Rodriguez, Central Missouri State 
University. 


Southwestern Section 


The annual spring meeting of the Southwestern Section was held April 12-13, 1985 jointly with 
the American Mathematical Society and the Sociedad Matematica Mexicana, at the University of Arizona 
in Tucson, Arizona. There were 120 registrants. 


Invited Addresses: 


"Hilbert as a Household Word," by Constance Reid. 

"Some Observations on Mathematics and Mathematicians," by Ivan Niven, University of Oregon. 

"Modern Real Algebra," by Gregory Brumfiel, Stanford University. 

"Representable Functors among Categories of Algebras," by George M. Bergman, University of Cali- 
fornia at Berkeley. 


Short Presentations: 


"The Simplest Quad," by Alvin Swimmer, Arizona State University. 

"Archimedes, Fibonacci, and Pascal: A Rational Search for Roots," by Robert J. Wisner, New Mexico 
State University. 

"On the Metaphysics Which Underlies the Hypotheses Which Underlie Geometry, Part II," by Justin 
McCarthy, Deming, New Mexico. 

"What is the Relation Between Galileo’s Work on the Free Fall and the Teaching of Mathematics?" by 
Alejandro Lopez-Yanez, U. Nacional Autonoma de Mexico. 

"Putting the Ellipse in Perspective," by Charles G. Moore, Northern Arizona University. 


Student Papers: 


"Polynomials with Small Value Set Modulo p," by Javier Gomez-Calderon, University of Arizona. 

"A Method for Solving Linear Differential Equations," by Ray Griesan, University of Arizona. 

"Totally Orthogonal Groups All of Whose Nonlinear Characters Have the Same Degree," by Kwang Shang 
Wang, University of Arizona. 
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Now |S: F| = |P| = p, a prime, and thus F is a maximal subfield of S. Also, 
FCF,ASCEOSCS. 


Since a € F._, N S and a € F.1 S, we conclude that F= F_, MS and F.1 S=S so that 
SC F.. 

Write E = (S, F_,). Then F._, C EC F.. Also, S is Galois over Q and so natural irrational- 
ities applies and we conclude that |E: F._,| =|S: F| =p and E is Galois over F._,. However, 
F._,ly,] = & CR and some power of y, lies in F,_,. Our proposition now applies and yields 


p = |E: F_,| < 2. This contradiction proves the theorem. (See Fig. 4.) m 


F 


r 


F 


Fic. 4 


We close with the observation that solvable polynomials with real roots but which are not 
solvable by real radicals seem to abound. For example, for any prime p, the polynomial 


f(x) =x? — 2px +p 
has this property. (Note that it is irreducible by the Eisenstein criterion and it has three real roots 
since f(0) > 0 and f(1) < 0.) It is amusing to solve this polynomial by Cardan’s method to see 
where nonreal numbers come in. 


References 


E. Artin, Galois Theory, Notre Dame Math. Lectures, 1944. 
B. L 


1. 
2. . van der Waerden, Modern Algebra (vol. 1, 2nd edition), Ungar, New York, 1953. 


A THEOREM IN COMBINATORICS AND WILSON’S THEOREM 


HARAGAURI N. GUPTA 
Department of Mathematics and Statistics, University of Regina, Regina, Canada S4S 0A 2 


The purpose of this note is to state and prove a theorem in combinatorics and deduce from it 
the famous Wilson’s theorem as a special case. 


THEOREM 1. Let B(n,r), 1 <r<n, denote the number of distributions of n distinct objects 
among r ordered boxes, none remaining empty. Then, 

(i) if nis prime, then n divides B(n,r) for all r > 2, 

(ii) if n is prime, then n divides B(n — 1,r) +(—1)’, forallr, l1<r<n—l. 


Proof. We shall not require the explicit formula for B(n,r), but will make do with the 
following identities I and II, both of which can be established solely by combinatorial arguments: 
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I. Bin, r)=r(B(n—1,r)+ B(n—-1,r—-1))), 2<r<n-1. 
I. Ben,r) = Lea 'W("\Ba—k,r-), 2<r<n. 


Now, if n is prime and r > 2, then n — r+ 1 <n -— 1 and each of the coefficients (”) on the 
right-hand side of II is divisible by n. (See Remark 2 at the end of this note.) Hence B(n,r) is 
divisible by n. This proves part (i) of the Theorem. 

Moreover, since n does not divide r, the identity I together with part (4) now yields: 


( *) n divides B(n — 1,r) + B(n —1,r- 1), 2<r<n-1. 


To prove part (ii) we apply induction on r. In case r= 1, B(n — 1,1) = 1, because there is just 
one way of distributing n — 1 objects in one single box. In this case we have, therefore, 


B(n—1,1) +(-1) =1-1=0, 


and n divides 0. Assume then that n divides B(n — 1,r—1)+(—1)’"! for some r-—1, 
l1<r—1<n-—1. By(*) above. 


B(n-—1,r)+B(n-—1,r-—1)=O0(modn), 2<r<n-1. 
We add (—1)’~' to both sides of the equivalence. By the inductive hypothesis, 
B(n—1,r—1) +(-1)" | =0(modn). 

Hence 

B(n—1,r) =(—1)"  (modn), 
which amounts to: 

B(n —1,r) +(-1)' = 0(modn). 
This completes the induction, proving part (ii) of Theorem 1. 


REMARK 1. B(n — 1,n — 1) =n — 1, because the number of distributions of n — 1 objects 
among n — 1 boxes, no box being empty, is clearly n — 1!. This then can be seen as giving a proof 
of Wilson’s theorem. 


COROLLARY (Wilson’s Theorem). (n — 1)! = —1 (modn) for n prime. 

REMARK 2. If 1 <k <n-—1,thenk ( "| = n| a ‘), as is easily established by a combinatorial 
argument. If n is prime, then since n + k, we have n|(” , 

By an argument similar to that in the paper one can establish that if n is prime, then n divides 


("-t) +(-p" 
REMARK 3. Note that B(n,r)/r! is the Stirling number of the second kind S(n,r). S(n,r) 
denotes the number of partitions of n objects into r cells. 


P-ADIC BINOMIAL COEFFICIENTS MOD P 


ROGER C. ALPERIN 
Department of Mathematics, The University of Oklahoma, Norman, OK 73019 


In this note we provide a simple formula for evaluating the binomial coefficient 


a a(a—1):--(a-nt+1 
(2) = 2 piri +1) 


7 aI mod p 


for any p-adic integer a € Z,. As a corollary we obtain another proof of the well-known formula 


for the Legendre symbol [- ; 
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Let p be a prime number; for a rational number a = g/r where the highest power of p 
dividing g is a, that of r is b, we define the p-adic valuation, v,(a) = a — b. The completion of 
the rational numbers, Q, with respect to the norm |a|, = p~’»‘ is the topological field of p-adic 
numbers, Q,,. The elements of Q,, with nonnegative p-adic value form the ring of p-adic integers, 
Z,. Every a € Z, has a unique p-adic expansion a = a) + a,p + a,p? + --- withO <a, <p, 
a, = Z. The least nonnegative residue of a © Z,mod p is the first p-adic coefficient ag. Also, 
using the p-adic expansion, it is easy to see that the set N of natural numbers is dense in Z,,. 


LEMMA 1. Ifa © Z,,, then (7) EZ, forallneN. 


Proof. The polynomial function f(a) =(") is continuous and f(m) is integral valued for 
m © N. Hence f maps the closure of N, Z,, to Z,. @ 
Since the value of (*) for a € Z, is a p-adic integer, we may consider its evaluation mod p. 


LEMMA 2. Ifa €Z,, then forneN 

(1) (Pr) =OQOmodp ifptn, 

(2) (Pr) =({)mod p ifn = pk. 

Proof. For the first part notice that it is true for a, a natural number, by considering the 


coefficient of x” in (1 + x)?* =(1 + x?)* mod p. The function g(a) = P) is continuous in 


the p-adic variable a and g(N) = 0 mod p; thus g(a) = 0 mod p, by the density of N in Z,,. 
For the second part notice that by cancelling factors of p and reducing mod p, 


Cc _papa-1l  pa-p | pa-~pkti 
pk pk pk-1 pk — p 1 
aa-l a-—k+1 a 
ii ir nr ce oe = ({.)mod p.m 


Now consider the power series expansion for (1 + x)*, a € Z, [I], 


GQ+x*=F (Mr) x”. 


THEOREM. If a © Z, has the p-adic expansion a = ay + a;p +--+ andn €&WN has the p-adic 
expansionn = nN) +n, pt+--: +n, p’, then 
* (a 
(2) = LE [ x’ mod p. 
i= 


Proof. Observe that 


(l¢x)7=¥ (x) =(14+x)*%(14x)?"*? 
k=0 
where 


a=N+p*'B,N=a,+apt--:+ t+a,p’. 
Thus 


(1+x)*= Ia + xP)" (1 + x?""')? mod p 


c=0\k 


Comparing coefficients of x” in the expression above we obtain the formula of the theorem. 
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B 


This theorem gives some insight into the p-adic expansion. It follows easily from this theorem 
that the p-adic expansion for a is 


(Notice that no terms | enter into the coefficient of x”.) 


a= > {( 5 Jmod p} p”. 


n=0 


From this expression one can obtain formulas for the coefficients of the p-adic expansion for 
a + B in terms of the coefficients of a and B. If a=X%,a,p', B =X~2_b,p', then a+ B 
oO 


— I 
= )) c,p' where 
1=0 


n—-1l n—-1 
a, + b. 
Cc, =a, +b, + y (5%) y | moa p 
s=0 P i=st+l Pp 1 


The details for the proof of this formula are left to the interested reader. 


COROLLARY 1. Ifn =a) +a,p+°:+: +a,p* is a natural number, then 
S (ln — 
(27) =Car FY » mos p. 
MN i=0 a, 


Proof. From the uniqueness of p-adic expansions and the geometric series for 1/(1 — p), it 
follows that 


_i_p-l 24, 
5 5 (l+p+p?+ ) 


ae 
is the p-adic expansion of — 4. Evaluating 2 by means of the theorem and combining it with 
n 


_1 an n “2a(2n) 
(2) = (-1)"22"(2! 
we obtain the stated result. @ 
According to Euler’s Theorem, for an odd prime p, 


the formula 


2 1 
=) = 277" )mod p. 
| 


2 
Using the next corollary we can obtain the evaluation of (=). 


CoroLiary 2. For an odd prime p, 2?°?~» = (—1)°"-/8mod p. 
Proof. If p = 1mod4, let n = (p — 1)/4 and apply the previous corollary. If p = —1mod4, 
let n = (p + 1)/4 and apply the previous corollary. @ 


Reference 


1. N. Koblitz, P-adic Numbers, P-adic Analysis and Zeta-Functions. Springer-Verlag, 1977. 


ANSWER TO PHOTO ON PAGE 559 


The picture shows Grace Chisholm Young (1868-1944) who with her husband William Henry 
Young (1862-1943) produced fundamental research in general topology, measure theory, and 
functional analysis—and also six children, who include Professor L. C. Young of the University of 
Wisconsin. The picture was taken before her marriage, in fact probably in the mid 1890s when she 
took her doctorate at Géttingen, the first in the new program. 
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[T]Q + x2771)(1 + x?")(1 — x”) 


n>1 

—_— IT@ + x2r-1)] _ x4") 
n> 

_ I] (1 4 x4n-3)(1 4 x4") _ xin), 
n>1 


and we have 


1—x"”\? 4n—3 4n-1 
(4) [5 ) -1-45 [= ; 
nsi\lt+x 1 4 x 4n-3 1+ 47-1 
Now, 
(5 _ =| 7 rl (1 _ x21) _ x7") 
noi\l + x" n>1 (1 + x”) 
=[][Q- x*"*)G — x") 
n>1 
_ I] (1 _ x2" 1) _ x?" TV] _ x7") 
n>1 
= y (—1)"x", 
So (4) is 
00 2 4n—3 4n-1 
5 —1)"x"| =1-4 ea 
©) | 2 | Py P+ x9" 1+ x4"? 


Put —x for x, and we obtain 


, ; 2 _ 4 ytn-3 yan-l 
(6) a* =it d 1— 4n—-3. 7 _ 4n-1]? 


n>1 


from which Theorem 1 follows immediately [2]. 
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A SIMPLE STRUCTURE THEOREM FOR TWO-GENERATOR, ONE-RELATION GROUPS 


DONALD P. MINASSIAN 
Department of Mathematical Sciences, Butler University, Indianapolis, IN 46208 


A group G, with normal subgroup N, is said to be the extension of N by the factor group 
G/N. That is, G admits a two-step normal series: GD N D {e}. If N and G/N have readily 
grasped structure, then valuable insights can be gained about the structure of G. In this note we 
shall consider the important class of groups given by G, = {a,b|ba = ab” }, where n is any fixed 
natural number exceeding 1. Specialists well know [1] that each such group is an extension of a 
subgroup N,, of the additive rational numbers by the additive integers G/N,,. However, a proof for 
nonspecialists appears lacking in the literature. This note fills that gap. (The skeletal proof in [1] 
deals with G,. The author knows of no reference for G,,,n > 2.) 

We assume that G, has two “independent elements” or generators a and b of infinite order, 
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that G, consists of all elements of form a*b” where x and y are integers, and that a moves past 
b by employing the given relation ba = ab”. We need two more interrelated definitions: 


DEFINITION 1. For any fixed natural number n, an n-adic rational number is a rational number 
of form +n* where k is any integer. 


DEFINITION 2. The set H,, of all finite sums (including zero) of n-adic rational numbers is 
clearly a subgroup of the additive rational numbers. We call H,, the n-adic rational group. 

For example, —1/16 = —2~* is a 2-adic (or dyadic) rational number, and H, = {X, + 2°, 
integer k} is the dyadic rational group. 

The reader may have rightly surmised that WN, of the first paragraph turns out to be H,,. Note 
that H, contains the integers, since 1”_,(1/n) = 1. Speaking loosely, H,, may be deemed the 
“second simplest subgroup” within the additive rational numbers after groups isomorphic to the 
integers. (We say “second simplest” since if H,, contained a minimal power of n, say n”, then H, 
would be generated by n™” and hence isomorphic to the integers.) Henceforth, we drop the 
subscript n from G,, N,, and H.,,. 

We first show that N of the first paragraph is precisely that subgroup of G consisting of all 
finite products of n-adic powers of b, i.e., N = {11,,5" }, for integers w, is the n-adic rational 
group H. We then show that N is normal in G and that G/N is an infinite cyclic group (“the 
additive integers”). We first need a Lemma and its corollary. 


LEMMA. For any integer w, b” = a “ba”. (Strictly speaking, b"” is not yet defined for negative 
w but will be below.) 


Proof. By definition of G we see that b” = a~'ba. Now 
hb” = (b")”" = (a-'ba)" =a" 'b"a = a ‘(a ‘ba)a =a *ba’; 
further, 
pb” = (b”)” = (a~*ba*)" =a *b"a? = a~*( aba) a* = aba’; 


and generally, 


k 


(1) b” =a “bat, fork =0,1,2,.... 
Also, for k = 0,1,2,..., 
(a‘ba~*)” = a‘(b™ )a-* 
= a“(a *ba*)a~*, by (1) 
= b, 
and thus a*ba~* is an n‘th root of b—call it b” . (We might ask, “Is the n“th root of b 


unique?” The answers are (a) it doesn’t affect our argument either way, and (b) “yes” (see [1]; 
alternatively, I shall supply a proof upon request for general G,,)). That is, we define 


(2) b” “=a*ba*, fork =0,1,2,.... 
Together, (1) and (2) prove the Lemma. 


y 


CoROLLaRY. If x and y are integers with x > y, then b” =(b"’)” ', where b” and b"’ are 


given by the Lemma. 
Proof. Simply calculate 
(b”’) _— (a~’ba”) mn = gh” "g? = a~’( a’ *ba*~”) a” = a *ba* = b”, 
where all equalities except the second come from the Lemma. 


Three propositions will now complete our task. 
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PROPOSITION 1. The subgroup N of G generated by b” where w ranges over all integers— that is, 
the set N of all finite products of the form b”’*b"* --- b"””"—is an n-adic rational group, N = H. 


Proof. Let H be the n-adic rational group as defined in Definition 2. Let each a, be an n-adic 
rational number; that is, «, has the form +n”, for some integer w,. Note that as usual b~”” 
means the group inverse of b”". The map v from H to N given by 


[ Xa, Jo-= To" 
i=1 i=] 


looks suspiciously like an isomorphism. We show that it is and the proof follows. 


(i) The map v is well-defined. Suppose U?_,a; = 24_,8;, where each a, and B, are n-adic 
rational numbers: a, = +n” and B, = +n” for integers w, and 7,. Now clearly one (or more) of 
the a, and B; are smallest in absolute value; i.e., the absolute value of at least one of them is n”, 
where m is the smallest of all the exponents w, and r,. So the Corollary above implies that each 
b* = (b" )%/"", since a, = n™, where m,; > m; similarly for each b*. Thus the following two 
relations are equivalent: 


(1) TT (6") = [1 (6*), 
and 
(I) lor y"] = Tory"). 


Since each a,/n™ and B,/n”™ is integral, (II) is clearly equivalent (by elementary rules of integer 
exponents in groups) to 


(111) (or) Learn = (pn) rn 


r=] 
“Well-defined” follows since La, = LB, by assumption! 


(ii) The map v is a homomorphism onto N. This comes from the definitions of H, N and the 
well-defined map v. 


(iii) The map v is one-to-one. If not, b** = e (the group identity) for some nonzero Xa;,. Let 
w be the minimum of the exponents w, for all a, = +n”. Thus (2;a;)/n” is integral. If w > 0, 
making n” integral, then by the rules of integer exponents in groups, 


3 Hr” PZ — PB, — e, where 2a,/n” # 0 (since 2a, + 0 by assumption). 
I I p 


Thus a nonzero integer power of b is the identity. This is impossible since b has infinite order. If 
w <0, recall by (2) that b”” raised to the power n~” is b. Now raise both sides of (3) to the 
positive integer power n~”. By properties of integer exponents in groups, n”” commutes past the 
integer exponent given by the first = in (3) to produce 
pre/n™ _ en” =e, 
and again some nonzero integer power of b is the identity. 
Because the map vu is well-defined, b* is well-defined for a sum a of n-adic rational numbers 
a, = +n”, regardless of how a is expressed as such a sum 2 + n™ for integers w. That is, 
b* =JTIb*”” is well-defined. (See [2]. p. 159, for a simple example.) The following two proposi- 
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tions complete our task: 
PROPOSITION 2. The subgroup N consisting of all n-adic rational powers of b is normal in G. 


Proof. For any integers w and i we have 
pr = git) bgvtt = a~'(a~”ba” ) a’ _— a~'b" a’, 
by using the Lemma in the first and third equalities. Thus (reversing the steps), a~'b” a’ = b” 
€ N for any integer w. Further, taking inverses of both sides of this last equality, a~'b-” a' € N. 
Thus, for any a = 2, + n” we have 


w+ 


a'b*a' = [ [(a“'b*" a) € N, for any integer i. 


Since b is itself an n-adic rational power of b(b' = b”” EN), N is trivially invariant under 
conjugation by b and b'. Thus the definition of G as being generated by a and 5b and the fact 
that N is invariant under conjugation by any integer power of a or b yields the desired normality 
of N in G. 


PROPOSITION 3. The factor group G/N is infinite and cyclic. 


Proof. (i) Cyclic. By the definitions of G and N, G/N is generated by the coset aN. 

(ii) Infinite. If not, then (aN)* = N and so a* = b% for some nonzero integer k and some 
a = Ln” for integers w. Now there exists a large enough nonzero integer m (a power of n) such 
that ma is an integer, s. Raising both sides of a* = b* to the mth power, we have a” = b’, for 
integers r and s, where r = mk # 0 since both m and k # 0. Without loss of generality, take r 
positive. Thus a’ = b* would commute with b, ba" = a’b, whereas the definition of G shows that 
ba’ = a’b’, where ¢ is anatural number greater than 1, # a’b, since b has infinite order as noted 
in the second paragraph of this note. (It is at this point that the argument would break down if we 
let n = 1 in the definition of G. However, the abelian group G = {a,b|ba = ab} is simple enough 
in its own right. Clearly, N consists of all “l-adic rational powers” of b—i.e., all integral powers 
of b--and thus is isomorphic to the additive integers. Trivially, N is normal in G and the 
subgroup G/N is cyclic. Also, G/N would still be infinite, i.e., we could not have a’ = b’, but the 
argument would involve a further discussion of “free generators’, which we have chosen to avoid 
here.) 

Thus the group G is indeed “metabelian” (two-step abelian), i.e, G has a normal series 
G2 WN 2D {e}, where N and G/N are both abelian. Furthermore, N and G/N are among the 
least complicated of all infinite groups. 
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CORRECTION TO “NOTE ON THE CAYLEY-HAMILTON THEOREM” 


MARVIN JAY GREENBERG 
Department of Mathematics, University of California, Santa Cruz, CA 95064 


Formula (4), p. 194, this MONTHLY, March 1984, need not hold even when B commutes with 
A, because if Q(t) = adj(t] — A), Q(B) will not in general equal adj( B — A). All we can say if B 
commutes with A is that 
Q(B)(B— A) = x4(B) =(B- A)Q(B). 
My thanks to John R. Silvester for this correction. 
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PICK’S THEOREM REVISITED 


DALE E. VARBERG 
Department of Mathematics, Hamline University, St. Paul, MN 55104 


Pick’s theorem is one of the gems of elementary mathematics, this because the most innocent 
sounding hypotheses imply a very surprising conclusion. Yet the statement of the theorem can be 
understood by a fifth grader. Call a polygon simple if its boundary is a simple closed curve and 
call it a lattice polygon if the coordinates of its vertices are integers. Pick’s theorem asserts that the 
area of a simple lattice polygon S is given by 


1 1 
A(S)=i+5b-1=v0-5b-1 


where i, b, and v are, respectively, the number of interior lattice points, the number of boundary 
lattice points, and the total number of lattice points of S (Fig. 1). 


b 
A(S)=it+t>-1 

2 
A(S,)=7+5-1=%4 
A(S,)=7+4-1=%4 


A(S, US,)=15+ ¥-1= 20 


Fic. 1 


We are aware of several proofs of Pick’s theorem [1], [2], [4], [5], [6]. Each of them uses an 
elegant trick or two but finally rests on the stubborn fact that a primitive triangle (one whose only 
lattice points are its vertices) has area 1/2. We call this a stubborn fact because its derivation 
seems to be the major hurdle in all the standard proofs. 

Our proof of Pick’s theorem is direct, intuitive, and requires nothing more sophisticated than 
the almost obvious and easily proved fact (proof by induction will work) that a lattice polygon can 
be decomposed as a union of lattice triangles. 

To begin, associate with each lattice point P, of S a weight w, = 6,/27, where 6, measures 
the “visibility” angle with which P, can see into S$. Thus w, = 1 at an interior lattice point, 
w, = 1/2 at a boundary lattice point that is not a vertex, and w, = 1/4 at a right-angled corner 
point. Think of w, as measuring the contribution that P, makes to the area of S (see Fig. 2). Let 


W(S)= LL w, 


P,ES 
and, as before, let A(S) be the area of S. 

LEMMA. W(S) = A(S). 

Proof. We note first that W is additive; that is, if S = S, US, as in Fig. 1, then W(S) = 
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W(S,) + W(S,). This is a consequence of the fact that the visibility angles in S, and S, ata 
common lattice point add together to give the visibility angle in S at that point. 

Next consider successively (i) a lattice rectangle with sides parallel to the lattice (Fig. 2), (11) a 
lattice right triangle with legs parallel to the lattice (Fig. 3), and (iii) an arbitrary lattice triangle. 
The latter can be surrounded by right triangles of type (ii) to form a rectangle of type (i) (Fig. 4 
shows a typical example). 


Fic. 2 Fic. 3 Fic. 4 


That W(S) = A(S) in Case (i) is obvious and Case (11) follows immediately from Case (1) upon 
division by 2. We obtain the result in Case (iii) by using the shared additivity of W and A. It 
remains only to decompose an arbitrary lattice polygon as a union of lattice triangles and then to 
apply the additivity of W. 

We remark that the simpleness of § is not used in the proof of the lemma. 


PICK’S THEOREM. For a simple lattice polygon, 


od 1 
A(S)=i+5b-1=0- 55-1. 

Proof. A simple polygon with c interior vertex angles has angle sum (c — 2)7. For example, a 
triangle has angle sum 7, a quadrilateral 27, and so on. It follows that the sum of all the visibility 
angles at points P, along the boundary of S is (b — 2). Thus if J and B denote the interior and 
boundary of S, then 


A(S)=W(S) = Dew + Li wy 
P.el P.EB 
_, (b=20 
aa 27 
. 1 
=it3sb 1. a 


Generalizations. Consider next a general lattice polygon S, one that may have holes or 
intersect itself (see Figs. 5, 6, and 7). We require only that S can be expressed as the union of 
finitely’ many simple lattice polygons. Pick’s theorem as originally stated does not apply to such 
figures but a simple variant involving the Euler characteristic x does. The correct formula is 

1 
A(S) = U-~ 56, — X. 
Here, e, is the number of boundary edges of S. Note that for a simple lattice polygon, e, = b and 
x= 1. 

The situation illustrated in Fig. 5 is easily handled with no additional apparatus. Let S be a 
simple lattice polygon containing m separated holes which are themselves simple lattice polygons. 
Recall that the Euler characteristic for such a figure is 1 — m. Let by, b,,b,...,b,, denote the 
number of lattice points along the outer boundary of S and the boundaries of the m holes of S, 
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x= 
A(S) = 40 — (36) +2 A(S) = 20 — $(16) -1 A(S) = 40 — (38) + 2 


Fic. 5 FIG. 6 FIG. 7 


respectively. Then, using the fact that the visibility angles of the kth hole sum to (b, + 2)7, we 
obtain 


A(S)=i+— Do @, 
27 EB 
i a ne 
=i +5 (by +b, + ++ +b,) +(m = 1) 
. dt _ 1 
=i+356 X =v 75 x. 


This demonstration breaks down for the polygons of Figs. 6 and 7. For them, we are forced to 
follow the standard procedure of decomposing S$ into primitive triangles each of area 1/2, 
together with a simple combinatorial argument. We state the general result as a theorem. 


EXTENDED PICK’S THEOREM. Let S be a lattice polygon (simple or not). Then 


1 
A(S) — U ~ 4 & — X> 
where v is the total number of lattice points in S, e, is the number of edges on the boundary of S, and 
x is the Euler characteristic of S. 


Proof. We suppose that S has been decomposed into primitive triangles (see [4] for a proof 
that this can be done) and let v, e, and f denote the number of vertices, edges, and faces in this 
decomposition. Each triangle has 3 edges and each edge is shared by 2 triangles except for those 
edges on the boundary of §. Thus, 3f = 2e — e, and so 

f= -e,+2e-2f=2v—e,-Av—e+f)=2v-—e, — 2x. 
We conclude that 
1 1 
A(S) =5f=v- 5% —x. 0 


We have recently learned of an even more general form of Pick’s theorem; it appears in 
reference [3]. 
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UNBOUNDED SEQUENCES OF EULER-DEDEKIND MEANS 


CHARLES R. WALL 
Trident Technical College, Charleston, SC 29411 


Euler’s totient function, the number of integers not exceeding n and prime to it, is well known 
to be given by 


o(n) =nII(1 — 1/p), 
where the product is taken over the distinct primes that divide n. Less well known is the closely 
related Dedekind function: 


W(n) =nII(1 + 1/p). 
Define g(n) to be the mean of these: 
g(n) = (o(n) + $(2)) 72. 


It is easy to see that g(n) is always an integer and that g(n) = n whenever n is a prime power. 
So, if we iterate g, the resulting sequence may become constant: 


g(12) = 14, 2(14) = 15, 9(15) = 16, g(16) = 16,... . 


Richard Guy [1] asked if such sequences can increase indefinitely; we show that the answer is 


ce 99 


yes. 
Let x,,, = 2(x,) with x, = 1488 = 243 - 31. Then 


x, = 243 - 37, x3 = 2°3 - 11, x,= 2°41, x5 = 2°83, X_ = 27167, 
x, =2°?5-67, and x, =2°3-31=2x, 
so that for n > 1, we have x,,,, = 2x, and the sequence { x, } is unbounded. 
The two smallest initial values of such unbounded sequences are x, = 45 and x, = 50. Each of 
these gives x, = 56, x. = 1488, so that for n > 28, we again have x,4.7 = 2x,,. 
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What is the next element of the sequence that begins with F4E, $9, SESEN,...? 
—Donald E. Knuth 
See p. 595. 
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R 


nen, Cp 


Z’ 


Fic. l 
FIG. 2 


Also solved by K. L. Bernstein, J. Dou (Spain), V. Grinberg, L. R. King, L. Kuipers (Switzerland), O. P. Lossers 
(The Netherlands), A. Markovich (Yugoslavia), W. W. Meyer, A. Muller (Switzerland), B. Olk, M. Pachter (South 
Africa), F. B. Strauss, B. Vidakovic (Yugoslavia). 


ADVANCED PROBLEMS 


Solutions of these Advanced Problems should be mailed in duplicate to Professor D. H. Mugler, Department 
of Mathematics, University of Santa Clara, Santa Clara, CA 95053, by January 31, 1986. The solver’s full 
post-office address should be on each sheet. 


6501. Proposed by I. J. Schoenberg, Madison, Wisconsin. 


Suppose that we have an identity 
+ 0O + 0O 
(1) e=| y an2"]| y 2 for |z|<r,(r> 0), 


where a, > 0 and b, > 0 for all n, and neither of the factors in (1) reduces to a constant. Show 
that the two factors in (1) are necessarily of the form 


(2) ete and e”-*, with a>0,b>0,a+b=1. 


6502. Proposed by Klaus Schirger, University of Bonn, Federal Republic of Germany. 


Each of the following conditions is clearly necessary for the convergence of a sequence { S, }: 


- ; _t1e ; . 
(A) lim i >) S, exists; and (B) lim > >) S,,.*« exists for every integer m > 1. 
k=1 


n—- 00 k=1 n— co 


If {S,} is a real sequence which is bounded above and {nS,} is nondecreasing, is either 
condition sufficient for the convergence of { S, }? 


ANSWER TO QUESTION ON PAGE 587 


E1GHT. The rule is to replace the letters that are used as Roman numerals by their Arabic 
equivalents. 


THE TEACHING OF MATHEMATICS 


EDITED BY MARY R. WARDROP AND ROBERT F. WARDROP 


Material for this department should be sent to Professor Robert F. Wardrop, Department of Mathematics, 
Central Michigan University, Mount Pleasant, MI 48859. 


MORE— AND MOORE— POWER SERIES WITHOUT TAYLOR’S THEOREM 


I. E. LEONARD 
Department of Mathematical Sciences, Northern Illinois University, DeKalb, IL 60115 


JAMES DUEMMEL 
Department of Mathematics, Western Washington University, Bellingham, WA 98225 


Inspired in part by the article “Power series without Taylor’s theorem” by Wells Johnson in the 
June-July 1984 issue of the MONTHLY, we present here an extremely simple method for obtaining 
the power series expansions for the functions sin x, cos x, e *, e*. The approach we take uses no 
“sophisticated” concepts such as integration by parts, reduction formulas, or recursion relations; 
it uses only the monotonicity of the definite integral, and the basic inequalities for the functions in 
question. We make no claim of originality: the first author learned of the technique from 
Professor R. A. Moore, at Carnegie-Mellon University, while the second author encountered it in 
a calculus text (see [2], p. 78). 

Starting with the inequality, cost < 1, and integrating repeatedly over the interval [0, x], we 
obtain inequalities involving the successive Taylor polynomials for sine and cosine. For example, 
four integrations yield, for x > 0, 


x . 
x ~ 3, <smxs x 
and 
1 a <1 xo xt 
71 < COSX < T a 


Students are quickly convinced that sin x and cos x are trapped between successive partial sums 
of their usual Taylor series expansions. The error estimates 


n 2k 2n+2 
k Xx x 
cos x — —] < ————__—__ 
* 2 ) (2k)!| ~ (Qn +2)! 
and 

n 2k+1 2n+3 
_ _] k xX < ——___ 
sm 2 Bee! < nad)! 


are evident, and a little more argument will convince even the most skeptical students that the 
series actually converge to sin x and cos x. 

As soon as the derivative of the exponential function has been covered, this approach will also 
yield the Taylor series for e *~ from the inequality e-‘ < 1, for t > 0. This series can be used to 
estimate e ' and hence e. In particular, the estimate (24/9) < e < 3 is easily obtained in this 
way. This could be used below for students who find 3? more acceptable than e?. 

The power series expansion for e*, x > 0, can be obtained from the inequality 1 < e’ < e”, for 
0<t<b (b fixed), by integrating repeatedly, and we obtain, after rearranging, 


1+ ea + + xr <e*<l1l+ x +4 x" 
X _ ee aE ERROR X —_— eee TT 
2! (n+l)! 0° 2! (n+ 1)! 
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for 0 < x < b. This inequality gives the error estimate 
n+l ok 


O<e*- ar = 


Tay he? =D) 


for 0 < x < b. Nowa simple argument showing that lim, _, ,,(x”/n!) = 0, for fixed x, yields the 
power series expansions for sin x, cos x, e *, e*, valid for x > 0. Note that taking x = b= 1 in 
the above inequality gives 


<TH + sy 


— 1 eg x toe 
(n+)! ° ~k! (n+)! 
References 


1. W. Johnson, Power series without Taylor’s theorem, this MONTHLY, 91 (1984) 367-369. 
2. J. F. Randolph, Calculus, Macmillan, New York, 1952. 


AN ELEMENTARY PROOF OF A THEOREM IN CALCULUS 


DONALD E. RICHMOND 
500 Fulton St., #201, Palo Alto, CA 94301 


A fundamental theorem states that if f’(x) = 0 at every point of an interval [a, b], then f(x) 
is constant on [a,b]. Most proofs use the Mean Value Theorem, but several proofs have been 
given that are independent of this theorem (Bers [1], Halperin [3], Powderly [4]). The present note 
proves our fundamental theorem in an elementary way. 

Let f’(x) = 0 on [a, b]. If f(x) is not constant on [a, b] as the theorem states, then for some 
u<von[a,b], f(v) # f(u). Hence the chord joining (u, f(u)) and (v, f(v)) has a slope different 
from zero. That is 


(1) f(v) —f(u) =C(v-—u), or Af=CAx, withcC #0. 
Assume that C > 0. Bisect [u,v] at w. If 
f(v) — fw) < Clv — w) 
and 
f(w) — f(u) < C(w — u), 
then 


f(v) — flu) < C(v— x), 

which contradicts (1) with C > 0. Hence on at least one of the intervals [u,w] and [w, v], 
Af/Ax > > C. Call this interval [u,,v,]. By repeated bisection one obtains a nested sequence of 
intervals [u,,v,] over each of which Af/Ax > C. [u,,v,] converges to some x in [u,v]. Hence 
if C > 0, Af/Ax cannot approach f’(x) = 0 as required. 

The possibility C < 0 is excluded by reversing the inequalities. Hence C = 0, contrary to the 
assumption that f(v) # f(u). 

The same argument shows that if f’(x) > 0 on [a, b], then f(v) > f(u), and if f’(x) < 0 on 
[a,b], then f(v) < f(u). 

The argument may be generalized to show that if m < f’(x) < M on [a, 5], then 


(2) m(v—u) <f(v) —f(u) < M(v— u). 


It is sufficient to replace C > O by C> M and C < 0 by C < m. (2) has been called the ‘“‘ weak”’ 
form of the Mean Value Theorem [2]. 
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E 3105. Proposed by Calin P. Popescu, student, Bucharest, Romania, and the editors. 


Let X be a compact space (not necessarily 7,) and f: X — X a function with the property that 
f(A) ¢ A for every closed subset A of X. Show that there is a singleton set P such that P c f(P) 
and a closed set Q such that f(Q) = Q. 


E 3106. Proposed by Donald E. Knuth, Stanford University. 


Let S(n) be the set of all positive integers k such that the fractional part of n/k is 1/2 or 
more. For example, 


S(17) = {2,3,6,9, 10,11, 18,19, 20,...,34}. 
Prove that 


» o(k) =n’, 


kES(n) 


where ¢ is Euler’s totient function. 
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E 3107. Proposed by Ira Gessel, Brandeis University. 
m'(2m+ 2n)! 


Show that if m and int , then —— 
w that if m n are integers, then (2m)in(m + n)! 


is an integer. 


FE 3108. Proposed by L. Cseh and I. Merényi (students), Cluj, Romania. 
Let x, y,z be real numbers, k,,k,,k, © (0,1/2) and k, + k, + k, = 1. Prove that 
ki kyk3(x +y t+ z) > xyk,(1 — 2k,) + yzk,(Q1 — 2k,) + zxk,(1 - 2k,). 
When does equality hold? 


E 3109. Proposed by Desmond MacHale and Michéal O Searcoid, University College, Cork, 
Treland. 


Let R be an associative ring such that (xy)” = yx, for all x, y © R, for some fixed natural 
number n. Must R be commutative? 
E 3110. Proposed by Dennis Spellman, Sacred Heart University, Bridgeport, CT. 


Let p be an odd prime. Show that if (x9, y)) is a pair of positive rational numbers on the curve 
x? + yP =1, then at (x9, y) both dx/ds and dy/ds are quadratic irrational numbers. Here s 
denotes arc length. 


SOLUTIONS OF ELEMENTARY PROBLEMS 
A Square in a Polyhedron 


E 2740 [1978, 765]. Proposed by Victor Pambuccian, Bucharest, Romania. 


Show that if P is a convex polyhedron, one can find a square all of whose vertices are on some 
three faces of P, as well as a square whose vertices are on four different faces of P. 


Partial solution by the proposer. We first prove the following 


LEMMA. Let P,P,...P,(n > 3) be a convex polygon, such that «P,P,P, < 90° and 
«x P,P,P,_, < 90° and P,P, is not parallel to any P,P,,, (i =1,...,n —1). Then one can find a 
square all of whose vertices lie on exactly three sides of the polygon. 


n 


Proof. Let d be the support line parallel to P, P, and P, the contact point, P, = dM P,P,... P,,. 
For PE P,P,...P, let x = d(P,P,P,) (the distance from P to P,P,) and y = |PP’|, where 
P’ & P,P,...P, and PP’||P,P,. Put d(P,, P,P) = x9. Then 

f:[0,x]>R f(x) =y 


is a continuous function. So g: [0,x 9] — R, g(x) = f(x) — x is also continuous and g(0) = 
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|P,P,|> 0 and g(x9) = —XxX, < 0. Since g has the intermediate value property, g(x,) = 0 for 
some x, € (0, x9). Q.E.D. 


THEOREM. For every convex polyhedron there is a square, all of whose vertices lie on some three 
distinct faces of the polyhedron. 


Proof. Let V be any vertex of the polyhedron and a,, a,,..., a, be all the faces which meet in 
V (ie, VEa,,i=1,2,...,n). Let d,, =a,Na, be the lines of intersection of the planes 
containing faces a, and a,. Take two points A and B © a, on the two edges that belong to a, 
and pass through V, such that AB # d,, fori, 7 = 2,...,n, and xVAB < 90°, «VBA < 90°. (The 
possibility of such a choice is obvious.) Consider now all the planes that pass through 4B. When 
these planes are sufficiently near to a, = (VAB), their intersection with the polyhedron will be a 
convex polygon with the angles at A and B acute and having no side parallel to 4B. Applying the 


previous lemma to such a polygonal section we obtain the desired result. 


A Symmetric Dissection of an Isosceles Triangle 


FE 2918 [1981, 763]. Proposed by Jordi Dou, Barcelona, Spain. 


Show that an isosceles triangle can be dissected symmetrically around the principal median 
into seven acute isosceles triangles except when the vertex angle A is 90°, 120°, or when 
135° < A < 144°. 


Solution by the proposer. Let ABB’ (B = B’) be the triangle. If A < 90°, we obtain a partition 
into 4 equal triangles. With another partition into 4, of one of the two central triangles, one 
obtains a partition into 7. 

Let A > 90°. Let H be the unique interior vertex (on the axis) of the partition: if there are 2 
interior symmetric vertices H and H’ not on the axis, we sketch the triangles AHH’ and MHH’, 
and by symmetry will have an even number of triangles. If H and H’ are on the axis, sketch 
AHP, AHP’, and MH’M’; then the pentagon BMH’HP cannot possibly be partitioned into two 
triangles. (See Fig. 1.) 


A A 


Fic. 1 


Let P,P’ be the (necessary) vertices on AB,AB’; M,M”’ those situated on BB’. The 
“principal” vertices of the triangles BPM, BP’M’ are B and B’ (because B = B’ < 45°). That of 
HMM’ is H (by symmetry). For the triangles AHP and PHM (or AHP’ and P’HM’), we have 
to consider the 9 pairs of principal vertices. (See Fig. 2.) 


PRINCIPAL VERTICES 
6,y: Then w = (7 + 2A)/4 > 90° is impossible. 
6,5: Then 6 = 90°, which is impossible. 
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A 


FIG. 2 
6,¢: Then € = 90°, which is impossible. 


a, y: Then w = (37 — A)/4 > 90° is impossible. 


B,y: Then B=a—-—A < 90°, and w = (5A — 7)/4 < 90° implies A < 108°. Therefore 90° 
<A < 108°. 

8,6: Then B = 7 — A < 90° and 6 = (3A — 2)/2 < 90° implies A < 120°. Therefore 90° < 
A < 120°. 

B,e: Then B = 7 — A < 90° and w = (7A — 37) /2 < 90° implies A < (102.857...)°. There- 
fore 90° < A < (102.857...)°. 

a,6: Then 6 = (37 — 3A)/2 < 90° implies A > 120°, and w = 2A — mw < 90° implies A < 
135°. Therefore 120° < A < 135°. 


a,€: Then w = (57 — 5A)/2 < 90° implies A > 144°. Therefore A > 144°. 
Also partially solved by W. Janous (Austria). 


An Even Combinatorial Sum 


E 2932 [1982, 212; 1982, 498]. Proposed by Henry E. Fettis, Mountain View, CA 
For n > 1, set 


[((n+ b)/2] n-1 
s(b)= (ona 2k + 5) 
n rm k\(n— k)! 


Prove that, if b< 1n,S$(b) = S,(—)). 
Solution by Kenneth L. Bernstein, MITRE Corporation, Bedford, MA. 


S,(b) = = y (-1)*(; (polynomial in k of degree n — 1) 
* k=0 


n 


_ x (i (ma 2k by" 
k=[(n+b)/2]4+1 k(n—k)! 


According to Euler’s formula (a proof of which appears in this MONTHLY, 85 (1978) 450-467) the 
first sum is zero. Substituting m — s for k leaves 
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n—1—-[(n+b)/2] (n Is - by" 
S(b) = —1)° +. 
n(b) , X (~1) s'(n—s)! 
When n+ 5b is odd the upper limit in the above sum is [(n — b)/2]. When n + b is even the 
upper limit in the above sum is [(n — b)/2]— 1, but this hmit may be extended to [(n — b)/2] 
since, for this value, the summand vanishes. Thus: 


[(n—b)/2] n—-1 
_ yk (ra 2k by 
= S,(-b) Q.E.D. 


Solution by O. P. Lossers, Eindhoven University of Technology, The Netherlands. According to 
Feller, An introduction to probability theory and its applications, vol. 2, first ed., p. 27, 


[x/2] 


() vox) = 2m — DEY (fx 24) 


is the density of the probability distribution of the sum of » independent random variables having 
uniform distributions on (0,2). It follows that u, is symmetric around x = n. 
Hence we have 


S,(6) = Sn —1)!lu, (n+ b) 


2” 
= —u,(n + b) 


which is symmetric around b = 0. 
Replacing 2 in (1) by an arbitrary a > 0 yields an obvious generalization. 


Also solved by P. S. Bruckman, L. Carlitz, C. S. Karruppan Chetty (India), I. Gessel, O. G. Ruehr, University of 
South Alabama Problem Group, C. Wildhagen (The Netherlands), and the proposer. 


Carlitz noted that n!S,(2k — n) = 2"7'4,_ 1.4, Where A,,_, , 1S an Eulerian number (see, for instance, page 243 
of Louis Comtet’s Advanced Combinatorics, Reidel Publishing, 1974). 


Light Ray in a Semicircle 


E 2950 [1982, 424]. Proposed by Ko-Wei Lih, Academia Sinica, Taiwan. 


The inner side of a semicircle (including diameter) is a mirror. A light ray emitting from the 
zenith makes an angle a with the vertical line, 0 < a < 7/2. Characterize a such that the light 
ray will hit the zenith after finitely many reflections. (The context can be enlarged to a 
hemispherical mirror, including horizontal base.) 


Solution by the proposer. Observe that if we have a circular mirror, as in Fig. 1, the light ray AC 
will hit B when it is reflected by a horizontal mirror through the center. Since B and C are 
equidistant from the horizon, instead of following the light ray in the semicircular mirror, we may 
follow the same light ray in the circular mirror as in Fig. 2. When it hits a point on the lower 
circumference, the original light ray hits the symmetric point on the upper half. When it hits a 
point on the upper circumference, so does the original light ray. Therefore we must characterize 
the angle a in the circular mirror such that the light ray hits Z or Z’ after finitely many 
reflections. This occurs if and only if 8/7 = m/n, where m and n are integers with O < m<n 
and (m,n) = 1. Thus a/(7/2) = (n — m)/n, provided we make the convention that if a light 
ray hits a corner, it reflects back in the same direction. If we do not allow the light ray to hit the 
corners, then n must be odd. 
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R 


ee, Oe 


Z’ 


Fic. 1 
FIG. 2 


Also solved by K. L. Bernstein, J. Dou (Spain), V. Grinberg, L. R. King, L. Kuipers (Switzerland), O. P. Lossers 
(The Netherlands), A. Markovich (Yugoslavia), W. W. Meyer, A. Miller (Switzerland), B. Olk, M. Pachter (South 
Africa), F. B. Strauss, B. Vidakovic (Yugoslavia). 


ADVANCED PROBLEMS 


Solutions of these Advanced Problems should be mailed in duplicate to Professor D. H. Mugler, Department 
of Mathematics, University of Santa Clara, Santa Clara, CA 95053, by January 31, 1986. The solver’s full 
post-office address should be on each sheet. 


6501. Proposed by I. J. Schoenberg, Madison, Wisconsin. 
Suppose that we have an identity 


(1) e={¥a,z")( Y b2" for |z|<r,(r> 0), 


n=0 n=0 


where a, > 0 and b, > 0 for all 1, and neither of the factors in (1) reduces to a constant. Show 
that the two factors in (1) are necessarily of the form 


(2) e?*e and e”-*, with a>0,b>0,a+b=1. 


6502. Proposed by Klaus Schirger, University of Bonn, Federal Republic of Germany. 


Each of the following conditions is clearly necessary for the convergence of a sequence { S, }: 


; - ; _i1e ; ; 
(A) lim . >) S, exists; and (B) lim a >) S,,.* exists for every integer m > 1. 
a k=1 a k=1 


If {S,} is a real sequence which is bounded above and {nS,} is nondecreasing, is either 
condition sufficient for the convergence of {S, }? 


ANSWER TO QUESTION ON PAGE 587 


E1GHT. The rule is to replace the letters that are used as Roman numerals by their Arabic 
equivalents. 
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SOLUTIONS OF ADVANCED PROBLEMS 
Sets with Dense Projections 


6421 [1983,134]. Proposed by J. Beck and J. Pach, Hungarian Academy of Sciences, and 
F. Galvin, University of Kansas. 


Let E be a set of points in the plane with the property that every closed disc of radius 1 
includes at least one element of E. Prove that there exists a straight line Z such that the 
orthogonal projection of E onto L is everywhere dense in L. 


Correction to solution. M. J. Pelling has pointed out that the statement of the theorem used in 
the published solution [1984, 589] is incorrect. The unit circle 


E={(x,y):x7+y*=1} 
evidently satisfies the hypothesis of the theorem since 
{x/y:(x,y) € E,y #0} =R, 
but, for every a © R, {x + ay:(x, y) € E} is bounded and so is not dense in R. The defect can 
be eliminated by replacing “{x/y:(x, y) © E, y # 0} is dense in R” in the hypothesis of the 
theorem by “for every n © Z", {x/y:(x,y) © E, |y| >} is dense in R”. The set E in the 
corollary satisfies the amended condition. 


A Determinant That Rarely Vanishes 


6455 [1984, 205]. Proposed by L. E. Mattics, University of South Alabama. 


Let a, =(-1)'"', a4, =i"! forl <i<p-—1,2<j <p—1, where p is an odd prime. 
Show that det(a,,) = 0 (mod p) if and only if £f={(—1)***k~* = 0 (mod p). Is there a prime p 
for which det(a,,) = 0 (mod p)? 


Solution by the editors. Unfortunately a misprint occurred in the statement of the problem. It 
should have read a,, = i/ instead of a,, = i’ ~', The correct matrix (a,,) is then 


1 1 1 nee 1 
—] 92 3 Lae Qp-l 
1 3? 33 ve 3P-1 
2 3 —1 
1 (p-1) (p-l1y os 6 (p- 1)’ 


For k = 1,2,...,p — 1, divide the kth row by k and then add all the rows to the final row to 
obtain the following matrix (mod p) 


1 1 1 1 
= 27! 2 2° QP? 
—37! 3 3? 3P-? 
—1 2 —2 
—(p— 2) (p-2) (p-2)  -+- (p-2)* 
S 0 0 a 0 


where S = YP2i(—1)**'k7!. The determinant of this matrix is 1!2!3! ---(p — 2)!S and conse- 
quently det(a,,) = 0 (mod p) if and only if S = 0 (mod p). To date the only primes for which the 
final congruence is known to hold are p = 1093 [W. Meissner (1913)] and p = 3511 [N. G. W. H. 
Beerger (1922)]. 
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Also solved by the proposer. Lajos Takacs solved the printed problem with the first congruence replaced by 
(mod p*). Various versions of the problem were solved by S. V. Kanetkar, Pierre Lalonde (Canada), O. P. Lossers 
(The Netherlands) and Robert E. Shafer. 


The Digamma and a Related Function 


6456 [1984, 205]. Proposed by Robert E. Shafer, Berkeley, CA. 


For x > 0, define 


. 1 ~ 1 
W(x) = im log» + x — 5} — Lk 


and 


W2(x) = lim los" x +x -5| => log(x +k) 
noe 2 7 k=0 
(a) Show that, for all x > 0, 
1 1/24 
b2(x) < 5 (x) -~—_-“4 _. 
(x — 1/2)” + 3/10 
(b) Prove that 


¥ (= 18" = (y — 51082} 1082, 
n=1 
where y is Euler’s constant. 


Solution to part (a) by the proposer. Note that p(x) is the digamma function ['’(x)/T'(x). By 
the mean value theorem we have, for k > 1, x > 0, 


0 < log(x + k) —log(x + k — 1) esp 

and hence 
0 <log(x +n) —logx— LES e-yp 

Letting n — oo, we obtain 
(1) 0 < logx ~ (x) <= for x > 0. 
Likewise we can show that 
(2) 0 < 5 log?x ~ y2(x) < 8 forx >e-1. 
In order to prove (a) we shall first show that, for x > 0, 
(3) BX = v(x +1) - a(x) 

> 5 (¥ (x + 1) - P(x) - I + 


(x+1/2)°+3/10 (x—1/2)°+3/10 
Since (x + 1) — (x) = 1/x, the right-hand side of inequality (3) equals 


1 1 
1 g(x) +h) +73 
x 2x} x44 x7/10 + 121/400 
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Consequently (3) is equivalent to 


1 x?/12 
4 x) < lgx -— = - ————_+—"_——__. 
(4) v(x) 2x x44 x*/10 + 121/400 


We shall in fact prove the tighter inequality 


1 1/12 
5 x) < logx - =— - ——+——_.. 
(5) W(x) < logx — 5 241/10 
To prove (5) we shall first show that 
1 
(6) ¥(x+1)-¥(x) == 
> loge tt - 1 ,i_ oo? n 7? | 
xX  WMxtI 2x (x41) +1/10 x? + 1/10 


which is equivalent to 
x+1 01/1 1 1 x+1/2 
x Xx x (x + 1/2)" —(3/10)(x + 1/2)” + 49/400 


This will follow from the tighter inequality 
x+1 x+1/2 1/6 1/20 
(8) jog2 t= <__*ti/* _____2/6 ___i/@ _ 
x (x+1/2)°-1/4 (x+1/2) (x + 1/2) 
Expanding in terms of x + 1/2, the left-hand side of (8) is 
— 1 
Loo 
n=0 22"(2n + 1)(x + 1/2)°"** 
while the right-hand side is 
1 1 1 — 1 
x+1/2~ 2 3 4 st 2n 2n+1 
2°-3(x+1/2) 2%-5(x+1/2) n=0 2°"(x + 1/2) 
and (8) follows. Thus (7) and (6) hold. From (6) we get 


(9) E (ba m+ 1) 9x + m)) = He +N) HC) 


x+N_ 1 ,i_ 1/12 , 1/2 
x 24x+N) 2x 2 241/10’ 
(x+N)°+1/10 x*+1/10 


> log 


which is equivalent to 

(10) B(x) <¥(x +N) —log(x +N) ++» 1? 
x(x+N)  (x+N)*+1/10 

114 

2x  x741/10 


By (1), (x + N) — log(x + N) > 0 as N — o9, and so (5) follows from (10). Thus (3) is 
established. It follows from (3) that 


+log x — 


(11) E (a(x + m+ 1) — oa +) = a x+ N) —da( a) 
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1 1/24 1/24 

> a(x + N) —¥*(x)) ~—____+/% _ + —_+f24 _ 

(x+N-—1/2)°+3/10 (x-1/2) + 3/10 
and so 

1 1/24 
(12) Y2(x) <¥2(x + N) —5 V(x +N) +4 
(x +N-—1/2)° + 3/10 
1/24 

(x — 1/2)” + 3/10 
However, from (1) and (2), #,(x + N) — 40?(x + N) > 0 as N > o, and (a) follows. Strictly 
we can only deduce (a) with < in place of < from (12), but a more careful analysis shows that 
(5) in fact implies (a) as it stands. 


+5 ¥(x) - 


A number of readers pointed out that part (b) is well known. See this MONTHLY, 87 (1980) 498 for references. 


Representations of Residue Classes 


6457 [1984, 259]. Proposed by R. L. McFarland, Yellow Springs, OH. 


Let p be an odd prime. Show that the 2°7~/* numbers of the form +1 4+2+---+4(p-1) 
represent each nonzero residue class modulo p the same number of times. Determine this 
common number of representations and show that it differs by one from the number of 
representations of the zero residue class. 


Solution by James Propp, University of California at Berkeley. Let 
§=e?"/P and S=(E+H+ gf) (¢ 4 ¢-?) 22 (YPD/2 4 CP D/72), 
Then 
S=ayt+agr+--- +a,_,f?*, 


where a, is the number of ways k can be represented (mod p) by +14+2+---+4(p-— 1). 
Since S$ is also equal to 


(get b/2 4 go PtD/2) wee (¢?-} 4 gD), 


we obtain 
fle sro (e| (fer) 
— fe@_pp\l at yy) ay? +0)? _ 
= PIV TE A + 2) = Day ay” 3 


(using the identity (x + y)(x + y£)(x + pS?) +++ (x + yh?71) = x? + y?). Hence S = +1. Since 


L+x+x?+---+x?7! is the minimal polynomial of ¢ over the rationals, it follows that 
ag —~S=a, =a, = ++: =4,_). 
Further, since ¢ + (7 + --- +27! = —1 and 


we have 
_ _ —-1)/2 _ — 2 


= 2?-)/?2 mod p, 
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so that S -(2), Thus 


a, = (20-97 -(2)] and a),=4a, +(4), 
Pp P Pp 


Also solved by Paul S. Bruckman, Barry A. Cipra, L. E. Clarke (England), Stephen M. Gagola, Jr., Ray Hill 
(England), S. V. Kanetkar, Pierre Lalonde (Canada), David Leep, O. P. Lossers (The Netherlands), L. E. Mattics, 
J. G. Mauldon, R. W. K. Odoni (England), R. G. E. Pinch (Scotland), Lajos Takacs, L. Van Hamme (Belgium), G. 
Velissarios (Greece), D. H. Wiedemann (Canada) and the proposer. Gerald Myerson pointed out that the problem 
appears as Theorem 1 of his paper, ““A combinatorial problem in finite fields, 1”, Pacific Journal of Mathematics, 82 
(1979), 179-187. The above solution is a bit simpler than his proof. 


REVIEWS 


EDITED BY ALLAN L. EDMONDS AND JOHN H. EWING 
COLLABORATING EDITOR FOR FILMS: SEYMOUR SCHUSTER 


Complex Variables with Applications. By A. David Wunsch. Addison-Wesley, Reading, MA, 1983. 
viii + 439 pp. 


GEORGE PIRANIAN 
Department of Mathematics, The University of Michigan, Ann Arbor, Michigan 48109 


Without inspecting the book, I committed myself to writing a review. When the book arrived, a 
mixture of incompetence and blood oozed from my pores. The MAA had beguiled me with its 
Editorial Guidelines for Reviewers. I had been careless about accepting a blind date, and I 
deserved the consequences. 

The Guidelines virtually forbid chapter-by-chapter summaries as well as criticisms of format or 
style, and they clearly disapprove of reports on classroom testing. “Other than this, almost 
anything goes.”’ Would the telling of Old-Testament stories be acceptable, and how near to it can 
we come in the review of a complex-variables text for undergraduates? 

Somewhere in this essay we must smite the rock and hope for two or three paragraphs about 
the book; but first we shall praise the Guidelines. We praise them because they promote instability 
and because they convey the message that some of the MONTHLY’s curators prefer a prophetic to a 
priestly orientation. I take the distinction from Kenneth E. Boulding’s essay Religious Perspectives 
in Economics (pages 179 to 197 in Beyond Economics, Essays on Society, Religion and Ethics; Ann 
Arbor Paperbacks, The University of Michigan Press, 1968, 1970). 

The prophet descends from the mountain or strides into the city from the desert. He may even 
arrive by ship or by whale. Whatever his apparel, his manners, and his station in life, he 
denounces well-established practices (such as sin, for example) and demands reform. His message 
may have minimal substantive merit; but it can claim a basic virtue: it promotes the examination 
of customs, of legal codes, of relations between components of society, perhaps even of the sacred 
conventions that separate good from evil. Occasional reassessments are essential for survival, 
because without them our society does not adapt to environmental changes promptly enough to 
remain viable. We need prophets, even as we need poets, actors, clowns, and lutenists. 

The priests dedicate themselves to the perpetuation of the faith, possibly also to what we call its 
refinement or its elaboration depending on whether the development fills us with joy or disgust. 
Without priests, the available supply of prophets would be woefully inadequate for the mainte- 
nance of reasonable cultural values; but when priests become too abundant and powerful, the 
prophet can no longer get a hearing. 


1985] REVIEWS 601 


In our book reviews, frivolity would be out of place; our daily communication suffers from a 
smothering component of fluff contributed by newspapers, radio stations, the commercial world, 
local and cosmic governments, and ourselves. Agreed, then, that the MONTHLY’s reviews should be 
solid; but if they are also stolid, we will not read them, and their publication will degenerate into a 
ceremony whose omission most of us would never notice. Therefore we should support the 
Editors’ campaign for vitality, even if our concept of vitality is so diffuse that we are unable to 
define it. 

Now we must smite the rock, and we must do it without worrying that the rock may splinter 
the rod. Had I known that David Wunsch is a Professor of Electrical Engineering, I would not 
have accepted the assignment. A chatty essay about the book’s subject should make clear why the 
description of certain physical phenomena is most effective if we formulate it in terms of 
harmonic functions. Also, it should point out that every harmonic function of two real variables x 
and y is the real part of an analytic function of the complex variable z = x + iy. Readers of the 
MONTHLY are sufficiently aware of the second point, and nobody has ever accused me of being an 
expert on the first. 

If I were to write a book about complex analysis, I would not apologize for using the letter i 
instead of 7; but with or without such an apology, my book would never succeed with a class of 
undergraduates in engineering. Watching Wunsch lead the youngsters by their hands, we marvel 
at the powerful flux of his affection. The author is able to share the students’ collective identity. 
Together with a sound knowledge of the subject and the ability to formulate intelligible 
statements, the capacity for identification with the students forms the basic requirement for 
successful pedagogy. 

To needle the Editors and to honor the Faithofourfathers, let us record that Chapters 6, 7, and 
8 carry the titles Residues and their use in integration, Laplace transforms and stability of systems, 
and Conformal mapping and some of its applications. Section 7.3 is entitled Principle of the 
argument and introduction of the Nyquist stability criterion. Chapter 8 ends with examples 
illustrating the use of the Schwarz-Christoffel formula, and it has an appendix under the title The 
stream function and capacitance. 

This review is not what the Editors requested; I hope they can nevertheless live with it. While 
we wait for perfection, we must tolerate each other’s inadequacies. As to the Old-Testament tales, 
we never got close enough to them to use even one word of Yiddish; the more’s the pity, but as the 
Good Book says, there is a time for everything. This includes a time for celebrating cultural 
affiliations and preferences and a time for pushing such sentiments into the background. Also, 
there is a time for treating complex analysis as if it were the North wall of the Eiger, a challenge to 
those who are in shape for a demanding athletic performance, and a time for presenting the 
subject to those who, if they are to be true to themselves, will always regard it as a tool for the 
solution of practical problems. Wunsch succeeds admirably. 


Classics of Mathematics. By Ronald Calinger (ed.). Moore Publishing Company, Oak Park, IL, 
1982. xxv + 742 pp. $18.95. 


HAROLD M. EDWARDS 
New York University, Courant Institute of Mathematical Sciences, New York, NY 10012 


A mathematician who decides to look into great mathematical writing of the past faces real 
difficulties. He may need to acquire new terminology or even a new way of formulating 
mathematical ideas. Background study will no doubt be necessary to gain some notion of the 
mathematical environment in which his text was written. If he is studying a range of topics or 
periods, he is likely to have to consult many different source books. And, alas, he is likely to find 
many of the texts written in a language he doesn’t know. 

Ronald Calinger’s volume does much to alleviate these difficulties. His anthology contains 
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translations of writers as diverse as Ptolemy and Hilbert, Bhaskara and Boole. It has 742 densely 
printed pages, crammed with some of the most important mathematical works of all time. There 
are also section introductions providing historical context for the works, and biographies of many 
of the writers. And, of course, it is all in English. The list price is $18.95. If there is the remotest 
chance you will ever want to look into the history of mathematics, you should order a copy of this 
bargain book. 

Surely the publisher intends the book to be used as a textbook for courses in the history of 
mathematics, and it will be useful for this purpose as well. It is pretty heavy going, however, for 
any but the most dedicated and well-educated students, and there aren’t many of those. 

You might have imagined that the texts in this anthology are photographically reproduced 
from other books and that this accounts for the low price. Not at all. It has been entirely set from 
scratch in a two-column-per-page format, so that, as I indicated above, there is a great deal on a 
page. I am not happy with the type face that was used, but the proof-reading appears to have been 
done extremely carefully and I find practically no misprints. 

It would be foolish to quibble with the selections. No two people who cared about a field could 
be expected to agree on such a vast number of individual choices. I disagree with some choices, 
but I admire the imaginativeness of others—for example, the use of a passage from Garding 
which quotes Riemann on the subject of the partial differential equations of physics and then 
comments briefly on modern views on the subject. I feel that in very many cases (Archimedes on 
the quadrature of the parabola, Euler on sums of two squares) passages are cut too short to be 
comprehensible. But there would be no other way to get so much into the book, and these snippets 
may be enough to give the reader the flavor of the work and send him to complete sources if he 
likes the taste. 

Perhaps it will give an idea of the virtues and shortcomings of the book if I examine one 
selection in detail. One of the selections of greatest interest to me personally is #109, Kummer’s 
first introduction of his theory of “ideal complex numbers”. It occupies 6 pages of the book, and 
Calinger introduces it with a page-and-a-half biography of Kummer. 

The biography is rather dry. It contains the basic facts, but the choices are sometimes odd. I 
can’t believe, for example, that many readers will be interested to know that Kummer’s father’s 
full name was Carl Gotthelf Kummer; on the other hand, they might well be interested to know 
that Carl Gotthelf died when his son Ernst Eduard was only 3 years old, a fact that is not 
mentioned. Another peculiar choice is the decision to mention that Kummer was in military 
service at the time that he sent his paper on the hypergeometric series to Jacobi. This seems worth 
mentioning only in connection with two amusing quotes. Kummer was a mere musketeer in the 
military, and, as he wrote to his mother, “he [Jacobi] will be amazed and at the same time pleased 
that a musketeer is dealing with the same matters that he is.” Jacobi’s reaction fulfilled his 
expectations. He is reported to have said, “If even musketeers now produce such outstanding 
mathematical works, I would like to have a look at what the noncommisioned officers are doing.” 
Neither line is quoted here. 

There are also two mathematical misstatements. It is said that Kummer used his theory of ideal 
prime factors “to prove the law of biquadratic reciprocity” and that he “discovered the 
fourth-erder surface’. In fact, the law of biquadratic (fourth power) reciprocity was proved (by 
others) before ideal prime factors were invented; Kummer used his theory to prove a law of pth 
power reciprocity for regular primes p (although more than 10 years of strenuous effort came 
between the invention of ideal prime factors and the proof of the reciprocity law). And, of course, 
it was a particular fourth-order surface—the Kummer surface—that Kummer discovered. 

The text of Kummer’s article itself is given in the translation of T. F. Cope from D. E. Smith’s 
Source Book in Mathematics of 1929. A few mistakes from the Smith volume are faithfully 
reproduced (mA — 1 instead of mA + 1, u instead of p in one crucial place, “number” instead of 
“numbers” in another, and a serious mistranslation which says “every” where it should say 
“some”, making an important idea unintelligible). The translation does not show good under- 
standing of the piece on the part of the translator, and contains at least one phrase—“ the theory 
of a form of the second degree in two variables with determinant, however, a prime number A, is 
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closely interwoven with these investigations”—which couldn’t possibly be understood except by 
reference to the German original. I sympathize with Calinger here; the editor of an anthology can 
hardly be expected to deal with subtle shortcomings of his sources. Calinger must be blamed, 
though, for his failure to identify the authors of the four notes at the end of the text. (The first 
note is Kummer’s, the other three were made by the translator.) 

These shortcomings should not be allowed to obscure the main fact: This paper of Kummer, 
which I consider to be of major significance in the history of mathematics, is here available, albeit 
in slightly garbled form, to readers of English. With the aid of the German original, a serious 
reader could surely use it to gain a good understanding of Kummer’s seminal ideas and his 
discussion of what motivated them. And I would like to think that the inclusion of the paper in 
this book will increase the likelihood that serious readers may do so. 

Two minor points: First, the picture of Newton on the cover and on the first page of the book 
is terrible; an original drawing, it makes him look like a boxer on the day after a rough bout. The 
drawings of Gauss and (Emmy) Noether are no better likenesses than the one of Newton, but at 
least they don’t make their subjects look like monsters. (As for the remaining drawing, that of 
Archimedes, it is of course wholly imaginary. It is innocuous enough.) Second, the title of selection 
#94 “From the Testamentary Letter Sent to Auguste Chevalier (May 29, 1832)” might well send 
readers to the original to see what has been omitted, but they will be disappointed— the selection 
in fact contains the entire letter. 

One virtue of the book that deserves special mention is its coverage of modern mathematics. 
Because logic and set theory are the only important subjects in modern mathematics that are 
accessible to a broad spectrum of mathematicians, most history books tend to give the impression 
that mathematics since 1890 has been dominated by Cantor and Gédel. While this book has a fair 
amount of this sort of thing, it presents other topics too—Hilbert’s problems, a passage from 
Lebesgue on integration, an excerpt of G. D. Birkhoff on the ergodic theorem, and a paper by 
Hasse, R. Brauer, and E. Noether on the theory of algebras (this last paper in an original 
translation by Kurt Bing). Not many readers will be able to follow the details of these works, but 
they may be able to get something from them; in any case they will learn that 20th century 
mathematics is not all about the axiom of choice and the continuum hypothesis. 

Though not without shortcomings, this is a rich and useful book that deserves to be in every 
institutional mathematics library and will be a welcome addition to many private libraries 
(including mine). 


Linear Algebra through Geometry. By Thomas Banchoff and John Wermer. Undergraduate Texts 
in Mathematics, edited by R. W. Gehring and P. R. Halmos. Springer-Verlag, New York, 1983. 
x + 257 pp. 


Morris W. HIRSCH 
Department of Mathematics, University of California, Berkeley, CA 94720 


I can still recall my delight—after suffering through a computational course on matrices—at 
learning (I think from Halmos’ book Finite-Dimensional Vector Spaces) the geometrical and 
conceptual proof of the diagonalization of Hermitian operators. Students who read this book will 
see (among other things) the essence of that proof. 

Geometry, in a broad sense, is a realm of sharply defined, thing-like mathematical 
objects—surfaces, spaces, sets—that we can see, if only with the mind’s eye. Other parts of 
mathematics seem—to the geometrically oriented—to be somewhat more vague and harder to 
grasp: processes, relations, operations, equations, functions. Even these have become increasingly 
geometrified: it is much easier (for me) to understand a function as a set of ordered pairs than as 
a “rule” or collection of formulas. In fact the development of modern mathematics is to a great 
extent the enormous expansion of the geometric realm. 
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At the opening of the nineteenth century geometry meant the study of physical space, which 
was taken to be Euclidean. A few towering geniuses ventured into other geometries. Gauss 
confined his discussion of the n-cube to private correspondence, and his non-Euclidean geometry 
to his notebook. Cauchy hesitantly wrote of an n-tuple of numbers as an “analytic point’, while 
Cayley wrote about n-dimensional geometry but explicitly denied making any assertions as to the 
existence of such spaces. To lesser mortals, however, space was flat and three dimensional. 
Mobius, for example, puzzled by the relation between an asymmetric tetrahedron and its mirror 
image, wrote that they were not congruent because to move one onto the other would require 
more than three dimensions, and “such a space is unthinkable”’. 

The great geometrical liberation that took place during the latter half of the nineteenth century 
was marked by three events: the popularization of non-Euclidean geometry; Riemann’s founda- 
tions of differential geometry, and especially his crucial distinction between physical and mathe- 
matical geometry; and the development of set theory. With set theory it became possible to talk 
about and name hitherto unspeakable things, such as the set of all n-tuples. 

Along with the growth of geometry per se came the geometrization of other mathematical 
concepts. Complex function theory, for example, acquired a permanent geometric component 
through Riemann’s introduction of the surfaces named after him. As a result, the murky process 
of analytic continuation became visualizable and analyzable by drawing curves on surfaces. In this 
century a good part of analysis became understandable—and computable—in terms of the 
geometry of Hilbert space. 

At the same time algebra was also developing. Like other branches of mathematics, it was seen 
more and more in terms of sets and mappings. While Peacock and others saw abstract algebra as 
merely codifying operations on numbers, Hamilton insisted that algebra could reflect any mental 
operations whatever. As a result new systems could be invented that did not obey the same laws as 
numbers—for example, his non-commutative quaternions. 

If geometry lets us see what we are thinking about, algebra enables us to talk precisely about 
‘what we see, and above all to calculate. Moreover it tends to organize our calculations and to 
conceptualize them; this in turn can lead to further geometrical construction and algebraic 
calculation. 

A simple example, studied in the book under review, is the set of relationships among systems 
of linear equations, matrices, and linear transformations. Solving a system of equations is a typical 
example of what I mean by a “process’—we don’t say what it is, it is not an object, we just go 
ahead and do it. Along the way we do other processes, such as “substitution” and “cancellation”. 
There is nothing wrong with doing these things, but it is not easy to study systematically what we 
are doing. One way of studying this process has become part of the field of numerical analysis; 
but there is a need for a more structural study, aimed at answering questions such as “how many 
solutions are there?” and “how can we know before we try to solve whether there is in fact a 
solution?” 

Such questions have been answered by the theory of matrices. (I like to think of matrices as 
geometric objects— they have diagonals, rows, columns, symmetries, etc.) The processes of solving 
become precisely defined operations on matrices. In particular, substitution becomes matrix 
multiplication. 

An ever better way to think about equations is in terms of mappings, more precisely, linear 
transformations. The product of two matrices now represents the composition of mappings. With 
this interpretation, associativity of matrix multiplication is simply a special case of the trivial 
associative law for mappings. This is much better than the direct computational proof, not so 
much because the latter is tedious, but rather because it affords no conceptual insight into why the 
law is true (other than that a proof exists). 

This proof for 2 X 2 matrices is given by the authors in Chapter 2.3 (one of their innovations is 
the use of fractional chapter numbers). Even better, it is given simultaneously with the proof of 
associativity for linear transformations of the plane. This exemplifies the authors’ approach: “We 
lead the student to an understanding of elementary linear algebra by emphasizing the geometrical 
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aspects of the subject.” 

Except for two brief chapters the topics are all in dimension 3 or less. They include dot and 
cross products, determinants, matrices, isometries, eigenvalues, conics, quadrics, linear transfor- 
mations. 

The book is very elementary. Complex numbers are not mentioned until the last page. 
Determinants are defined only for 2 x 2 and 3 X 3 matrices. The theorem that every subspace of 
R” has a well-defined dimension is stated but not proved. The concept of rank is not mentioned. 

What the book covers is done well. The discussions of geometric concepts and their relations to 
algebraic notions are clear and well motivated. The high point is the proof of the spectral theorem 
for 3 X 3 matrices. This is applied to the classification of quadric surfaces. (The authors begin the 
proof, appropriately, with a geometrical appeal to the graph of a cubic to obtain a real 
eigenvalue.) 

For my taste the book is not geometrical enough, however. For example, while diagonal, 
permutation and shear matrices are defined, there is no discussion or picture of the linear 
transformations to which they correspond. There is no discussion of similarity transformations, 
except orthogonal ones for diagonalizing symmetric matrices. This precludes the important insight 
that similar matrices corresporid to the same linear transformation seen in different coordinate 
systems. Systems of equations are interpreted through matrices, but are not explicitly discussed in 
terms of linear transformations. 

The greatest lack is the complete exclusion of complex numbers. Complex roots of the 
characteristic polynomial are not mentioned, despite their clear geometrical interpretation as 
rotation angles for orthogonal transformations. 

Of course, these are primarily questions of taste. Since the book has evidently been carefully 
and skillfully planned, the authors must have deliberately excluded many topics for the sake of 
brevity or the desire to keep matters elementary. The subjects they include are presented with 
admirable clarity and simplicity. Every student of mathematics should be exposed to them as early 
as possible—preferably in high school. In college this book would make an excellent supplemen- 
tary text for an elementary linear algebra course, or the calculus course which covers similar 
topics. 


LETTERS TO THE EDITOR 


Material for this department should be prepared exactly the same way as submitted manuscripts (see the inside 
front cover) and sent to Professor P. R. Halmos, Department of Mathematics, University of Santa Clara, Santa 
Clara, CA 95053. 


Editor: 


Reading Professor Krantz’ article in the MONTHLY was enough to make any red-blooded 
mathematician’s heart swell with pride, maybe even with indignation. Computer Science (whatever 
that is) is sexy and fun while mathematics is important and difficult. They get their picture in 
Time magazine while we revolutionize Volvo carburetors. There they go again. 

Unfortunately, when the thrill of the prose was over I realized that the article had little relation 
to reality. What Professor Krantz has done is to construct two “straw fields” in the image of his 
prejudices, and proceed to compare them. Let’s take a look at what the current state of 
Mathematics and Computer Science really is. 

First of all, Computer Science majors do not limit their training to trivial programming in a 
couple of languages and fussing with trivial hardware questions. As far as I can see, Computer 
Science programs require students to take at least two years of Calculus, a term of Linear Algebra, 
often Number Theory, and courses such as Compiler Design, Comparison of Languages, Artificial 
Intelligence and Analysis of Algorithms. These last courses are often difficult, abstract, and 
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require the same skills which Professor Krantz assigns to Mathematics: abstract reasoning, 
modelling, and problem-solving. The difference is that the students with the greatest abilities in 
these areas used to go into Pure Mathematics; now, a large percentage of them go into “Computer 
Science”. Some of the students who still major in Mathematics do so because—sad to say—the 
difficulty and competitiveness in Computer Science is too great. Not every C.S. major is a 
mindless hacker; not every Math. major is capable of proving anything beyond the simplest 
theorems. 

It is difficult to evaluate hearsay, non-quantified random stories such as Professor Krantz 
supplies us: Big Companies Want Math. Majors. What Big Companies want is students who have 
the creative skills that the Big Companies need. Their initial reasoning was that the attractiveness 
of high-tech jobs would encourage some of the best students to go into Computer Science 
programs. This seems to be the case. On the other hand the pressure to set up these programs and 
the scarcity of people competent to teach in them and willing to work in academic conditions has 
made the Computer Science degree an undependable indicator of ability; so has the large 
enrollment of students with variable ability. As usual, the business community would like the 
academic community to find some certification system which always works. Unfortunately this 
has never happened in any field at any time. Any company which thinks that a Computer Science 
or Mathematics degree guarantees creativity in an industrial setting is bound to be disappointed in 
the long run. 

If we’re going to talk folktales, let’s recall the stories of the same Big Companies telling 
Mathematics Departments that, yes, many of your students are bright all right, but they can’t 
avoid being abstract; they can’t appreciate technical nitty-gritty applied work, even if the 
problems are interesting; they don’t know enough Physics, Chemistry, Engineering, etc.; they 
don’t want to work on hard technical problems. Sure, a lot of this is humbug, but it certainly 
should teach us to avoid stereotyping. All that Professor Krantz can legitimately say is that those 
Mathematics majors with the best problem-solving abilities will be of more use to the Big 
Companies than those Computer Science majors who are not as talented. I imagine most of us 
suspected this. One might try to estimate the percentage of very able students in each of the two 
fields, but it 1s difficult to measure or predict creativity; furthermore, even if we could, unless the 
percentages were wildly disparate it is not clear what practical conclusions could be drawn. 

It is not just impolite to trivialize another person’s field: it can often prove embarrassing. If 
Professor Krantz learned all he needed of Fortran in one afternoon, then that can only indicate 
that what he needed could be learned in one afternoon. My Mathematics or Computer Science 
students can also learn to do simple number-crunching in Fortran in one afternoon. On the other 
hand, my students also learn about Trees, Graphs, Hashing, Recursive Algorithms, etc.,—com- 
plicated techniques not do-able in Fortran. If Professor Krantz needed this complexity in his 
work, he might find one afternoon insufficient to learn it. Furthermore, in solving a problem a 
well-written program often can give useful results on a personal computer while a poorly-written 
one may need a Cray to rescue it: analyzing algorithms is a highly non-trivial subject which is 
central to Computer Science yet rarely dealt with by Mathematics majors; nobody learns it in an 
afternoon. 

Finally, Professor Krantz seems to believe that Computer Science is somehow an unidentified 
field which no one has tried to analyze. All I can do in this space is refer him to Professor Donald 
Knuth’s writings—e.g., this MONTHLY, April 1974, ten years ago, or March 1985—for enlighten- 
ment and references. Professor Knuth is a person who knows both fields: Mathematics and 
Computer Science. He sees an analogy between theorems in Mathematics and algorithms 
(numerical and otherwise) in Computer Science. While one field is young, the other venerable, he 
foresees a close collaboration for a long time. Let’s not ruin it by speaking in haste and ignorance. 


Mark Bridger 

Department of Mathematics 
Northeastern University 
Boston, MA 02115 
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RUBIK’S GROUPS 


EDWARD C. TURNER 
Department of Mathematics and Statistics, SUNY at Albany, Albany, NY 12222 


KAREN F. GOLD 
Department of Mathematics, University of California at Los Angeles, Los Angeles, CA 90024 


1. Introduction. Few puzzles have captured the public fancy the way Rubik’s cube has. The 
beauty of the puzzle, beyond its mechanical ingenuity and elegance and colorful appearance, lies 
in the contrast between the sheer impossibility of solving the cube by chance (better to bet on the 
proverbial snowball) and the existence of algorithms simple enough that many children master 
and even discover them. The value of the cube for the teacher of mathematics is that it provides a 
setting that is interesting in its own right and in which most of the important notions of 
elementary group theory are illustrated. The purpose of this article is two-fold. First we describe a 
uniform method—-The Method—for solving any “Rubik’s-type puzzle” (defined in the next 
section). As is the case with all descriptions of solution algorithms, we provide a list of basic 
moves, outline how to use them to unscramble a puzzle, and assure the reader that they will 
always suffice. Despite its generality, The Method is conceptually very simple, illustrating the fact 
that generalization often leads to simplification. The second purpose is to analyse the “Rubik’s 
Group #” whose elements are the various states of the puzzle and whose group operation 
describes composition of moves and to prove the adequacy of The Method. The analysis will 
involve an excursion into group theory in which permutation groups, direct sums, homomor- 
phisms and exact sequences appear naturally and which leads to an explicit description of # as a 
semi-direct product of easily described factors. A by-product of this structure is a notation scheme 
that handles composition easily. We also discover that the octahedral puzzle is unique among 
Rubik’s type puzzles in that edge flipping is impossible. In order to make the paper as accessible 
as possible to beginning students, we have collected in Appendix 1 a list of definitions of terms 
that might not be introduced in a first course in algebra. The first use of each term appears in 
italics. 

This paper 1s an outgrowth of a senior research project by Gold under the direction of Turner. 
It began with a study of the Tetrahedron {Pyraminx} and the Cube; as more puzzles came on the 
market, namely the Impossiball, Alexander’s Star and the Megaminx (a dodecahedron), the scope 
of the project widened to consideration of the general puzzle, resulting in the current analysis. 


2. The Method. In this section we describe The Method—an algorithm that will solve any 
Rubik type puzzle. A Rubik type puzzle is a regular solid (see Appendix) composed of pieces 
corresponding to vertices, edges and faces of the solid and for which a basic move is the rotation 
of a face, including edge and vertex pieces. This gives 5 possibilities corresponding to the 5 regular 
solids, which are listed in Table 1 of the next section. The theory also applies to any puzzle whose 
moves are turns about a vertex—e.g., Alexander’s star or the Impossiball—by the trick of 
dualization. Inside any regular solid is another whose vertices are the centers of the original faces 
(so a cube contains a smaller octahedron) and vertex turns on the larger correspond to face turns 
on the smaller. To solve a vertex turning puzzle, imagine the dual face turning puzzle inside and 


Karen Gold: I am presently a graduate student and teaching assistant in the Mathematics Department at UCLA. 
My primary mathematical interests are combinatorial group theory and algebraic geometry. My hobbies include 
Russian studies, folklore, boardgames, and the department’s daily tea. 


Edward Turner: I did my undergraduate work at the University of Rochester, graduate work at UCLA, and spent 
two years as an instructor at MIT before moving to SUNYA in 1971. My early mathematical interest was in 
differential topology, but in the last five years I have become increasingly interested in combinatorial group theory. 
My non-mathematical interests include running, squash, and folk dancing. 
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solve it. This gives another four puzzles (the dual of the tetrahedron is the tetrahedron). Each 
puzzle has simpler versions that don’t have all three types of piece: the Impossiball has only faces, 
Alexander’s Star only edges and the Rubik’s Pocket Cube only vertices. Rubik’s Revenge and 
obvious more general n X n Xn versions are not Rubik type puzzles in our sense. They admit 
another type of move not equivalent to any combination of face moves—the “slice” moves that 
turn a plane of pieces parallel to face. 

For each puzzle, consider one edge joining vertices a and b together with the adjacent edges on 
the two faces that meet at that edge, as indicated below. 


a 
; ra aa 
b b 
Tetrahedron Cube Octahedron 
Dodecahedron Icosahedron 


We denote this schematically as shown in Fig. 1, with the understanding that c = d and e =f 
if faces are triangles and that there are other edges on top and bottom if the vertex degree is more 
than 3. 


Small letters label vertices, 
numbers edges, 
and capitals face terms. 


Fic. 1 


We denote by L and R the clockwise rotations of the faces labeled L and R and use the 
convention that moves in a sequence are applied from left to right. After some experimentation 
with short sequences of basic moves, one discovers the usefulness of the commutator (see 
Appendix) of L and R™', which has the effect shown in Fig. 2. 

C is the basic building block of the solution which proceeds in four steps. The face pieces are in 
solved position by the nature of the puzzle and serve as references in placing vertex and edge 


pieces. 
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Effect of C= [L,R~']= LR™'L™!R 


Vertex effect 


Rotation is referenced to 
direct parallel translation 
with positive fractions 
denoting clockwise 
rotations. 


Edge effect 


The effect of C on 
Orientation is to preserve 
the small transverse arrows. 


Fic. 2 


Step 1. Place the edges 


C is a 3-cycle on edges. By conjugation we can obtain any 3-cycle of edges; for let 7 be any 
sequence that moves 1 to 4, 2 to 5 and 3 to 6 (such T always exists)—then T~'CT cycles 4, 5 and 
6. Since 3-cycles generate the alternating group, we can manage any even permutation of the 
edges. On all puzzles except the Cube, only even permutations are possible and we are done. On 
the Cube, if an odd edge permutation is desired, one basic move—inducing an odd edge 
permutation—converts it to an even permutation. Thus C and its conjugates (see Appendix) 
suffice to place all the edge pieces correctly. 


Step 2. Orient the edges 


Flipping edges in pairs is particularly easy on puzzles with vertex degree 3: let p denote 
clockwise rotation of the puzzle 1/3 of a turn about the diameter through vertex a. (Of course one 
never actually does the p'. This is the only time we use a rigid motion of the entire puzzle—we 
do so only for ease of description.) As the number of flipped edge pieces is always even, this move 
suffices to right them all. (See figure at the top of p. 620.) 


620 EDWARD C. TURNER AND KAREN F. GOLD [November 
Effect of [C, o ]}=Cp C~'p~! on edges 


flip flip 


Edge flipping on the octahedron and icosahedron is complicated by the presence of other edges 
on top; we show in Lemma 2 of Section 3 that edge flipping on the octahedron is in fact 
impossible. A double edge flipper for the dodecahedron is shown below. (See Fig. 3.) 


Dodecahedron 


[L,Ro']Tp'T'[Ro',TRITT, 


Fic. 3 


After steps 1 and 2 have been completed, the edges are in solved position. The remaining steps 
leave them there. 


Step 3. Place the vertices 


We will see that once edges are placed, the vertex permutation must be even and that it suffices 
to have a move that cycles 3 vertices without effecting the edges. Let B denote clockwise rotation 
of the bottom face—the one across edge bd from L in Fig. 1. Then [C, B] permutes vertices by 
(abd) and has no effect on edges. 


Step 4. Orient the vertices 


We will see that it suffices to rotate two vertices in opposite directions. The move [C’, B] 
rotates b clockwise and d counterclockwise (as viewed from outside) and does not affect edges. 


In the next two sections we will see that this algorithm will restore any scrambled puzzle to its 
original—often called pristine—state. It is surely inefficient in terms of time compared to others, 
but is conceptually quite simple and simple in practice in that very few conjugates of basic moves 
are necessary. In fact, the cube can be solved using conjugates only in Step 1. Furthermore, the list 
of basic moves is very short and easily remembered: 


C [Ce] [¢,B] [C*,8B] 
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(except for the exotic double edge flipper). 

On puzzles lacking edges (like the Rubik’s Pocket Cube and the Impossiball) steps 1 and 2 are 
unnecessary and on those lacking vertices (like Alexander’s Star) steps 3 and 4 are unnecessary. 
There is, however, a complicating factor when faces are absent, as they are on all the above—it is 
much harder to decide what moves to make without reference face pieces. One learns from 
practice how to deal with this problem and we will ignore it. 


3. The Rubik’s group @. With each regular solid is associated a Rubik’s type puzzle and a 
Rubik’s group, defined below. Table 1 gives the appropriate numbers for each puzzle and the last 
row sets our general notation. 


face vertex + of + of # of 
Regular Solid Group degree degree vertices edges faces 
Tetrahedron T 3 3 4 6 4 
Cube 6 4 3 8 12 6 
Octahedron 0 3 4 6 12 8 
Dodecahedron Q 5 3 20 30 12 
Icosahedron I 3 5 12 30 20 
Generic R P q V E F 

TABLE 1 


We imagine the central mechanism of the puzzle as fixed, so the face pieces don’t move, and 
refer to the F rotations of a face through 27/p radians clockwise as “basic moves”. An element 
of the group & is a sequence of basic moves, where it is understood that two sequences represent 
the same move if their effect on the puzzle is the same. Otherwise stated, # is the quotient of the 
free group (see Appendix) generated by the basic moves by the normal subgroup of expressions 
that leave all vertex and edge pieces in their original positions and orientations. 

We analyse # by separating the position effect from the orientation effect and the vertex effect 
from the edge effect. Ignoring orientations, each element of # permutes the edges and vertices, 
defining a map pos (for position) 


pos 
R- Sr x Sy; 


where S, is the permutation group on n letters. We denote the kernel of pos by F#, the position 
fixed subgroup of moves that do not move pieces from their original positions and the image of 
pos by X. Thus by definition we have an exact sequence (see Appendix) 


11> F 28ST 1, 


where i denotes inclusion. 
The remainder of this section is devoted to identifying A and }—the analysis of # is 
completed in the next section with a description of how ¥ and » interact. 


LEMMA 1. 
(i) If B+ @, then =A, X Ay. 
(ui) If R= G, then L = {(0,p)|o and p are both even or both odd }. 
LEMMA 2. There are exact sequences: 
(i) Ff REO, 
E V 


{Xr Sum 
0-F > @2,0@2Z,—-2,xZ,>0, 
1 1 


where 
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E V 
Sum((f,,---f¢).(%15---.%)) = Sf Ei) 
1 1 
(ii) If R= 0, 
r 4 Sum 
0-F- @©Z,-2,-0. 
1 


This is clearly a split short exact sequence (see Appendix), so this implies that F is isomorphic 
to 
E-1 V-1 
® Z,08 ® Z,. 
1 1 


The splitting is not canonical (see Appendix), however, and it is more natural to think of F asa 
subgroup of ®fZ, ® 0; Z,. 


Proof of Lemma 1. In case (i), the face degree p is odd, so that each basic move induces an 
even permutation on both edges and faces: thus) C A, X A,. A, is generated by 3-cycles and 
Step 1 of The Method shows how to achieve any desired 3-cycle of edges. Given (0, 4) € Ay X Ay, 
let M be a sequence of basic moves so that pos(M) = (0, p’). Now p’ is even so (p’) ‘pp € Ay 
and Step 3 shows how to find a sequence of moves N such that pos(N) = (id,(p’)~ 'w). Then 
pos( MN ) = (0, ), so pos 1s onto A, X Ay. 

In case (ii), the face degree is 4, so each basic move induces an odd permutation on both edges 
and vertices—the total parity is even, so 


d, C Ag X Ay U(Sg\ Az) X(Sy\ Ay). 


The argument of case (i) shows that © > A, X A,. If o and p are both odd, then let B be any 
basic move and consider pos (B)- (0,1) = (0’,p’). o’ and p’ are both even, so (0’,p’) € XL; 
thus (o, 4) © » and we are done. 


Proof of Lemma 2. The key to Lemma 2 is the question, “How do you measure the flipping of 
an edge piece or the rotation of a vertex piece when it has changed position?” It may seem 
paradoxical to ask this question when we are studying the position fixed subgroup—we do so 
because we need to think of elements of ¥ as products of basic moves, which do move pieces. To 
answer the question, consider the graph whose vertices are the vertices of the puzzle and whose 
edges join two vertices that can be obtained one from the other by the application of a single basic 
move. (In fact, these will correspond to the edges of the puzzle, but they should be thought of in 
this way.) Now choose a tree (see Appendix) containing all the vertices, and for each edge of the 
tree, choose a basic move that moves a vertex piece along that edge. 

Shown below, in heavy lines, is a particular choice of a vertex tree for @. (See Fig. 4.) 


FIG. 4 
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The letters are the standard Singmaster notation (see Appendix). A tree has the property that 
there is only one way to get from one vertex to another in the tree. The answer to the key 
question, for vertex pieces, is then “compare the effect of the move in question to the effect of the 
unique sequence of basic moves that moves that vertex along the tree using the labeling moves on 
each edge.” This gives a standard of comparison for every possible position change. Thus, for 
example, to move the lower-back-right vertex of @ to the lower-front-right position, the standard 
sequence is B?U~!F?. The move R, rotation clockwise of the right face, has a different 
effect—namely rotation 1/3 of a turn clockwise from the standard. 

On the general puzzle, for each move M and each vertex piece v, we have an integer r,(M) 
(mod qg) that measures how many (27/gq) radian clockwise rotations are necessary to move the 
standard reference position for that vertex move to the one given by the move in question. In a 
similar manner, consider the edge graph whose vertices are the edge pieces of the puzzle and 
whose edges join pieces that can be gotten one from the other by a single basic move. The edge 
tree is any tree in the edge graph containing all its vertices—it is not necessary to label the edges 
of this tree since the moves are uniquely determined. Shown below is a particular choice of edge 
tree for @, with labels included for convenience. (See Fig. 5.) 


Then to each move M and edge piece e we associate an integer f,(/) (mod 2) that measures 
whether or not the edge piece e is flipped relative to the standard by the given move. 
This defines a map 
V 


Rg 5 Z,@ @Z,, 
f(M) = (f(s fe (M)), 1M) = (4,1), 4,0). 
The proof of Lemma 2 for #2 # © will be complete when we have verified the following claims. 
Claim1. (f X r)|F is a homomorphism. 
Claim 2. Image(f X r) C ker(Sum). 
Claim 3. Image(f X r) > ker(Sum) if ZF ©. 


Note that f < r is not ahomomorphism on # because of the movement of edges and faces: in 
fact, it is easy to check that 


r,(.M,M,) = 7,(M,) + hi wy( Mp), 


f.( MM, ) = f,(M,) + foe (Ah), 


(* *) 
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where pos( M,) = (0,, ,). 

Claim 1 is clear from equations (* *). It is also clear that Sum(f X r) = ©, f,,u,7,) is also a 
homomorphism, so that to verify Claim 2, it is only necessary to prove that Sum(f x r)(B) = 0 
for any basic move B. Suppose B is the basic move of a face depicted below (we take p = 5 for 
ease of exposition—the argument is general; see Fig. 6). 


4 


V, V3 
FIG. 6 


The indicated vertices are the only ones moved, so 
Sum(r(B)) = 7, (B) +7,,(B) +7,,(B) +7,(B) + 7,,(B). 


But formula (* *) says that the right-hand side is r,(B°), since B'(v,) = v,,,. Since B° = id, 
71%, (B) = 0 (mod 5). 


An analogous argument works on the edges. 


(REMARK. It is at this point of the argument that (* *) is essential—an arbitrary method of 
measuring twisting and flipping would be inadequate.) 

The proof of claim 3 for 2 # © is an exercise in the use of The Method that is left to the 
reader: it involves explicit construction of moves whose images under f < r generate ker(Sum). 

In the case of the octahedron, the above analysis holds except that: 

(a) No edge flipping is possible, so that Z, factors measuring flips are absent; 

(b) the vertex rotations must come in units of 2(27/4), not 27/4 as expected. 


Both of these can be seen by dividing the faces into two classes—say rough-textured and 
smooth-textured—so that each edge separates a rough face from a smooth one. The possibility of 
doing this is easily seen to be equivalent to even vertex degrees. Then each basic move and so all 
of © preserves the texturing. It follows that edge pieces can’t have their rough and smooth facelets 
interchanged nor can vertex pieces be rotated by 27/4 or 3(277/4). 


4. The semi-direct product structure on #. 


DEFINITION. Suppose A and B are groups and g: B — Aut(A) is a homomorphism. Then the 
semi-direct product of A and B on @ is A X B as a set with the product 


(a,b)(a’,b’) = (ap(b)(a’), bb’). 

It is routine to check that this gives a group structure. It agrees with the direct product if and 
only if @ is the trivial homomorphism. The following standard theorem describes three equivalent 
ways to express this notion: see for example, W. R. Scott, Group Theory, Prentice-Hall, NJ, 1964, 
p. 213. 


PROPOSITION. The following are equivalent: 
(i) There is a split short exact sequence 
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a B 
1-A-GeB- 1. 
y 


(iil) G= A X,B for some 9. 
(111) G has two subgroups A and B so that 


A<aAG,AN B= {id} and G=AB. 


REMARK. Several ~’s may correspond to the same 9: if ~,(b) = ~,(b)c, where c € Centralizer 
of A, then y, and y, will determine the same 9. In particular, this will happen if A is abelian 
and c € A. This illustrates how m may be canonical in a setting in which y is not. 

We now have the tools to complete the analysis of @. Assume edge and vertex trees, with 
labels, have been chosen and let y: } > & by 


v(o,n) = M, 
where pos(M) = (0,p) and f,(M) =17,(M) = 0 for all v and e. (* *) implies that y is a 
homomorphism. Corresponding to ~, we have g: & — Aut(F) by 


P(O,B)((Ar--sfe)s (stv) = (OCA fe) Wty), 


where o(/f,,...,f-) and pw(n,...,7,) are the sequences obtained by permuting the entries 
according to o and um. 


THEOREM. (1) The following is a split exact sequence: 
lo F> R21. 
Y 


(li) R= FX YX. 
(iii) B has subgroups F and W(x) such that Fa BR, FO ve) = {id} and FX = &B. 


Proof. By the proposition, it suffices to prove any one of the three and to check that y and 
are correctly related. That y splits pos is immediate, verifying (i). The proposition says that 
y(o,) (f) should “permute the edges and faces by o and p without flips or twists—then flip 
and twist according to f—then restore edges and faces to their original positions without flips or 
twists.”” This is clearly just what m(o,u) (f/f) does. We point out that ~ was not canonical, 
depending on the choice of tree, but g is canonical. 


FIG. 7 


The theorem gives an explicit description of # which provides a convenient notation that we 
illustrate for Rubik’s Cube. We label the vertex and edge pieces of C with numbers and letters 
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respectively as below, relative to which the permutations induced by the basic moves in 
Singmaster’s notation can be easily determined. (See Fig. 7.) 


Move Value of pos 
R (a b ec dad) 23 4) 
L (i f hk), (857 8 6)) 
U (be f g), (3 25 6)) 
D (dlek j) €Q 48 7)) 
F (a j h e), (217 5)) 
B ((c gi 1), (43 6.8)) 


Now using the vertex and edge trees for C given in §3, we can determine the flip or rotation 
effect on each piece moved—the effects are filled in below in the gap representing the piece move 
with the absence of a label indicating zero. 


Move Extended notation 

R (a bc dy “2 3 4')) 
L (i fi hk), (5! 7) 8 6)) 
U (b e f g), @B 2! 5 6')) 
D (dl k jj, @ 48 7')) 
F (a j h e) (21 7 5)) 
B ((c gil 1), (4 3 6 8)) 


Clearly the extended notation determines the move completely. Furthermore, composition of 
moves is described by multiplying permutations in the usual way (left one applied first) and filling 
each gap with the sum of the numbers in the gaps that give rise to it. For example 


RU = ((a' bi cd"), (1'2'34')) -((befg), (32'5'6")) 
=((a' efgb' cd’), (17 5'6'34')(2')) 


where, €.g., 


1 2 
132351 - 5. 
R (RU) 
This notation allows us to easily draw a non-obvious conclusion. Note first that it is a relatively 
easy exercise to check that any move can be effected without using one of the six moves—The 
Method, for example, can be used to show how to obtain the effect of F from the other 5. 


CLAIM. It is not generally possible to solve the cube using only 4 of the 6 moves. 


Easy proof. If the two stationary faces are adjacent, the edge piece between them can’t be 
moved, and the claim is obvious. If. not, they are opposite and we may assume, without loss of 
generality, that they are R and L. But a glance at the chart shows that this makes edge flipping 
relative to the chosen tree impossible. Q.E.D. 


5. Some group theory related to the method. In this section we describe some interesting 
subgroups of # generated by basic commutators which provide almost all the moves needed to 
solve a puzzle. We encounter two groups K and J—discussed below—that are interesting in their 
own right. Verification of statements not explicitly discussed is tedious but routine. 


DEFINITION. For group # and a degree g vertex v of the corresponding solid, #, is the 
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subgroup generated by basic commutators of the form C = [L, R~'] for the g pairs of adjacent 
faces containing v. For an edge e, #, is the subgroup generated by all commutators of moves of 
the two faces separated by e—namely [L, R],[L~', R],[L,R-'] and[L7', R™'}. 


The groups &,, 


#, moves g edges and g + 1 vertices. In all cases, any edge flip, corner rotation or even edge 
permutation of these pieces possible in # is also possible in &,—the interest lies in the realizable 
vertex permutations. 


Case (i). R=I,6,D (q = 3); Image(pos) = A, X K. 
Case (ii). R=O (q = 4); Image(pos) = A, X As. 
Case (iii) R=IF (q = 5); Image(pos) = A, X I. 
The groups &, 


If faces have degree p = 3 (respectively p = 4,5) then &, moves 5 edges and 4 vertices 
(respectively 5 edges and 6 vertices). #, has no flip or rotation effect (pick the right tree) so #, is 
isomorphic to Image(pos). 


Case (i). R=7,0,4%(p=3) Image(pos) = A, X K. 
Case (ii). R= G@,D (p=4,5) Image(pos) = A, xX I. 
The Klein four group K 


K is the four element subgroup of A, consisting of double transpositions. That double 
transpositions do not generate A, (as they do A, for n > 5) indicates why it is necessary to bring 
in the move B in Step 3 of The Method. Abstractly, K is just Z, © Z,. 


The icosahedral group I 


TI is the subgroup of A, generated by the two 5-cycles X¥ = (12345) and Y = (16235) obtained 
as follows. For &,,, case (iii), label vertices as shown in Fig. 8 and let C,,C,,...,C,; be the basic 
commutators corresponding to edges 1-6, 2-6,..., 5-6; then 


X = (12345) = C'C.CC, 
Y = xX-(C,C,)° X. 


2 
3 A=[L,R] 
5 
3 B=[L7},R] 
C=[L,R™ 
, [L,R-1] 
D=[L"',R"*] 
5 
1 2 6 4 
FIG. 8 FIG. 9 


For @,, case (ii), label vertices as shown in Fig. 9; then X = CA, Y = (CD)’. 
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I is a very famous group which occurs naturally in a number of different settings and is known 
by names that reflect its various origins. It has order 60 and is the smallest example of a 
non-abelian simple group (see Appendix). We end with four descriptions of J other than as 
Image(pos). 

(1) T=¢xX,Y|X° = Y° =(XY) =(Y'X) =). 

This notation, from combinational group theory, means that J is a group characterized by the 
fact that it has two generators X and Y satisfying the listed relations and such that any other 
relations are derivable from these together with the group axioms. Descriptions of this type, called 


presentations, are completely general and very efficient but rarely display the special characteris- 
tics that make a group interesting. 


(2) I =A,, the alternating group on 5 letters. 
This is the most familiar form of 7. Here 
X = (12534), 
Y = (13452). 
In this form, it is clear that J has elements of orders 1, 2, 3 and 5 only and it’s easy to count 


how many of each there are. Furthermore, all elements of order 2 are conjugate, all of order 3 
conjugate and there are two conjugacy classes of elements of order 5. 


(3) I~ PSL(2,5). 
SL(2,5) = | (2) 


SL(2, 5) 
{+I} - 


v-[(E er Oh 


(4) I = the symmetry group of the icosahedron. 


a,b,c,d€& Z,,ad — bc = i}. 


PSL(2,5) = 


Here 


In this incarnation, J is the group of rigid motions of the icosahedron—i.e., length and angle 
preserving linear maps (orthogonal maps) of R® that carry an icosahedron centered at the origin 
back onto itself. Every orthogonal map of R° is a rotation about some straight line through the 
origin and those in the icosahedral group are of three types depending on whether the line passes 
through the center of an edge, the center of a face or a vertex. If L and R denote adjacent faces as 
well as clockwise rotation about the line through the center of the faces, then 


X=R, 
Y = L’. 


It is a delightful coincidence that the icosahedral group appears in the analysis of the 
icosahedral puzzle. 


Appendix 1— Definitions. 


Regular Solid. A regular or Platonic solid is a convex solid bounded by planar faces each of 
which is a regular polygon with the same number of edges and such that the same number of faces 
meet at each vertex. It was known to the ancient Greeks that Table 1 lists all possibilities—a 
proof appears in Book 13 of Euclid’s Elements. 


Commutator. The commutator of x and y in a group G is [x, y] = xyx_'y *. (Some authors 
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use [x, y] = x ‘y 'xy.) It measures the degree to which x and y fail to commute with one 
another. 


Conjugate. In a group G, a is conjugate to b if there is a c so that a = cbc _'. Conjugate 
permutations always have the same cycle structure. 


Free group. A free group is a group free of any relations not demanded by the group axioms. A 
group free on symbols X),..., X,, is the set of all strings—called words—of X,’s with exponents 
+1 in which occurrences of X,X,' and X,'X, have been deleted, together with 1 interpreted as 
the empty word. The multiplication is just juxtaposition followed by the above deletions. The 
discipline of combinatorial group theory views all groups as quotients of free groups as in the first 
description of J in Section 5. 


Exact sequence, short exact sequence. An exact sequence is a sequence of group homomor- 
phisms such that the image of each is the kernel of the next. A short exact sequence (the only kind 
considered in this paper) is 5 terms long, beginning and ending with trivial groups. 


a B 
1->A-G-B- 1. 


If all groups in the sequence are abelian, denote the trivial group by 0: otherwise 1. 


Split short exact sequence. A splitting of the exact sequence above is a map y: B — G such that 
Beow(b) = b. If G is abelian, the existence of a splitting says that G is the direct sum of A and B: 
this follows from the Proposition of §4. 


Tree. A graph is a tree if it is connected and contains no cycles. This is the same (for finite 
graphs) as saying that the number of vertices is one more than the number of edges. An important 
property of a tree is that there is exactly one way to get from one vertex to another in the tree 
without retracing. 


Singmaster notation. Singmaster’s notation—which has become standard—denotes the basic 
moves on the Rubik’s cube by letters indicating the face being rotated clockwise (see Fig. 10): 


Fic. 10 


L = left U = up F = front 
R = right D = down B = back. 
(Note: L and R here should not be confused with L and R in The Method). 


Canonical. A map or construction is canonical if it does not depend on arbitrary choices. In §3, 
gm is canonical but w is not, since ~ depends on the trees chosen. 


Simple group. A group is simple if it has no non-trivial normal subgroups, or equivalently, no 
non-trivial quotients. 


His main work is invariant under translation. (See p. 661.) 
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The screening time per question worked becomes —__t_. Thus 
[ a(x) ax 
0 
y + I xg( x) dx 
0 
6(1) = —— 
[esas 


[fe ax] 


= (18(1) ~ 9(7) 8(7))/f' a(x) ax 


o(1) = 


o'() = (—8(7)( (1) - Vex) dx. 


Since y > 0, lim, _,9¢(7) = +00 and lim, _,,,¢(7) = a + y. Also the graph of ¢ cannot cross the 
diagonal from below. It follows that ¢ has a unique fixed point and it occurs at the min of ¢. 

We now restate our answer to the question posed at the beginning. If, after screening, the 
estimate of the time to work the problem is longer than the average time spent per problem 
worked, then move on to the next problem. This answer may not be terribly helpful to the test 
taker and so we present in a table the results of the numerical calculations for the two models 
above for a 60 question three-hour exam with E(T) = 6. 


po E(S) E(W) Best t Answers Screens 


Alternative 
model 


Thus, the first model suggests that, on this test, a problem should be screened for 1.77 minutes 
and then if it is estimated that the time needed to work the problem is more than 4.561 minutes, 
go on to the next. The second suggests screening for 1.4 minutes with a moving-on time of 4.855 
minutes. 


Reference 


1. Beda Chan, Moving On, The Actuary, vol. 18, no. 1, Jan. 1984, 4-5. 


ANSWER TO PHOTO ON PAGE 630 


Alfred Haar (1885-1933). 


ON THE SECOND DERIVATIVE TEST FOR CONSTRAINED 
LOCAL EXTREMA 


D. SPRING 
Department of Mathematics, York University, Downsview, Ontario, Canada M3J 1P3 


1. Introduction. A well-known mathematical problem which arises in numerous mathematical 
and applied contexts is the determination of the local extrema (i.e., the local maxima or local 
minima) of a function which is subject to subsidiary constraints. Let f, g,: U > R be differentia- 
ble functions, 1 < a < m, where U is open in R” and m < n. The constraints are defined to be 
the equations g, = 0,1 <a<m.Let A Cc U be the solution set of the constraints. The problem 
is to determine the local extrema of the function f: A > R, where f is the restriction of the 
function f to the solution set A. To illustrate, let (x,,..., x,,) be coordinates in R” and consider 
the simple constraints x, = 0,1 < a <_m. The solution set is A = UNM V, where V is the linear 
subspace of R” obtained by setting the first m coordinates equal to zero. Setting f(x,,.1,---»X,) 
= f(0,...,0, X,,.441)--+»X,), then subject to these constraints we find that f has a local extremum 
at a =(0,...,0,4,,.3,---,4,) € A if and only if the function f has a local extremum (in the 
usual sense) at @ = (a,,,,,---,4,). In particular, a is a critical point of the function f, ie., all the 
first order partial derivatives of f vanish at @. For gene:al constraints the function f is known 
only implicitly; hence the local extrema of f cannot be calculated directly. 

The classical approach to solving the problem of constrained extrema involves the well-known 
method of Lagrange multipliers [4], [6]. Let L:R” X U-R be the Lagrangian function of 
(m + n) variables: 


m 
L(Ay5 0003 Amo Xp92 29 Xp) =f %y5--05%,) FY Ayal X15---5%X,)- 
a=] 
Fix a © A and suppose for technical reasons, the map g = (g),...,2,):U > R” is of class C’ 
and has maximal rank (= m) at a. This hypothesis ensures that, near a, the solution set A is an 
(n — m)-dimensional submanifold of R”. A principal result is the following [4], [11]: Subject to 
the constraints, if f has a local extremum at a € A, then for some vector c = (¢,,€),-.-, Cn) © 
IR”, the vector v = (c,a) € R”™ X U isa critical point of the Lagrangian L. That is, all the first 
order partial derivatives of L vanish at v: 


m 


0 
(1.1) La) + YS eg f0(a)=0, 1<j<m; g(a)=0, 1<a<m. 
J a=] J 


Generically then, the solution of the problem of constrained extrema is to be found among the 
critical points of the associated Lagrangian function. 

However, the critical point condition (1.1) is only a necessary condition—not a sufficient 
condition, in general, for a constrained local extremum. The determination of appropriate 
sufficient conditions is the central problem addressed in this paper. 

It is convenient to record, for reference, the standard approach to sufficient conditions. In the 


unconstrained case, m= 0, L=f and a € U 1s a critical point of f: f(a) =O0,l<i<n. 


Assuming f is of differentiability class C*, one expands f in a Taylor Series about a (h = 
(h,,...,4,) € R"): : 


David Spring: The problem of an optimal second derivative test, addressed in this paper, came to my attention 
during a recent teaching assignment of Calculus at York University. I received my Ph.D. in Differential Topology 
(1967) at the University of California, Berkeley, under the direction of M. Hirsch. I have taught also at the 
University of Toronto and, during a seven-year period in France, at the University of Paris, Orsay, and the 
University of Provence, Marseilles. Current interests include topological existence theorems for solutions to 
non-linear P.D.E.’s. I enjoy folk and dance music, reggae music in particular. 
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n 2 
(1.2) fae hy=Ma) +5 Lage (ahh, + RoC), 
Rh 


where lim, _,)>—> 
Sead 2 
WAI 


= 0. The character of the critical point a is analysed by examining the 


quadratic form 


that appears in (1.2). Note that the matrix associated to g is the Hessian matrix 


2 
Hf(a) = eka] . 


The remainder term is “negligible”, so the following result obtains (Edwards [6, page 138}): 


PROPOSITION 1.1. If q is positive (negative) definite, then f(a) is a strict local minimum 
(maximum) value; if q attains both positive and negative values, then f does not have a local 
extremum ata © U. 


REMARK 1.1. In the remaining cases, i.e., g is degenerate (det Hf(a) = 0) and is positive or 
negative semi-definite, the character of the critical point a is necessarily indeterminate at the level 
of second order partial derivatives (for a complete discussion see 4.6, §4). For example, let 
f(x, y) = x? + y4, (x, y) = x? — y*. The origin in R* is the only critical point of f, g and the 
associated quadratic form in both cases is g(x,y) =x, which is degenerate and positive 
semi-definite. However, f has a strict minimum at (0,0) while g does not have a local extremum 
at (0, 0). 


For a numerical algorithm to distinguish the three cases of Proposition 1.1, one examines the 
sequence (I), <,<, = (i,...,1,,) of principal upper left k x k minors of the Hessian matrix 
Hf(a) (Edwards [6, page 149}): 

In the case +, +, +,... of all positive signs, g is positive definite; in the case —, +, —, 
+,... of alternating signs, g is negative definite; assuming det Hf(a) #0 (so that q is 
nondegenerate), any other sequence of signs, or if zero terms occur, implies that q attains both 
positive and negative values. 

In the constrained case, 1 < m <n, recall (technical hypothesis) that the solution set of 


2=2= °°: =g,, = 0is a submanifold of R” near a. Let gq, be the auxiliary quadratic form 
1 &L 
q,(h) ~ D) . a ax, dx, 0) hihy: 


Subject to the constraints g, = 0,1 < a < m, the behaviour of f near a is analyzed by examining 
the restriction of the quadratic form q, to the linear subspace of R” consisting of the tangent 
vectors to the manifold A at a (Edwards [6, page 154]). Thus, in analytic terms, one studies the 
quadratic form g, which is subject to the m linear constraints: 


n 


) 
(1.3) Sy —22(a)h,=0, l<a<m. 
1 Ox; 


PROPOSITION 1.2 (Edwards [6, page 154]). Subject to the linear constraints (1.3), if q, is 
respectively positive definite, negative definite, attains both positive and negative values, then, 
subject to the constraints g, = 0,1 <a < _m, the function f has respectively, a strict local minimum, 
a strict local maximum, neither a local minimum nor a local maximum at a. 


REMARK 1.2. Proposition 1.1 is the special case of Proposition 1.2 obtained when m = 0 
(L = f; Q.= @). 


1985] ON THE SECOND DERIVATIVE TEST FOR CONSTRAINED LOCAL EXTREMA 633 


REMARK 1.3. In the remaining cases, i.e., subject to the linear constraints (1.3), q, is 
degenerate and is positive or negative semi-definite, the character of the critical point v is 
necessarily indeterminate at the level of second order derivatives (cf. (4.6), §4). 


To complete the analysis of Proposition 1.2, one requires a numerical algorithm which classifies 
quadratic forms that are subject to linear constraints. H. B. Mann [10] classified positive or 
negative definite quadratic forms which are subject to linear constraints. The noted mathematical 
economist G. Debreu [5] generalized Mann’s work by classifying positive or negative semi-definite 
quadratic forms which are subject to linear constraints. Since a quadratic form attains both 
positive and negative values if and only if it is neither positive nor negative semi-definite, Debreu’s 
algorithms suffice to distinguish, in principle, the three cases of Proposition 1.2. This completes 
our review of sufficient conditions in the literature. 

In this paper we propose a different approach to the problem of sufficient conditions for 
constrained local extrema. Our approach is to change coordinates in a neighbourhood of the point 
a & U so that the solution manifold A is “flattened” near a and so that the constraint functions, 
in new coordinates, are the simple constraint functions g,(x,,...,X,) =Xq,l<a<m, dis- 
cussed above. In the presence of simple constraints, appropriate sufficient conditions are obtained 
in terms of an algorithm which classifies quadratic forms, without constraints. 

Suppose v = (c,a@) is a critical point of the Lagrangian function L. A special feature of our 
approach is that the sufficient conditions are stated in terms of the (m+n) X (m+ n) Hessian 
matrix HL(v) of the function L evaluated at the critical point v (the matrix of second partial 
derivatives of L evaluated at v). 


0 0 

0 _ 0 ra (a) -_ 7x (4 ) 

fm sen 
0 wee 0 ax, 6%) wae ax, (4) 

Explicitly, HL(v) = 8, dg, “L aor, 
Ox, 4 Joss a (a) anil”? vo Ox,dx, 6”) 

Ag, An a7L OL 
pla) Sa) GA (0) (0) 


Note that the lower right n X n submatrix of HL(v) is the Hessian matrix of the auxiliary 
quadratic form 


qi(h) = y= Ce) nih, 


i, jai 
which was employed in Proposition 1.2. A natural question concerns the precise statement of a 
second derivative test (i.e., a numerical algorithm) in terms of HL(v) which distinguishes the three 
important cases: 


(1.4) Subject to the constraints g, = 0,1 < a < m, f has a (strict) local minimum value at a, or 
a (strict) local maximum value at a, or does not have a local extremum at a (the latter is known 
as the saddle alternative). 


The principal result, Theorem 1, distinguishes the three alternatives of (1.4) and in addition 
provides a complete saddle alternative, which seems to be new, in cases where det HL(v) = 0. 
Theorem 1 employs a sequence of principal minors of HL(v). This sequence is implicit in some 
older analyses of the problem (e.g., page 74 of Hancock [9]), but perhaps was first introduced 
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explicitly by H. B. Mann in this MONTHLY [10]. Unfortunately Mann’s test, and its generalization 
by G. Debreu [5], have been largely forgotten in the mathematical literature. However, in recent 
years Mann’s test has figured prominently in textbooks on the applications of mathematics to 
economics, where it is sometimes known as the method of bordered Hessians (Ostrosky and Koch 
[13], Y. Murata [12]; see also J. Marsden and A. J. Tromba [11]). A key observation underlying 
Theorem 1 is that Mann’s test is easily understood in the case of simple constraints discussed 
above. 

While the literature on the basic method of Lagrange multipliers is extensive, the literature on 
the second derivative test is sparse; in all the references of which I am aware, the saddle 
alternative of (1.4) is omitted (Hancock [8], [9], Carathéodory [3], J. Marsden and A. J. Tromba 
[11], Y. Murata [12], F. Bowman and F. A. Gerard [2], K. G. Binsmore [1]). Some texts on 
advanced calculus provide sufficient conditions for constrained local extrema but do not state a 
numerical algorithm for distinguishing the three alternatives of (1.4) (C. H. Edwards [6], R. 
Courant and F. John [4}). 

We have attempted, in this paper, to develop a definitive form of the second derivative test for 
constrained local extrema. This has necessitated a more detailed study of the classification of 
quadratic forms than one finds in the usual literature. An appendix on quadratic forms has been 
added for this purpose. 


2. Statement of the theorem. Let (y,), < ;<, be a non-trivial sequence of real numbers (i.e., not 
all the terms are zero). Let y, be the last non-zero term in this sequence. 


DEFINITIONS: 


2.1. The sequence (y;) is positive semi-definite if y, > 0, j = 1,...,k. 

2.2. The sequence (y,) is negative semi-definite if (—1)’y, > 0, j = 1,...,k. 

2.3. The non-trivial sequence (y,) is of saddle type if neither (2.1) nor (2.2) holds. That is, either 
some y, = 0, j < k, or the sequence of signs is not as indicated in (2.1), (2.2). 

2.4. The sequence (y,) is positive (negative) definite if it is positive (negative) semi-definite and 
k =r (1e., the last term y, # 0). 


We interpret these definitions in Proposition 2.1 below. 


Notation. (a) A principal jth order minor of a square n X n matrix M is the determinant of a 
principal j X 7 submatrix of M (1e., a submatrix obtained by deleting (n — /) rows and the 
corresponding (n — /) columns of M). 

(b) Let M be a square n X n matrix. To each permutation 7 of {1,...,} let M(1) denote the 
matrix obtained from M by permuting the rows and columns of M by 7. Associated to 7 is the 
linear change of coordinates: (x,,...,X,) > (Xmays+++»%m(ny)» Thus M(7) = E~' X M X E, for 
some orthogonal matrix F (depending on 7). 

The principal background algebraic result that we require is as follows: 


PROPOSITION 2.1. Let Q be a real-valued quadratic form on IR’ whose associated matrix is M (in 
some basis of R”). 


(a) Q is positive definite (negative definite) if and only if the sequence (y,)\<,<, Of principal 
upper left minors of M is positive definite (negative definite). 


(b) QO attains both positive and negative values if and only if there is a permutation 7 of {1,...,7r} 
such that the sequence (y,(7)),<,<, Of principal upper left minors of M(1) is non-trivial and of 
saddle type. 


Proof. Conclusion (a) is well known (Edwards [6, page 149]). Conclusion (b) does not seem to 
appear in the literature. The “if” part of (b) is trivial and is a consequence of Lemma 2.1 below 
(note that M(7) is the matrix for Q obtained by permuting the given basis of R” by 7). The 
“only if” part is proved in the Appendix. 


1985] ON THE SECOND DERIVATIVE TEST FOR CONSTRAINED LOCAL EXTREMA 635 


LEMMA 2.1. Let q be a real-valued quadratic form on R” whose associated matrix is N (in some 
basis of R"). If the sequence (y,),<,<n Of principal upper left minors of N is non-trivial and of 
saddle type, then q attains both positive and negative values. 


Proof. Suppose y, is the last non-zero term in the sequence. Let V be the subspace of R” 
generated by the first k vectors in the given basis of R”. Denote by g the quadratic form on V 
obtained by restriction: g(x) = q(x), all x € V. Since y, # 0, it follows that g is non-degener- 
ate. Since the sequence (y,),<,<, 18 of saddle type, it follows from the classification of 
non-degenerate quadratic forms (Edwards [6, page 149]) that g (and hence q) attains both 
positive and negative values. 


With these preliminaries, we state our second derivative test as follows: Let T, = (—1)” X 
principal upper left kth order minor of the Hessian Matrix HL(v),1< k < m+n. In particular, 
Diem = (-)"det HL(v). Note that in case m = 0 (no constraints), then L = f and v=aeu 
is a critical point of f. In this case, I’, is the principal upper left Ath order minor of the Hessian 
matrix Hf(a),1<k«<n. Let m7 be a permutation of {1,...,m+n}; let [,(7) =(—-1)” X 
principal upper left Ath order minor of the matrix HL(v)(7), 1<k<m+n. Only the 
sequences 

(T 


2m+p)i<pen—m = (Tyntio++:olmen) and (Tymep(™))1 cpen—m 
are used in the test. 


THEOREM 1 (Second Derivative Test). Suppose v = (c,a) € R”™ X U is a critical point of the 
Lagrangian function L. Assume f,g,:U > R,1 <a < m;m <n, are of differentiability class C° 


and that g = (),---,;8m):U ~ R”™ has maximal rank (= m) ata. In particular, by relabelling 
the variables x,,...,x, if necessary, assume 
O( 81 5-++5 Bn 
(815-++8m) (a) #0. 
O(X,,...,X,,) 


(a) Suppose [, , #0 (i.e., det HL(v) # 0). 

(i) If the sequence (Vymsp)icp<n—m |S positive definite, then subject to the constraints, f 
has a strict local minimum ata © U. . 

(ii) If the sequence (Vymsp)icp<n—m IS negative definite, then subject to the constraints, f 


has a Strict local maximum ata © U. 


(b) Suppose the sequence (Vym4p)rcpen—m 1S nontrivial and of saddle type (in particular we do 
not assume det HL(v) # 0, only that T),,,, # 0 for some p). Then subject to the constraints, f has 
neither a local maximum nor a local minimum at a © U. This case is known as the saddle alternative. 


(b)’ (Comprehensive saddle alternative). Suppose there is a permutation 1 of the last (n — m) 
rows and columns of HL(v) (this corresponds to the permutation of the last (n — m) variables in 
RR (X56 66 Xp) > (0156-63 Xs Xa madyo+++>Xany)) Such that the sequence (Vym4p(T))icpen—m 
associated to the matrix HL(v)(7) is non-trivial and of saddle type. Then subject to the constraints, f 
has neither a local maximum nor a local minimum ata © U. 


REMARKS. 


2.1. Suppose, in (b), [,,,,, # 0 (ue., det HL(v) # 0). This case is known as the non-degenerate 
saddle alternative and is equivalent to: not ((i) or (1i)). Thus if [,,,,, # 0, the second derivative 
test is complete. Also, if det HL(v) # 0, then (b), (b)’ are equivalent. 


2.2. Clearly (b) is a special case of (b)’ (let 7 = id). In practice, if (a) fails, then (b) usually 
applies. For purposes of illustration, Example 3 below demonstrates that (b)’ may apply where (b) 
fails, in case det HL(v) = 0. 


2.3. The only indeterminate case—in which Theorem 1 implies no conclusion about the character 
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of the critical point v—is that for which [,, , = det HL(v) = 0 and (b)’ fails. In terms of the 
analysis of sufficient conditions provided by Proposition 1.2, §1, this corresponds to the case that, 
subject to the linear constraints (1.3), the auxiliary quadratic form q, is degenerate and is positive 
or negative semi-definite (cf. (4.5), (4.6), §4 for complete details). As will be explained in §4, the 
degenerate semi-definite cases are necessarily indeterminate at the level of second order partial 
derivatives. In this sense, the statement of Theorem 1 cannot be improved. 


2.4. In case m = 0 (i.e., no constraints) and [, = det Hf(a) # 0, Theorem 1 is the well-known 
second derivative test for a local extremum of a function f at a critical point (Edwards [6, page 
145]). Note that conclusions (b), (b)’ improve and complete this classical second derivative test 
when det Hf(a) = 0. 


2.5. Proofs of part (a) of Theorem 1 with n = 3 and 1 < m < 2 appear in various textbooks (e.g., 
Bowman and Gerard [2]). However, these textbook treatments generally either ignore the saddle 
alternative or assume that det HL(v) # 0. 


2.6. In case m = n — 1, there is only one term in the sequence, ie., [’,,,,, = (—1)"det HL(v). In 
this case, the hypothesis that g: U — R™ has rank m at a © U is sufficient (i.e., no relabelling of 
the variables is necessary). Note also there is no saddle alternative in this case. 


We illustrate Theorem 1 with some examples. 


EXAMPLE 1. f(x, y,z) = xyz subject to x7 + y* +z? —1=0. This example is taken from 
Marsden and Tromba [11], page 229. A critical point for 


L=xyz+X(x? + y? + z*-1) 


is v = (0,1,0,0). For this critical point, one calculates T, = 0;T, = —det HL(v) = —4. Apply- 
ing Theorem 1, we see that this is the saddle case. Lacking a saddle alternative, Marsden and 
Tromba (uncritically) conclude that v falls in the test fails, i.e., indeterminate case. 


EXAMPLE 2. To find the minimum distance between a point (x,, x3) of the circle x* + y* = 2 
and a point (x5, x,) of the line x + y = 4 (see page 156 of Edwards [6]) we seek a local minimum 
value of 


f (15 %25%3 5X4) = (2 - x2)” + (x3 - x4)” 
subject to the two constraints 
81 (X15.%2,%3,%4) = x? + x? —2=0; 
(1 5X2,X3,X4) =X. + x4-4=0. 


The critical points of the Lagrangian L=f+A,g, +A,g, are: v, = (1, —2,1,2,1,2) and 
v, = (—3, —6, —1,2, —1, 2). From the bordered Hessian 


0 0 2x 0 2x, 0 
0 0 0 1 0 1 
2x, 0 2+2, -2 0 0 
AL(vy=| 9 yy 2 0 0 |’ 
2x, 0 0 0 242A, -2 
0 1 0 0 -2 2 


one calculates that I, = 32 and I, = 64 at v,, while [5 = —32,T, = —192 at v,. Applying 
Theorem 1, one concludes that f has a local minimum (in fact an absolute minimum) at 
(1, 2,1, 2), corresponding to the points (1,1) on the circle and (2,2) on the line, while f does not 
have a local extremum at (— 1,2, —1,2), corresponding to the points (—1, —1) on the circle and 
(2,2) on the line. These conclusions are clear geometrically. 
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EXAMPLE 3. f(x, y,z,t) = y? + z* — t? — x’, subject to x — z = 0. Clearly f is a non-degen- 
erate quadratic form which becomes degenerate and attains both positive and negative values on 
the three dimensional subspace defined by the linear equation x = z on R*. We check this by 
Theorem 1. The critical points of 


L=y?+27-1—x?+X(x-2z) 
are of the form v = (21,1,0,t,0),t € R. Computing the 5 x 5 Hessian of L, one calculates 
T, = 2; 1, = 0; U5 = 0. The sequence 2, 0,0 is positive semi-definite so Theorem 1, part (b) does 
not apply. However, by interchanging z, ¢ one obtains the Lagrangian 
yrttP—27-—x*+A(x-1t) 


for which one computes IT, = 2;T., = —4; I = 0. Since the sequence 2, — 4,0 is of saddle type, 
one concludes from Theorem 1, part (b)’, that subject to the constraint x = z, f does not have a 
local extremum at any point (7,0, 7,0) € R*,r ER. 


3. Proof of Theorem 1. We first prove the theorem for the simple constraints x, = 0, 1 < 
a < m, introduced in §1. The analysis of HL(v) for these constraints will elucidate the role of the 
signed minors I,,,,,,1<p<n-—m. Let 


F( Xppapo+e+s Xp) =S(0,..-,0, Xp ys e229 Xy)e 
The associated Lagrangian function is L = f + U”"_,A,x,, whose critical points are of the form 


v=(c,a) €R” X U, 


where 
(i) a= (0,...,0, a,43,...,a,) € R” and @ = (a,,,,,...,4,) is a critical point of f. 
. E m 
(i -(- 2 (4),...,-FL(a)} eR 


(3.1) Evidently, 


a°L o*f 
dx, OX, (v) = dx;0x, 


(4), m+1l<i, j<n. 


Consequently, HL(v) assumes the block matrix form 


On. m Tn Onn —m 
HL(v) =| I, BOC ; 


0,,— mm Cc’ Hf (a) 


Ox, Ox; (a) is the Hessian matrix of f evaluated at the critical point a = 


(,,419°++s4,)3 0, , IS the zero r X s matrix, and J, the identity r X r matrix. Let (A,), <p <n—m 
be the sequence of principal upper left minors of the Hessian Hf(G); let (S,):<p<n+m be the 
sequence of principal upper left minors of the Hessian HL(v). By definition, 


n=(-)D"S,, l<k<men. 
Employing elementary properties of determinants, one verifies easily that 


(3.2) (i) Som =(-1)" (ii) Som+p = (-1)"4,, l<p<n-m. 


where Hf(a@) = Ge 


(3.3) Consequently, I,,,., = (—1)7"A, =A,,1<p<n-—m. That is, the sequences 
(Tymnspi<pen—m and (A,) <p cn—m coincide. 


The relations (3.3) explain the sequence of signed determinants introduced by Mann [10]. To 
prove the theorem, we observe that: 
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(3.4) Subject to the constraints x, = 0,1 <a<™m, f has a (strict) local minimum or a (strict) 
local maximum or does not have a local extremum at a € A, if and only if the function f has 
respectively a (strict) local minimum or a (strict) local maximum or does not have a local 
extremum at d = (a,,,1,.-.-,4,). 

Let 


be the quadratic form associated to the function f at the critical point 4 (cf. (1.2), §1). Applying 
Proposition 1.1 to f and also applying the classification results Proposition 2.1 (a), Lemma 2.1 to 
the quadratic form g, we find, from the relations (3.3), (3.4) above, that conclusions (a), (b) of 
Theorem 1 are proved. 

It remains to prove conclusion (b)’. From the displayed form above of the matrix HL(v), it is 
evident that, for any permutation 7 of the last (n — m) rows and columns of HL(v), the principal 
upper left 2m X 2m submatrix of HL(v) remains unchanged, while Hf(a) is changed to 
Hf (a)(7). 


(3.5) Consequently, the proof of (3.3) shows that the sequences (I5,,4,(7))i<p<n—m and 
(A,(M)<p<n—m coincide. 

‘Applying Proposition 1.1 to f and also Proposition 2.1 (b) to the quadratic form g, we find 
from the relations (3.4), (3.5) above, that conclusion (b)’ of Theorem 1 is proved. 

To prove Theorem 1 for general constraints, we change coordinates to reduce to the case of 
simple constraints. Let 


9:R"™ XU OR" XR" =R™ 
be the map 
D(Aps- 22s Nps Xz 92029 Xp) = (Ags ees Ape Zp9 2889 Zn) 
where 
Zu = Bal Xpo009Xp)) LS AKMIZ, 4; =Xmaz, Mtl<i<n. 


In block matrix notation, 


Tn Onn 
(3.6) De(v) = | On m Dg(a) 
0, — m,2m D, — m4 | 
O(g,,..-5 — , 
Since Fea (a) + 0, it is clear that the C? map » has maximal rank (= m+n) at 
X15+++yXpy 


v =(c,a) € R™ X U and hence, by the Inverse Function Theorem [11], (A,,...,A,.5Z15-++5 Zn) 
defines a coordinate system in a neighbourhood of w = p(v) in R”*”. Write 

p '(A,z) = (A, ¥(z)) ER” X U, 
and let 


L=Log '=f(¥(z))+ DY AQze- 
a=] 


Thus L is the Lagrangian associated to the C? function f oy, subject to the constraints z, = 0, 
1 <a <~m. Since v is a critical point of L, then w = p(v) is a critical point of the Lagrangian 
L= Le og '. Evidently, the following equivalence obtains: 


(3.7) Subject to the constraints g, = 0,1 <a<™m, f has a (strict) local minimum or a (strict) 
local maximum or f does not have a local extremum at a € A if and only if, subject to the simple 
constraints z, = 0,1 <a <m,foyw has respectively a (Strict) local minimum or a (strict) local 
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maximum or f° wy does not have a local extremum at ay), (ao) = a. 


The equivalence (3.7) indicates that we should compare the sequences of principal upper left 
minors of the Hessians HL(v), HL(w). This is accomplished by the following lemma. 


LEMMA 3.1. HL(v) = P” X HL(w) X P, where P = Do(v). 
COROLLARY 3.1.1. Let 7 be a permutation of {1,...,m+n}. Then HL(v)(7) = P'(m) X 
HL(w)() X P(n). 


COROLLARY 3.1.2. Suppose, in addition, ma fixes pointwise {1,...,2m)} (i.e., a induces a 
permutation of the last (n—m) rows and columns of (m+n) x (m+n) matrices). Let 
(T,.(7)), <k<men be the sequence of principal upper left minors of HL(w)(1). Then 


Pamsp) = | S880) (a)]-Pagagl)s OSD nm 


COROLLARY 3.1.3. For each permutation a as in Corollary 3.1.2, both the sequences 
(Dymep(Mr<pen—-moCoimsep(Mhicpen—m> are, respectively, trivial or positive (semi-) definite 
or negative (semi-) definite or of saddle type. 


It is clear that Corollary 3.1.3 and the equivalence (3.7) above prove Theorem 1 for general 
constraints. 


Proof of Lemma 3.1. This lemma is a special case of a well-known result: A change of 
coordinates induces a congruence of the Hessians at the corresponding critical points. Explicitly, 
recall that L = Log. Introducing the auxiliary notation, 


(Uy 5-65 Up an) = (Ags. es AXms X1908 29 Xp )s 
(0150655 Uman) = (AqseessAms 2190829 2nd 
and 
(015-555 Pman) RR” x U- RR", 


we find, from a straightforward Chain-Rule computation, that 


a-L mtn 927 dy, . Og, a 
Ju,du, °°?) ~ » 9y,40,\") Bu, (0) 3 u, 62)? 1 <i, J<mrn, 


at the critical points v and w = y(v). That is, 
HL(v) = P’ xX HL(w) X P, where P = Dq(v). 


Proof of Corollary 3.1.1. Let E be the orthogonal (m+ n) X (m+ n) matrix corresponding to 
the permutation 7. Thus 


HL(v)(7) = E~! X HL(v) X E=E™! X P™’X HL(w) XPXE 
= P'(r) X HL(w)(7) X P(z). 
Proof of Corollary 3.1.2. From (3.6), the principal upper left 2m X 2m submatrix of P is 


_m Onn D,(g)(a) 
where 


D,(g)(a) = Es 98 (a | 


<i, <m 


In particular, 
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I( 81 5-+-) 8m) 
det P,,, = I(x. x, ) -OM* 0). 


Observe that if 7 is any permutation of the last (n — m) rows and columns of P, the matrix 
P(7) assumes the form 


Py B 
(3.8) P(m) a ha ; | 


where B is a 2m X (n — m) matrix which depends on 7. 


Since HL(v)(7) = P'(7) X HL(w)(7) X P(m), a simple block matrix calculation, employing 
(3.8) (note also P’(7) = P()"), proves Corollary 3.1.2. This completes the proof of Theorem 1 
for general constraints. 


4. In this section we explain the connection between Theorem 1 and the classification result 
Proposition 1.2, §1. To begin, we classify quadratic forms which are subject to linear constraints. 
Let q = 3} ,=14,,x,x, be a real valued quadratic form on R"; A = [4,,], <i; <n. SUppose 
is subject to m linear constraints, L7_,5,;x,; =0, 1<i<m,m<n.Let B=[bjheicmicys<n 


be the coefficient matrix of the constraints. We assume the m X m submatrix [b,;]) <;,;<m 1S 


non-singular. Evidently v = (0,0) € R” X R” is a critical point of the Lagrangian function 


L=q+ > y b, ;x,A,. 


i=1 j=1 
Note that the Hessian matrix HL(v) is: 
0 B 
(4.1) HL(v) = | mn | 
Br OA 


Recall, §2, the sequences (Ty,)) <4 < mans (™))1 eee men Of principal upper left minors respec- 
tively of the matrices HL(v), HL(v)(7). 


THEOREM 2. Subject to the above linear constraints, 


(a) q is positive (negative) definite if and only if the sequence (Tymsp)icp<n—m IS positive 
(negative) definite. 


(b) qg attains both positive and negative values if and only if there is a permutation o of the last 
(1 — m) variables in R”" (1.€., (X1,---5%q_) A (X15- ++ Xs Xa(m41yo+++9Xa(ny)) Such that the 


9>“m9 


sequence (Vymsp(7))1<p<n—m IS non-trivial and of saddle type. 


(c) q is positive (negative) semi-definite if and only if for each permutation a of the last (n — m) 
variables of R” the sequence (Vym+p(7))1<p<n—m §5 trivial or positive (negative) semi-definite. 
Proof. To prove Theorem 2, one essentially applies Theorem 1 to quadratic forms which are 


subject to linear constraints. Suppose first of all that the linear constraints are simple: x, = 0, 
1<i<~m. Let G(x,,,1,..-,X,) be the quadratic form 


1 n 
(05-250, X mers %n) = 5 » A; ;X;X;. 
i,j=mt+l1 


In the case of simple constraints, Theorem 2 follows from the relations (3.3), (3.5), §3, (let 
f = 9; f = G in Theorem 1) and the classification results applied to 7 provided by Proposition 2.1, 
§2, and Proposition A.1 in the Appendix. For general linear constraints, note that in Theorem 1, if 
f = q and 


n 
8,(X,,.--,X,) = ye bai x; l<a<m, 
j=l 
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then the change of coordinates map, 9y:R” X R” — R™*", constructed in Theorem 1 is a Jinear 
map. Consequently, in new coordinates, the function g° y is a quadratic form which is subject to 
simple constraints. Thus Theorem 2 follows from Corollary 3.1.3, §3, and the classification, 
proved above, for quadratic forms which are subject to simple constraints. 


REMARK 4.1. Conclusion (a) was first established by H. B. Mann [10]. Conclusion (c) sharpens 
and simplifies the classification scheme obtained by G. Debreu [5] for semi-definite quadratic 
forms. (In particular Debreu’s results are stated in terms of a// permutations of the variables in 
R”.) Despite its obvious utility for the theory of Lagrange multipliers (cf. Propositions 1.1, 1.2, 
§1), conclusion (b) of Theorem 2 appears to be new. 

Turning now to the study of the indeterminate case for Theorem 1, recall that Proposition 1.2, 
§1, is stated in terms of the auxiliary quadratic form on R”, 

re 
a(hy=Z Le Gayle) hh; 
i, j=l I J 
which is subject to the m linear constraints (1.3): 
984 
— = < 
> ax, (a)h, = 0, l<a 


In 


mM. 


(4.2) OBSERVATION. Let L, be the Lagrangian function associated to the quadratic form q, 
and the linear constraints (1.3). It is clear from (4.1) that the Hessian matrix HZ, (0) is exactly the 
Hessian matrix HL(v) defined in §1. 


Applying (4.2) and Theorem 2, we establish the following results: 


(4.3) Conclusion (a) of Theorem 1 obtains, if and only if, subject to (1.3), the quadratic form gq, is 
positive or negative definite. 


(4.4) Conclusion (b)’ of Theorem 1 obtains, if and only if, subject to the constraints (1.3), q, 
attains both positive and negative values. 


(4.5) The only indeterminate case for Theorem 1, at the level of second order derivatives, occurs if 
and only if det HL(v) = 0 and (b)’ fails, i.e., subject to the linear constraints (1.3), the auxiliary 
quadratic form q, is degenerate and is positive or negative semi-definite, a numerical algorithm 
for which is provided by Theorem 2 (c). 


(4.6) For completeness, we show that this indeterminate case is necessarily indeterminate at the 
level of second order derivatives. 


Let V be the (n — m)-dimensional linear subspace of R” defined by the linear constraints 
(1.3). There is a basis (e,,...,e,) of R” such that (e,,...,e,_,,) iS a basis of V and, with respect 
to coordinates (y,,..., y,) in this basis, the quadratic form q, is diagonalized: 


At (Vs-+-5 In) =e? +--+ +y72), where e=1(e=-1), 


if g, 1S positive (negative) semi-definite on V and k is the maximal dimension of a subspace of V 
on which q, is positive (negative) definite. Since gq, 1s degenerate on V, then k <n — m (q, = 0 
if k = 0). In the basis above (assuming m > 0), the linear constraints (1.3) are equivalent to the 
simple constraints: y =0, i=n—m-+t1,...,n. Let 


F(Nr500 Ye) = 4 Ni5-0 9M) F Veer, BON e I) = OL ON10- In) — View 


subject to the simple constraints Gf m>0) y,=0, i=n—m-+1,...,n. Subject to these 
simple constraints, if e = 1, then f has an absolute minimum at the origin in R”, while g does not 
have a local extremum at the origin in R”, although the auxiliary quadratic form is g, in both 
cases. Similar conclusions obtain if e = —1. Consequently, the indeterminate case (4.5) is 
necessarily indeterminate. 
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REMARK 4.2. It follows from (4.3), (4.4) above that Theorem 1 provides a numerical algorithm 
which distinguishes the three cases of Proposition 1.2, §1. 


This completes our study of Theorem 1 and of the problem of constrained local extrema. 


APPENDIX. In this appendix we recall some results from linear algebra and prove Proposition 
2.1 (b), §2. Let Q be a real-valued quadratic form on R” whose associated matrix is M (in some 
basis of R”). For each permutation 7 of {1,2,...,r}, recall the sequence (y,(7)),<;<, Of 
principal upper left minors of the matrix M(7) (cf. §2 for this notation). Let 


(x,y) = (Q(x +») - O(x) - Q(y)) 


be the associated bilinear form. 


PROPOSITION A.1. (a) Q is positive semi-definite if and only if, for each permutation 1, the 
sequence (y,(7)) is either trivial or positive semi-definite. 

(b) Q is negative semi-definite if and only if for each permutation 7, the sequence (y,(7)) is 
either trivial or negative semi-definite. 


Proof. It is enough to prove (a). To prove (b) one considers the quadratic form —(Q. 
Employing the classical classification of semi-definite quadratic forms (Debreu [5], Gantmacher 
[7], page 307) Q is positive semi-definite if and only if each principal jth order minor of M is 
> 0, 7 = 1,2,...,7r. Note that each principal jth order minor of M is the principal upper left jth 
order minor of M(7), for some permutation 7. Thus, one obtains the condition: y,(7) > 0, 
j=1,...,r, for all permutations 7. However, if the sequence ( VAT) <jer is non-trivial, then 
only a sequence of + signs followed by a sequence of zeros can occur. Indeed, any other 
non-trivial sequence of + signs and zeros would constitute a sequence of saddle type and hence it 
would follow, by Lemma 2.1, §2, that Q is not positive semi-definite. Thus Proposition A.1 
sharpens the classical classification of semi-definite quadratic forms. 


LEMMA A.1. Suppose there are permutations 7, 6 of {1,2,...,r} such that the sequences 
(YT) jer and (¥(9)) <j<, are nontrivial and positive semi-definite, respectively, negative 
semi-definite. Then there is a permutation 7 of {1,2,...,r} such that the sequence (y,(T)), <j<r IS 
non-trivial and of saddle type. 


Proof. Let y,(7) = Q(e) > 0; y,(o) = O(f) < 0, where e, f are elements of the given basis 
of R”. Let + be a permutation of {1,2,...,7} such that the principal upper left 2 X 2 submatrix 


‘i {(ff) (ef) 
of M(7r) is ke eA Evidently, 


y(t) = O(f) <0, »(7) = O(f)Q(e) -[(e. fF < 0. 


Consequently the sequence (y,(T)), <;<, 18 non-trivial and of saddle type. 


Proof of Proposition 2.1 (b). Suppose Q attains both positive and negative values. Then Q is 
neither positive nor negative semi-definite. By Proposition A.1, either there is a permutation 7 for 
which the sequence (y,(7)), < ; <, 18 non-trivial and of saddle type, or there are permutations 7, o 
for which the hypothesis of Lemma A.1 holds. In either case there is a permutation y such that the 
sequence (y,(Y))1 <;<, 18 non-trivial and of saddle type. Since Lemma 2.1, §2 proves the “if” part 
of (c), this completes the proof of Proposition 2.1 (b). 
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THE SEARCH FOR A ROLLE’S THEOREM IN THE COMPLEX DOMAIN 


MORRIS MARDEN 
Department of Mathematical Sciences, University of Wisconsin-Milwaukee, Milwaukee, WI 53201 


1. Introduction. Our purpose here is to give a brief account of the results obtained over many 
years in the attempts to create a complex domain analogue to a very familiar theorem of real 
analysis. The theorem is one which is often introduced in our first calculus courses. It states that 
between any two real zeros of a differential real function f lies at least one critical point of f 
(zero of its first derivative f’). 

This theorem first appeared in a book published in 1691 by the French mathematician Michel 
Rolle. Its publication had predated the adoption of the geometric representation of complex 
numbers by about 140 years. For, though this representation was devised by the Norwegian 
cartographer Caspar Wessel in 1797 and again by the Swiss mathematician Jean Argand in 1806, 
its universal acceptance had to await its invention in 1831 by none other than the great Karl 
Friedrich Gauss. 

With this representation came the concepts of a complex variable z = x + iy and a function of 
a complex variable. It is not surprising that some early attention was directed towards construct- 
ing in the complex domain counterparts to well-known theorems of real analysis such as Rolle’s 
theorem. However, the generalization of Rolle’s theorem to the complex plane is not obvious or 
trivial, as the following two examples show. 

First, take the function f(z) = e”' — 1 which has zeros at z = 0 and at z = 27. If Rolle’s 
theorem were valid, at least one critical point would be situated on the interval 0 < x < 27. But 
f(z) = ie”! so that f’ has no zeros whatsoever. 

Secondly, take the polynomial f(z) = (z* — 1)(z - iV3) which has zeros at the vertices 
z= +1, z = iy3 of an isosceles triangle. If Rolle’s theorem were valid, f would have a critical 
point on each side of this triangle. But f’(z) = 3[z — (i/ V3 )]* so that f’ has a single zero at 
z = iy3, a point interior to the triangle. 

As the second example shows, the concept of a critical point lying between two real zeros (i.e., 
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on the line segment joining the two zeros) generally is replaced in the complex plane by the 
concept of a critical point situated in some region containing the zeros of the given function. In 
our second example that region is a closed triangle but in our later examples a polygon or a 
circular disk may be the most convenient choice. 

As to the importance of locating the critical points of a given function f, it is to be recalled 
that, whereas for a function of a real variable the determination of the critical points helps in 
locating the maxima and minima, for a function of a complex variable f analytic in a region T 
finding the critical points aids in determining where the map of T by w = f(z) fails to be 
conformal. 


2. Locating all the critical points. An immediate corollary of Rolle’s Theorem is the following 
result. 


THEOREM 2.1. If a real polynomial f of the real variable x has only real zeros all situated on the 
interval I: a < x < b of the real axis, then all the critical points of f also lie on I. 


It is Theorem 2.1 that was first to be generalized to the complex domain. The earliest such 
result was due to Gauss who in 1836 stated the following [2, pp. 21-24]. 


THEOREM 2.2 (Gauss). The critical points of a polynomial f which are not multiple zeros of f are 
located at the equilibrium positions in a certain field of force. This field is one due to a particle placed 
at each zero of f, having a mass equal to the multiplicity of the zero and attracting according to the 
inverse distance law. 


From Gauss’ Theorem we may deduce immediately the following fact. 


THEOREM 2.3 (Lucas). The critical points of a polynomial f lie in the convex hull H of the zeros 
of f. 

The convex hull H of the zeros of f is defined as the smallest convex polygon enclosing all the 
zeros of f. 

Though Theorem 2.3 is a corollary of Theorem 2.2, nowhere in Gauss’ papers does one find 
either a statement or proof of Theorem 2.3. The first published statement and proof of Theorem 
2.3 appears to be due to the French engineer F. Lucas in 1874. 

Another form of Lucas’ Theorem is the following result. 


THEOREM 2.4. The critical points of a polynomial f lie in any circle C enclosing all the zeros of f. 


In fact, Theorem 2.4 is equivalent to Theorem 2.3. For, on the one hand, C encloses H and on 
the other hand, H is the intersection of all circular disks covering H. 


3. Separation of the critical points. If, instead of being given that all the zeros of polynomial f 
lie in a single circle C, one presupposes that the zeros are distributed over a set of given circular 
disks, one may expect more specific results on the location of the critical points than given in 
Theorem 2.4. During the period 1918-1922 Joseph L. Walsh discovered a number of such results, 
two of which are the following [2, pp. 89-92]. 


THEOREM 3.1 (Walsh). Jf a polynomial f of degree n has m, zeros in a circle C,, with center at 
Z = C, and radius r, and the remaining m, = n — my, zeros ina circle C, with center at z = cy and 
radius r,, then every critical point of f that does not lie in C, or C, lies in a third circle C with center 
at z= (m,c, + m,c,)/nand radius r = (mr, + m,r)/n. (See Fig. 1.) 


THEOREM 3.2 (Walsh). If Cy, C,,-...,G, is a set of circles having a common external center 0 of 
similitude and if f is a polynomial of degree n that has m, zeros in circle C,,k = 0,1,...,p, then 
every critical point of f, not in a circle C,, lies in a circle Cj, where Cj, C3,..., Cj is a set of circles 
also having 0 as an external center of similitude. (See Fig. 2.) 


An interesting new application of Theorem 3.1 is to a real polynomial f with only real zeros. 
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C C Cy 


Co Cy C G C, 
FIG. 2 


THEOREM 3.3. Let f be a real polynomial of degree n with only real zeros of which m, are located 
on the interval I: a, < x < by and the remaining m, =n — m, are located on the interval I,: 
a, <x <b, with a, > by. Then any critical point of f not on I, or I, is located on the interval I: 
a<x <b, where a = (ma, + m,a,)/n and b = (mb, + m,b,)/n. 


This theorem supplements Rolle’s Theorem. To prove it, let us draw the circle C, whose center 
is at c, + iy, where c, = (a, + b,)/2, and which passes through the two points z = a, and 
= b,. Also let us draw the circle C, whose center is at c, + iyA, where 


C,=(a,+b,)/2 and A=(b, — a,)/( by — a), 


and which passes through the points z = a, and z = b,. Then according to Theorem 3.1, any 
critical point of f not in circle C, or circle C, lies in circle C whose center is at 


z= (1/n)[ mc, + myc, + iy(Am, + m,)| 


and which cuts the real axis in the points z = a and z = b. Now, as y is allowed to vary from 
— oo to +00, the intersection of all disks C, is interval J,, the intersection of all disks C, is the 
interval J, and finally the intersection of all the circles C is the interval 7. Thus, we have proved 
Theorem 3.3. 


4. Further separation of the critical points. Theorem 3.2 has been generalized to an arbitrary 
set of circular disks C,, one that does not necessarily have a common external center of similitude 
[2, pp. 96-106]. 


THEOREM 4.1 (Marden). For j = 0,1,2,..., p let f(z) be a polynomial of degree m, that has all 
its zeros on the circular disk C,(z) < 0, where C(z) = |z — c,|° — re Then, if a critical point of the 
product 


(4.1) f(2) =h(DA() 4) 
is not situated in one of the circular disks Cy, Cy,...,G,, it lies in a simply-connected region bounded 
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by an oval of the p-circular 2 p-ic curve E(z) = 0 where 


E(z) P nm, P mM, M1, x 
(4.2) ee 
= j20 G2) jan. GZ) G, (2) 
Lig) k=j+1 
and where 
2 
tT =16,-&)? -—(4-%), n= My +m, +--- +m,. 


It is to be noted that, if 7,, > 0, then 7,, equals the square of the length of the external common 
tangent of the circles C,(z) = 0 and C,(z) = 0. 

To clarify Theorem 4.1, let us consider the special case p = 2, that is, of just three given circles 
C,(z) = 0, j = 0,1,2. Equation (4.2) then simplifies to 
(4.3) E(z) = n?(x? + y?)” + lower degree terms, 


so that F(z) = 0 is the equation of a bicircular quartic. It may happen that (4.3) is factorable, 
thus: 


(4.4) E(z) = (x? +y>+ax+ By t v1) (x? +y*+a,x+Byt 2); 


in which case the bicircular quartic E(z) = 0 degenerates into a pair of circles as for Theorem 3.2. 
As an example of a non-degenerate case, let us choose the circles C), C, and C, each of radius 


r but with centers respectively c) = 2i, c, = —3 —i,c, = +3 — i, thus at the vertices of an 
isosceles triangle. Then (see Fig. 3) for r = 0, the bicircular quartic reduces to the points z = + y2. 
For 0 < r < ¥3 — 1, the quartic consists of two distinct ovals, one surrounding point z = — ¥2 


and the other point z = + ¥2. For r= V3 — 1 the two ovals touch one another at z = 0. For 
v3 —1<r< 3 +1 the quartic is a single oval. For r = V3 + 1, the quartic consists of a single 
oval and the isolated point z = 0. For r > V3 +1, the quartic consists of two ovals, with one 
enclosed in the other; in this case, any critical point, not in one of the given circular regions C), 
C, or C,, lies in or on the outer oval of the bicircular quartic. 


° r=0 
x ----- <r <y3-1 
. \ --—— r=y3-1 
oy fi o—} \ \. \ — y3-1<r<y3+1 
\ NY I. le / | 
San NO / — r=y¥3+1 
/ —-— r>y34+1 
Fic. 3 
In the general case equation (4.2) has the form 
(4.5) E(z) =n*(x? + y?)? + lower degree terms, 


so that E(z) = 0 represents a p-circular 2 p-ic curve. While such curves have been known for 
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some time, very little has been written about their properties. It may happen that the curve 
E(z) = 0 degenerates into a set of circles as in the case of Theorem 3.2. 

Another example where curve E(z) = 0 degenerates into a set of circles is that in which the 
circles C, are all of the same radius r and have their centers at the p + 1 vertices of a regular 
polygon P. The curve E(z) = 0 then reduces to a set of concentric circles centered at the center 
of P. 


5. Locating only some critical points. So far we have discussed some complex domain ana- 
logues, not to Rolle’s Theorem, but to its corollary, Theorem 2.1, that involves all the zeros and all 
the critical points of a polynomial. However, Rolle’s Theorem proper deals with the location of 
the critical points of a real polynomial in relation only to pairs of real zeros. 

A complex domain analogue closer to Rolle’s Theorem is the following due to the two British 
mathematicians J. H. Grace in 1902 and P. J. Heawood in 1907 [2, pp. 107-110]. 


THEOREM 5.1. If z, and z, are any two distinct zeros of a polynomial f of degree n, then at least 
one critical point of f lies on the circular disk |z — c| < r, where 


c=(2,+2,)/2 and r=(|z, — 2,|/2)cot(7/n). 


This result turns out to be the best possible in that the limits given in Theorem 5.1 are attained 
by the polynomial 


f(z) =f — icot(a/n)|" * ds. 


From Theorem 5.1 one may deduce the following result, discovered independently by the 
American topologist Alexander in 1915. 


THEOREM 5.2. If the disk |z| < R contains two zeros of an nth degree polynomial f, the concentric 
disk |z| < Resc(a/n) contains at least one critical point of f. 


In an attempt to generalize Theorem (5.2), the Japanese mathematician Kakeya established in 
1917 the following result [see 2, pp. 113-121]. 


THEOREM 5.3. If the disk |z| < R contains p zeros of an nth degree polynomial where 2 < p <n, 
then there exists a number $(n, p) dependent only upon n and p such that at least p — 1 critical 
points of f lie in the disk |z| < Rd(n, p). 


The value $(n, 7) = 1 follows from Lucas’ result in Theorem 2.4. Kakeya himself was able to 
show $(n,2) = csc(7/n) confirming our Theorem 5.2, but admitted an inability to calculate 
o(n,p) for general p. In 1927 the Polish mathematician M. Biernacki calculated the value 
o(n,n— 1)=[1 4+ (1/n)}”. In 1936 Marden showed that 


(5.1) o(n,p) < csc(7/2Qq), 
where g = n — p + 1, and in (1945) Biernacki showed that 

n—p 
(5.2) 6(n.p) < TI [(n + k(n k)]. 


However, neither (5.1) nor (5.2) provides the least upper bound to $(n, p). 
In 1972, Marden [4] studied the Kakeya problem relative to the special family of polynomials 


f(z) =(z2- 4)"(2-24)"(z- 23)", 
where 
A+p=p, v=n—p, |4l=|z,;=R, and |z;|>R. 
For such polynomials he proved that #(n, p) < [2 — (p/n)}'” and that this bound is attained 


when p is an even integer. 
An application of the above leads to the result: 
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THEOREM 5.4 (Marden). An nth degree polynomial P is at most p-valent (2 < p <n) on the disk 
D: |z| < sin[w/2(n — p)| if, in the unit disk |z| < 1, P has at most p — | critical points. 


For a function f to be at most p-valent in a region S means that, if A is an arbitrary real or 
complex number, the equality f(z) = A holds on at most p distinct points z within S. 

Theorem 5.4 follows almost immediately from Theorem 5.3 and inequality (5.1). For, if for 
some number A the equation P(z)=A were valid at p+ 1 points in the disk D, then 
f(z) = P(z) — A would have at least p critical points in the disk 

Iz] < $(n, p + 1)sin[ 7/2(n — p)] < sel 7/2(n — p)|sin[ /2(n — p)] = 1, 
in contradiction to the hypothesis of Theorem 5.4. Hence, P(z) = A in at most p points of D. 

6. Some recent conjectures. During the past dozen years considerable interest has been 
aroused regarding the location of the critical points of any polynomial f all of whose zeros lie in 
the unit disk |z| < 1. By Theorem 2.4 we know that all the critical points of f also lie on the unit 
disk. The question recently raised is how close to each zero do the critical points lie [see 3]. 

The following conjecture was made by the Bulgarian mathematician B. Sendov in 1962 but 
became later known as the “Ilyeff Conjecture.” 


CONJECTURE 1. If all the zeros of an nth degree polynomial lie on the disk \z| < 1 and Zp is any 
one of the zeros, then at least one critical point of f lies within unit distance from Z,. 


This conjecture has been proved in a number of special cases including n = 2,3,4,5, but not as 
yet in general. 

Another conjecture was made in 1969 by the American mathematicians Goodman and Roth, 
the Canadian-Indian mathematician Rahman and the German mathematician Schmeisser. 


CONJECTURE 2. Under the same hypotheses as for Conjecture 1, at least one critical point of f lies 
on the disk |z — (2,/2)| < 1 — (|Zo|/2). 


This has been proved when |z,)| = 1, but some counterexamples have been devised for the case 
|Z)| < 1 by M. J. Miller [see 5]. One of these is the sixth degree polynomial P(z) = (z — 0.84)Q(z), 
where 


O(z) = z° + 1.182303196z* + 1.34007004z* + 1.34007004z? + 1.182303196z + 1. 
Miller shows that the six zeros of P(z) are approximately the following quantities: 
Zo = 0.84, z,= —-1, 2, = 0.415548 + 0.909571i, 
z, = —0.506700 + 0.862122i, z,=2Z,, 25 = 23. 
Obviously |z,| < 1. His calculations show that 
Iz, = [221 = 23] = [zal = [zs] = 1. 
On the other hand, the five zeros of P’(z) are computed by him to be approximately the 
following: 
§, = —0.162505, § = —0.159819 + 0.055911i, §, = 0.098445 + 0.485713:, 
j=, & =h. 
If Conjecture 2 were valid for polynomial P and for the above zero zy, then the inequality 
S, = |f, — 0.42] < 0.58 


would hold for at least one k, k = 1,2,...,5. However, his calculations show that S, > 0.582505 
for all k, thus refuting Conjecture 2. 


7. Critical points of non-polynomials. So far we have surveyed the results aimed at extending 
Rolle’s Theorem to polynomials in a complex variable. We now shall consider the corresponding 
work with entire functions, rational functions and more general functions of a complex variable. 
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The names of Laguerre, Cesaro, and Polya are associated with the earlier work on entire 
functions of a complex variable. By an entire function is, of course, meant one for which the 
Maclaurin series 


f(z) = bp + biz 4+ bz7 4+ --- 


converges for all values of z. Of special interest are entire functions of finite order p, meaning 


| M 
eo = lim sup —“ ten k log k = lim sup Foglog M(rsf) (rf) ; 
log (1/1b,\) r— oo log r 
k— oo 


where M(r, f) = {max|f(z)|, |z| =r}. 
The following is a result about the critical points of entire functions of finite order 0. 
THEOREM 7.1 (Marden). If f, an entire function of finite order p, has all its zeros in a 


semi-infinite strip T, at most n critical points of f lie in sector S (see Fig. 4), where n is the largest 
positive integer less than or equal to p. 


Angle A =a7/(n + 1) 
Fic. 4 


For example, let p be a non-negative integer and let 
f(z) = MTG —ky +]. 


This entire function of order p has all its zeros in a semi-infinite strip 7:0 < x,-l<y<l. 
Hence, at most » = p critical points lie in the sector S. 
Another example 
k=p 
f(z) = T] cos(z — ki)” 

k=—p 
is that of an entire function of order p = 1/2. Here one finds as the semi infinite strip 7, 
x > 1*/4,|y| < p, whereas n = 0. Thus no critical points lie in S. In fact, one can show that all 
the critical points lie in T. 

More generally, it may be proved that in the case of any entire function f with an order p, 
0 < p < 1, any convex polygon H which contains all the zeros of f also contains all the critical 
points of f, just as in Theorem 2.3. However, H will ordinarily be unbounded, since f may have 
an infinite number of zeros. 

For rational and meromorphic functions, there are theorems similar to those on polynomials 
and entire functions respectively. 

For a more general function f which is analytic in a region 7, some results have been obtained 
by the noted French mathematician Jean Dieudonné in 1930 [see 1]. Supposing that f has in T a 
zero at z = z, and one at z = z,, and that the circle C with the line segment from z, to z, as 
diameter is contained in 7, he then determines conditions on f such that at least one critical point 
of f liesin C. However, these conditions are too complicated to be stated here.* 


*If z, and z, are sufficiently close, there should be a critical point in their immediate vicinity, since the critical 
points are continuous functions of the zeros. For, z, — z, makes z, a double zero and thus a critical point. 
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8. Concluding remarks. In summary, we have surveyed the efforts made towards devising some 
complex plane counterparts to Rolle’s Theorem. These attempts began about one hundred and 
fifty years ago with Gauss and have continued to the present day. None of the results so far have 
had the generality and simplicity of Rolle’s Theorem. Hence, it remains a challenge for the future 
to find a true analogue of Rolle’s Theorem in the complex domain. 


Acknowledgments. The author wishes to thank UWM Teaching Assistant Sherrie Nicol for the drawings 
accompanying this paper, and his colleague Professor David H. Schultz for the numerical calculations. 
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ON ALL SORTS OF AUTOMORPHISMS* 


PETER HILTON 
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0. Introduction. If ¢ is a linear transformation of a finite-dimensional vector space into itself, 
then there is a simple criterion that ¢ be invertible, that is, an automorphism, given by det  # 0. 
However, this criterion fails, as we shall see, in general for an endomorphism of a finitely-gener- 
ated module over a commutative unitary ring. In this note we find a criterion for such an 
endomorphism @ to be an automorphism in terms of the existence of certain polynomial 
annihilators of , basing ourselves on a Cayley-Hamilton Theorem for such endomorphisms. We 
then examine the very different situation which occurs if we drop the condition that the module be 
finitely-generated; and this in turn leads us to criteria for special classes of automorphisms of 
(non-commutative) groups. 

The criterion in terms of polynomial annihilators, and the extension to the non-finitely-gener- 
ated case, arose, in the context of homotopy theory, in trying to understand some interesting work 
of J. M. Cohen [2], [3] on self-maps of fiber spaces. Cohen, in fact, used one aspect of the criterion 
in [2], and gave an ad hoc proof, in the case of abelian groups, in [3]. However, the problem of 
understanding Cohen’s work was exacerbated by a defect in his definition of a pseudo-identity of 
an abelian group as a special type of automorphism. Nevertheless, it became plain, from a study 
of [3], that a simplification of a basic tool used by Cohen could be achieved by using a 
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Cayley-Hamilton Theorem for endomorphisms of finitely-generated abelian groups or, more 
generally, finitely-generated modules over a commutative unitary ring. 

We include in Section 1 a proof of such a generalized Cayley-Hamilton Theorem for 
completeness; it is, of course, not new (see, for example, [7]). However, it seems worth drawing 
attention to the fact that such a proof is not available if one confines attention to vector spaces, 
since the operator domain A must, in any case, be embedded in the polynomial ring A[t]. Such a 
process is fundamental to the proofs given in [1], [8], [11]. 

We use the Cayley-Hamilton Theorem to study polynomial annihilators of an endomorphism ¢ 
of a (finitely-generated) module A over a commutative ring A. We enumerate 4 assertions about 
, each of which implies that @ is an automorphism and discuss their mutual implications. If A is 
finitely generated, they are all equivalent to being an automorphism—this follows from the 
Cayley-Hamilton Theorem—but if A is not finitely generated, then no two are equivalent and all 
are stronger than the assertion that @ is an automorphism. These last results, obtained with the 
use of readily accessible examples, appear in Section 2. In fact, Cohen’s notion of pseudo-identity 
is equivalent to assertion (d) in Theorem 1.2. 

In the course of our discussion we find characterizations equivalent to assertions (b) and (d) 
which immediately generalize to the non-abelian case. Thus we are ready to pass, not merely from 
vector spaces to modules, but even to groups with operators. This direction is precisely that taken 
by Roitberg and the author in [5], [6], [10]. 

We should, perhaps, apologize for perpetrating the term “demonic polynomial’ in this paper. 
The term ‘monic polynomial,’ is standard (though we here permit the slight generalization of 
allowing the leading coefficient to be +1); the term ‘comonic polynomial’ for a polynomial with 
constant term +1 seems reasonable in view of the dual role it plays in our theory. To describe a 
polynomial which is both monic and comonic as ‘bimonic’ would be an etymological solecism. To 
stop short at “dimonic’ would be perverse and cowardly! 

In reference [9] a nice proof is given that an endomorphism of a finitely-generated module over 
a commutative ring is annihilated by a monic polynomial, but the proof does not allow us to 
minimize its degree (see Remark (11) following Theorem 1.2). 


1. The Cayley-Hamilton Theorem. Let A be a unitary commutative ring and let A be a 
finitely-generated (left) A-module, with finite generating set a = {a,,a,,...,a,}. Given any 
endomorphism ¢: A — A, we may associate with @ the matrix 


Ay at Ap eee ry 
My Ap o—t we AS 


Ne Ngee pp ot 


over the polynomial ring A[t], where 
(1.1) oa; = »v Ai;a;; i=1,2,...,n, A, EA. 
j=l 


Then det M, the determinant of the matrix M, is an element of A[t], called the characteristic 
polynomial of ¢ relative to the equations (1.1). Let us write F(t) for this polynomial. We may then 
prove (see, for example, [7]): 


THEOREM 1.1 (Generalized Cayley-Hamilton Theorem). F(@)A = 0. 


REMARK. If A is a free module and A is a ring such that a free finitely generated A-module 
has a unique rank, then the characteristic polynomial of ¢ is uniquely determined by ¢. This 
remark, of course, covers the case of a finite-dimensional vector space. 


We include a proof of Theorem 1.1 for completeness; it will be just an adaptation of the 
standard proof proceeding via Cramer’s rule for solving systems of linear equations (see [7]). 
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Proof of Theorem. Set R = A[t] and make A into an R-module by defining ta = $a, aeA. 


Then from (1.1) we infer that X7_,c;,a; = 0,i = 1,2,...,, where 
Cc = ae l Fis 
J A, 7t, i=. 


Regarding these as equations in the quantities a;, we infer that F(t)a, = 0, k = 1,2,---,n. But 
since a is a generating set for A, this means that F(t)A = 0. Finally F(¢)A = 0 since ta = oa. 

We note that F(t) is a polynomial of degree n whose leading coefficient is +1; we call such a 
polynomial monic. We also introduce the term comonic for a polynomial whose constant term is 
+1, and demonic for a polynomial which is both monic and comgnic. Note that the set of monic 
(comonic, demonic) polynomials is closed under multiplication. We may now use Theorem 1.1 to 
prove the following result. 


THEOREM 1.2. Let A be a unitary commutative ring and let A be a A-module. Consider the 
following assertions about an endomorphism $: A > A: 

(a) @ is an automorphism; 

(b) VaeA, J comonic F(t) such that F(¢)a = 0; 
(c) J comonic F(t) such that, VaeA, F(d)a = 0; 
(d) VaeA, 3 demonic F(t) such that F()a = 0; 


(e) J demonic F(t) such that, VaeA, F(o)a = 0. 
Then 


(d) 
(1.2) (c) (b) > (a). 
(c) 


Moreover, if A is finitely generated, all 5 assertions are equivalent. 


Proof. All implications in (1.2) are trivial, except for (b) > (a). Now, given (b), we have an 
equation, for each aeA, 


( cof” + CpG" 1 + +++ $0,169 + 1l)a =0. 


This equation shows immediately that aedA and that a = 0 if da = 0. Thus ¢ is an automor- 
phism. 

Now assume A finitely generated. If a = {a,,a,...,a,} 1s a generating set, then there is a 
characteristic polynomial of $, say G(t), which is monic of degree n, such that G(¢)A = 0. If @ 
is an automorphism then there is also a characteristic polynomial of @~*, say G(t), also monic of 
degree n, such that G(@1)A = 0. Notice that t”G(t~*) is comonic of degree <n. Then if we set 


F(t) = tG(t) + t"G(t"*), 


it is easy to see that F(t) is demonic (of degree n + 1) and F(¢)A = 0, so that (e) holds. In the 
light of (1.2), this proves the equivalence of all 5 assertions if A is finitely generated. 


REMARKS (i). The equivalence of (a) and (e), in the finitely generated case, was the starting 
point of Cohen’s investigations in [2], [3]. 

(ii) We noted above that the polynomial f(t) we constructed had degree (n + 1). Thus the 
construction produces a polynomial of degree (7 + 1), where » is the minimum number of 
elements required to generate A. It is clear that we cannot, in general, do better than this. For 
example, let 4 = Z/5, generated by a, and let ¢: A — A be given by $a = 2a. There is plainly 
no demonic linear polynomial F(t) such that F(¢)a = 0. However the quadratic polynomial 
t? + 1 will evidently serve. 
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(iii) We were careful to state, in our argument, that the degree of t”G(t *) is less than or equal 
to n. It would be less than n if the constant term of G(t) were zero. Now the constant term of the 
characteristic polynomial of ¢ relative to the equations (1.1) is det(A,;), so the question arises 
whether we can have det(A,;,;) = 0 when ¢ is an automorphism. The following example is, 
perhaps, a little surprising. 

Let A be the abelian group Z/2 © Z/3 = (a, b) and define ¢:A — A by 


(1.3) oa = 3a — 3b, ob = 2a — 2b. 


2 
Notice that, in this example, our generating set is minimal in the weak sense that no element may 


be omitted, but not in the strong sense that its cardinality is minimal. 
We have been able to prove, in this direction, the following very partial result. In the statement 
and proof we use the term ‘minimal generating set’ in the strong sense. 


Then ¢ is the identity but the matrix associated with (1.3) is (2 3), whose determinant is zero. 


THEOREM 1.3. Let A be a finitely-generated module over the principal ideal domain A, let 
{a,,@,,...,a,} be a minimal generating set for A, and let 6: A — A be given by 


(1.4) da;= )A;,a,, 1=1,2,...,n, A; eA. 


Then if is an automorphism, det(A,;) # 0. 

Proof. Let us first choose a special minimal generating set (a,,a@),...,a,,5,,52,...b,), 
k+l=n, for A, where a, is of order p,, i=1,2,...,k, with py|ms|--: |My, mM, > 1, and 
{b,, b,,..., 6, } is a basis for a free A-module. Thus py, .,..., 4, are the invariant factors for A. 
Let T be the torsion submodule of A. We then have a commutative diagram 

T> A» A/T 


Lo’ Lo Lo” 
T» A-» A/T 


where $’,,” are automorphisms. If we choose {a,,a,...,a,} as generating set for T and 
{b,T,b,T,...,b,T} as generating set for A/T, then det = (det 6’)(det 6”), where det @ has its 
obvious meaning. Certainly det 6” + 0, so it remains to show that det ¢’ # 0. Thus, for this part 
of the argument, we may assume that A is a torsion module, and hence that k =n. Now if 
o: A — A is the identity, then, in (1.4), 


A;;=Omodyp;, i#j; A,; = 1mody,. 
It follows that det(A,;) = 1 mod py, so that det(A;,) # 0. 


Now let @ be an arbitrary automorphism of the torsion module A with special minimal 
generating set {a,,a,,...,a,}. Then we have, say, 


pa; = VA 4,> p-'a, _ LA 


and so (det A, (det a, ;) # 9. Thus detA,, # 0, and this proves our assertion if we had chosen a 
special minimal generating set. 

We now turn to the general case—and, of course, no longer assume that A is a torsion module. 
Let (1.4) be given, and suppose we have a special minimal generating set (@,, a ,...,a,). Then 


G;, = 21D: ;4;, a, = 04,4, 
so that 


where (A;;) = (p,; Ai ;)(4i;). Thus 
det( p;,)det( d,;)det(g,;) = det(X,;) #0, 
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so that det(A,,) # 0 and the theorem is proved. 


It is probably not easy to improve this result; Craig Squier has produced an example of a 
Noetherian integral domain A, a A-module A generated by 2 elements, and an automorphism ¢: 
A — A admitting a zero determinant. It should be noted, however, that, in this example, A is not 
a unique factorization domain. 

Squier’s example* is easy to describe, but it is not easy to establish its crucial property! Thus, 
let a,b,c,d be indeterminates over the complex numbers C and let A be obtained from 
Cla, b,c,d|/(ad — bc) by adjoining an inverse of a + d. Let M be the A-module generated by 
(u,v) with the relations cu = av, du = bv; and, finally, let ¢: M-— M be multiplication by 
a + d. Then we may write ¢u = au + bv, dv = cu + dv, so that the associated matrix is (“ *) 
with zero determinant. The crucial property to establish is that M is not a cyclic module. This 
requires either a long and involved computational proof or an appeal to the methods of algebraic 
geometry. Indeed, it turns out that A is the coordinate ring of a 3-dimensional quasi-affine 
algebraic variety over C, and hence a Noetherian integral domain. 


2. Non-finitely generated modules. We revert to the 5 assertions of Theorem 1.2 and show, by 
examples, that they are all distinct if A is no longer assumed finitely generated. To validate our 
examples we need the following propositions. 

Let ¢:A — A be an endomorphism of the A-module A. For each aeA let H, = H,(¢) be the 
submodule generated by the orbit of a under ¢, thus 

H, = H,(¢) = (a,oa,...,6"a,...). 


PROPOSITION 2.1. Let A be a A-module and $¢: A — A. Then the following assertions are 
equivalent: 
(i) Assertion (b) of Theorem 1.2; 
(11) VaeA, aeolH,; 
(iii) VB C A such that 6B C B, OB = B. 


Proof. The equivalence of (i) and (11) is almost immediate. Now if (iii) holds, then, since 
obviously ¢H, C H,, it follows that ¢H, = H,, so a © ¢H,. Conversely, given (ii), let a © B. 
Then ¢a € B and so, inductively, ¢"a € B for all n. Thus H, C B and a € $H, C $B, so that 
oB= B. 


PROPOSITION 2.2. Let A be a A-module and let 6: A — A be an automorphism. Then 


ae 'H,(¢ 1) @ H,(¢) is finitely generated. 


Proof. If H,,(#) is finitely generated then Jn such that ¢"a € (a,¢a,...,¢" ‘a). But then 
ae (o-"a,o "Ya,...,6 1a) C 61H, ($71). 
Conversely if a € ¢ 'H,(@ '), then 
ae (oe "ayo Ya,...,6 7a), 
for some n, so that $a © (a, ¢a,..., o” Ja). But then an easy induction shows that 
"a © (a,oa,...,¢" 1a), 


for all positive m, so that 
H,(¢) = (a, oa,...,¢" 4a). 


PROPOSITION 2.3. Let A be a A-module and 6: A — A. Then assertion (d) of Theorem 1.2 is 
equivalent to the joint assertion that, Va € A,a © 6H, and H, is finitely generated. 


*The author is grateful to Craig Squier for permission to publish this preview of a forthcoming article. 
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Proof. Let F(t) be a demonic polynomial such that F(¢)a = 0. Then, say, 
("+ og" 14+ --- 46,64 1)a=0. 


This makes plain that a € @H, and that $"a € (a,¢a,...,¢" *a). As above, this last implies 
that H, is finitely generated. Conversely suppose that for all a € A,a € $H, and H, is finitely 
generated. Since a €& oH, it follows from Proposition 2.1 that J comonic F(t) such that 
F()a = 0. Since H, is finitely generated, it follows from Propositions 2.1 and 2.2 that J comonic 
F(t) such that F(@~*)a = 0. Choose n sufficiently large that t”F(t~*) is a monic polynomial in ¢ 
with zero constant term and degree greater than that of F(t). Then G(t) = F(t) + t"F(t”*) isa 
demonic polynomial such that G(¢)a = 0. 

We are now ready to prove by examples that the 5 assertions of Theorem 1.2 are all distinct if 
A is not assumed finitely generated. In all our examples A will be an abelian group. 


EXAMPLE 2.1. Consider the automorphism Q@ > @. Clearly (c) holds with F(t) = 2t — 1. 
However (d) is false since H, is not finitely generated unless a = 0. 


EXAMPLE 2.2. Let ¢,: Z” — Z” be an automorphism with minimum polynomial of degree n 
(for example, the cyclic permutation (a,,a,,...,a,) > (a),...,4@,,4,) of generators); and let 
¢= 0¢: @Z”"— @ Z". Then (d) is true. For any element of © Z” belongs to a finitely 


generated subgroup on which ¢ is an automorphism, so Theorem 1.2 itself ensures the truth of (d). 
On the other hand (c) is false since no nonzero polynomial in ¢ can vanish on every element of 


@ Z". 


EXAMPLE 2.3. Let ¢;: Q > Q be multiplication by - and let 6 = ©,¢,: 6,Q > @,Q. Then 


(b) is true, since any element of ©,@ belongs to some subgroup ®/_,Q and is thus annihilated by 
the comonic polynomial IT/_,(it — 1). On the other hand (d) is false since it was even false in the 
simpler Example 2.1. Moreover, (c) 1s also false. For if there existed a comonic polynomial F(t) 
such that F(¢)A = 0, then there would exist a monic polynomial G(t) such that G(@~')A = 0. 
But if a, is the unity of the ith copy of @ then ¢~'(a,) = ia,;, so we would have G(i) = 0. This 
would have to be true for every positive integer i, an evident contradiction. 


2 
EXAMPLE 2.4. Consider the automorphism @ — Q. It is evident from Proposition 2.1 that (b) 1s 
false since ¢Z Cc Z but oZ + Z. 


It is probably sensible, when A is not finitely generated, to concentrate on assertions (a), (b) 
and (d) of Theorem 1.2. It is then surely worth observing that the equivalence of (ii) and (iii) in 
Proposition 2.1, as also the validity of Proposition 2.2, do not depend on A being commutative. 
Thus we may consider the equivalent formulations of (b) and (d), in terms of H,(¢), in studying 
endomorphisms of groups or endomorphisms of 7-groups. (Here we understand by a 7-group a 
group G on which a fixed group 7 acts, all homomorphisms of z-groups to respect the 7-action). 
This is the point of view to be found, for example, in [5], [6], [10]. In fact, Proposition 2.3 
establishes that assertion (d) is the most significant of the 5 assertions of Theorem 1.2 in the 
non-finite-generated case, since the equivalent assertion about H, leads to the important 
consequences in the generalized context. 


3. Appendix. In [4] the author proves a generalized Cayley-Hamilton Theorem in the following 
form. 


THEOREM. Over an arbitrary commutative ring, if matrices A and B commute, then x ,(B) = 
Omod B — A, where x ,(t) is the characteristic polynomial of A. 


We observe that, if 6, ~: M — M are commuting endomorphisms of a A-module M, then, for 
any polynomial F(t), F(¢) — F(~) = Omod¢ — y. Thus for any polynomial F(t) such that 
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F(o)M = 0, it follows that 
F(~) M = Omod(¢ — y) M. 


The theorem above is thus an immediate consequence of this observation and Theorem 1.1. 
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GENERALIZED LEAST SQUARES AND EIGENVALUES 
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In a recent note in this section of the MONTHLY [2], Hung C. Li discusses the generalized least 
squares problem of finding the best-fitting line L to a set of points in the plane. Here 
“generalized” means that the sum S of squares of the perpendicular distances from the points to L 
should be minimized, as opposed to ordinary least squares where distances are measured vertically 
or horizontally. This forms the basis of orthogonal regression [3] and principal component analysis 
[4]. Li’s approach is to treat S as a function of the two parameters that determine L and to 
minimize it by partial differentiation. However, we can obtain additional geometric insight and 
extend the solution more easily to higher dimensions by using linear algebra, as in [4]. The 
presentation given here is simpler and accessible to undergraduates. 

If L isa line and x a point in R”, we let d(x, L) denote the orthogonal distance from x to L. 
Given points x',..., x in R”, we seek a line L for which the sum S = U,d(x*, L)* is minimal. 
We will show that LZ must pass through the centroid of the given points and that its direction can 
be given by any eigenvector for the largest eigenvalue of a certain symmetric matrix. More 
generally, the d largest eigenvalues determine a best-fitting affine subspace of dimension d. 

We first show that the desired line must pass through the centroid, which for convenience we 
assume to be at the origin, i.e.,£,x* = 0. Let L’ be any line x = w + w in R", where v is a unit 
vector, and let L be its translation x = tv to the origin. For each x in R”, 


2 4 2 
d(x, L)" = ||x\P —(x +0), 
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Precalculus, T(13: 1). College Algebra, Second Edition. Bernard Kolman, Arnold Shapiro. Academic 
Pr, 1985, xiii + 492 pp, $21. [ISBN: 0~12-417897~-9] Excellent up-date of a fine text (First Edition, 
TR, February 1982). Covers equations, functions, exponential and logarithmic functions, conics, 
matrices, sequences. Optimal section on linear programming. New: anecdotal digressions (very 
nice), and answers to selected problems. MA 


Precalculus, T(13). Trigonometry. Roland E. Larson, Robert P. Hostetler. DC Heath, 1985, xvi + 378 
pp, $24.95. [ISBN: 0-669-08611-8] Chapter material presented primarily through written explanations 
and augmented by proofs and/or examples. Nice treatment of sine/cosine graphing. Does not need the 
attention devoted to calculators, particularly the separate calculator exercises-~these should be 
integrated. Contains additional topics such as polar graphing. TR 


Precalculus, T(13: 1). College Algebra, Ninth Edition. Paul K. Rees, Fred W. Sparks, Charles Sparks 
Rees. McGraw-Hill, 1985, xiii + 545 pp, $28.95. [ISBN: 0-07-051735-5] Suitable for a pre-calculus 
course. Has a wide variety of applications. Does not include material on circular functions. 
Emphasis is on examples and some problem solving techniques. (Sixth Edition, TR, June-July 1972; 
Seventh Edition, TR, August-September 1977; Eighth Edition, TR, October 1981.) TR 


Precalculus, T*(13: 1). Precalculus Mathematics, Second Edition. Thomas W. Hungerford, Richard 
Mercer. Saunders College, 1985, xx + 663 pp. [ISBN: 0-03-000843-1] Presentation of standard precal- 
culus topics is clear and very readable. Changes in this edition include new material on conic sec- 
tions, systems of equations, and sequences and sums, together with new review material and many new 
exercises. (First Edition, TR, March 1981.) JRG 


Foundations, T(18: 1), P. Constructibility. Keith J. Devlin. Perspectives in Math. Logic. 
Springer-Verlag, 1984, xi + 425 pp, $58. [ISBN: 0-387-13258-9] Constructible set theory is an exten- 
sion of Zermelo-Fraenkel set theory in which the notion of what constitutes set is precisely 
defined. This reference text presents a comprehensive account of theory of constructible sets. 
Covers Godel hierarchy in first part and Jensen hierarchy in second. Short exercise sets. KS 


Foundations, T(13-14), S, L. Godel’s Theorem Simplified. Harry J. Gensler. U Pr of America, 1984, 
iii + 83 pp, $6.75 (P); $12.75. [ISBN: 0-8191-3869-X; 0-8191-3686-1] Non-technical exposition of 
proof of Godel’s first incompleteness theorem: arithmetic cannot be reduced to any axiomatic system. 
First shows that lower arithmetic, consisting of propositions without quantifiers, can be axioma- 
tized. Explains logical terms and symbolism. Some comments on philosophical implications of 
theorem. KS 
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Graph Theory, T(16-17: 1, 2). Graphs, Groups and Surfaces. Arthur T. White. Math. Stud., V. 8. 
Elsevier Sci, 1984, xiii + 314 pp, $30 (P). [ISBN: 0-444-87643-X] The nine chapters of the First 
Edition (TR, August-September 1974; Extended Review, January 1976) have been revised and updated; 
six new chapters have been added covering voltage graphs, non~orientable imbeddings, hypergraph 
imbeddings, map automorphism groups, and change ringing (fascinating application). JRG 


Topological Groups, T(18: 1, 2), S, P. Ergodic Theory and Semisimple Groups. Robert J. Zimmer. 
Mono. in Math., V. 81. Birkhauser Boston, 1984, x + 209 pp, $29.95. [ISBN: 0-8176-3184-4] Very 
readable account, in a research~expository style, of recent results of G.A. Margulis and of the 
author, on amenable group actions, rigidity, arithmetic lattices, Lie groups, and a careful intro- 
duction to ergodic theory, beginning with the definition of ergodic actions. Includes numerous 
examples and an extensive bibliography, but no exercises. YN 


Calculus, T(14: 1, 2). Intermediate Calculus, Second Edition. Murray H. Protter, Charles B. Morrey, 
Jr. Undergrad. Texts in Math. Springer-Verlag, 1985, x + 648 pp, $38. [ISBN: 0-387-96058-9] 
Thorough coverage of multivariable calculus topics from the geometry of R™ through vector integra- 
tion. Includes chapters on infinite series (including Fourier series) and appended material on 
matrices and determinants. Flexible treatment allows for a variety of syllabi. There are few 
sophomore-level texts available--this is a welcome re-entry. JRG 


Calculus, T*(15: 1, 2), L. Advanced Calculus, Third Edition. Wilfred Kaplan. Addison-Wesley, 1984, 
xiv + 721 pp, $36.95. [ISBN: 0-201~11680-4] More proofs and exercises than in the Second Edition 
(TR, February 1974); some rewriting; new material on potential theory. Presupposes calculus, pro- 
vides introductory linear algebra. Rigorous as before; much emphasis on physical motivation and 
applications. Many references to the literature. DFA 
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Jacques Douchet, Bruno Zwahlen. Pr Polytechniques Romandes, 1983, ix + 244 pp, Sfr. 68 (P). [ISBN: 
2-88074-026-6] Appropriate for freshman honors” calculus courses (in French). Provides concise and 
simple yet rigorous proofs of all the standard theorems (Heine-Borel-Lebesgue, Bolzano-Weierstrass, 
Rolle, Dini). Also includes some special topics, for instance Minkowski’s inequality. Postpones 
all multivariate calculus to the second volume. Many examples and numerous exercises, some leading 
to clever proofs of classical facts. Thin, yet substantial: good preparation for real analysis. YN 


Calculus, T(13: 3), S. Vorlesungen zur hoheren Mathematik. Johannes Weissinger. Bibliographisches 
Institut, 1984. Band 1: Grundlagen, Differentialrechnung, Hochschultaschenbucher, B. 613, 190 pp 
[ISBN: 3-411-00613-7]; Band 2: Integralrechnung, Lineare Algebra, Vektorrechnung, 
Hochschultaschenbucher, Band 614, 260 pp, (P). [ISBN: 3-411-00614-5] Course intended for engineers 
(in German). Carefully explains the standard topics in differential and integral calculus, with 
discrete and numerical methods when appropriate, e.g., Newton’s method in two variables. Presents 
many constructive or computational proofs, e.g., the binomial theorem, and states without proof the 
harder results, e.g., the convergence of Fourier series. Excellent as lecture notes, perhaps terse 
as text. YN 


Real Analysis, T(16-17: 1, 2), S, P, L. Undergraduate Analysis. Serge Lang. Undergrad. Texts in 
Math. Springer-Verlag, 1983, xiii + 545 pp, $28. [ISBN: 0-387-90800-5] Aimed at upper-level under- 
graduates, the book begins with a brisk survey of elementary calculus using techniques "not 
emphasized in elementary courses." Part two is a discussion of uniform convergence in several con- 
texts including the Stone-Weierstrass theorem and the Riemann integral. Part three concerns appli- 
cations including Fourier series and integrals. Part four is differential calculus in n-space and 
part five is multiple integration concluding with an introduction to differentiel forms. Some exer- 
cises, mostly extending the theory or on special topics. Appendix on algebra. Index. JS 


Real Analysis, P. Lecture Notes in Mathematics-1067: Absolute Summability of Fourier Series and 
Orthogonal Series. Yasuo Okuyama. Springer-Verlag, 1984, vi + 118 pp, $7.50 (P). [ISBN: 0-387- 
13355-0] Focuses on absolute Norlund summability and absolute Riesz summability of Fourier series 


and orthogonal series. Dry, with awkward grammer in places. BH 


Complex Analysis, P. American Mathematical Society Translations, Series 2, Volume 122: Ten Papers on 
Complex Analysis. M.A. Lavrent”’ev, et al. AMS, 1984, vi + 222 pp, $54. [ISBN: 0-8218-3081-3] 
Translations of Russian research papers on a variety of topics in complex analysis. BH 


Numerical Analysis, $(16-17), P*. A New Approach to Scientific Computation. Ed: Ulrich W. Kulisch, 
Willard L. Miranker. Notes & Reports in Comp. Sci. & Appl. Math., No. 7. Academic Pr, 1983, xv + 
384 pp, $39. [ISBN: 0-12-428660-7] 12 papers, mostly by members of the Institute for Applied 
Mathematics at Karlsruhe, West Germany, on the development and use of a computer arithmetic for 
intervals of reals, complex numbers, vectors and arrays. For many operations and numerical prob- 
lems, optimal accuracy can be obtained. Implementation in languages such as Pascal-SC. RWN 


Numerical Analysis, T*(16: 2), S, L*. Computational Methods for Partial Differential Equations. 
E.H. Twizell. Ser. in Math. & Its Applic. Halsted Pr, 1984, 276 pp, $57.95. [ISBN: 0-470-27511-1] 
Explains the classical methods of finite differences and finite elements, and compares their rela- 
tive merits. Applies them to the trilogy of elliptic, parabolic, and hyperbolic partial differen- 
tial equations. Discusses characteristics, consistency, and convergence; includes theoretical and 
computational exercises. Instructors may add complements in linear algebra, existence and unique- 
ness, mesh generation, and physical motivations (largely omitted topics). YN 
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Functional Analysis, P. Modéles étalés des espaces de Banach. 3B. Beauzamy, J.-T. Lapresté. Her- 
mann, 1984, 210 pp, 160F (P). [ISBN: 2-7056-5965-X] 


Functional Analysis, P. Jordan Operator Algebras. Harald Hanche-Olsen, Erling Stérmer. Mono. & 
Stud. in Math., No. 21. Pitman, 1984, viii + 183 pp, $55. [ISBN: 0-273-08619-7] Assumes basic 
knowledge of Banach and Hilbert spaces. Develops algebraic theory of Jordan algebras and then stu- 
dies JB algebras theis weak ly-closed ana logqus JRW algebras. (A JB algebra A is a Jordan Banach 
algebra satisfying jla™ ll = lal” and lla” |< lla” + b°ll for a, b in A.) Large bibliography. BH 


Analysis, P. Coloquio de Sistemas Dinamicos. Ed: J.A. Seade, G. Sienra. Aportaciones Matematicas, 
V. 1. Sociedad Matematica Mexicana, 1985, viii + 167 pp, (P). First volume in a series of lecture 
notes sponsored by the Sociedad Matematica Mexicana. This volume features papers on dynamical sys- 
tems, mostly in English, from a December 1983 conference at the Centro de Investigacion en 
Matem&éticas (CIMAT) in Guanajuato, Mexico. LAS 


Differential Geometry, $(18), P. Déformations Infinitésimales des Structures Conformes Plates. 
Jacques Gasqui, Hubert Goldschmidt. Progress in Math., V. 52. Birkhauser Boston, 1984, 226 pp, 
$24.95. [ISBN: 0-8176-3260-3] Research monograph presenting the authors” recent results. Constructs 
and studies resolutions of sheaves of Killing fields over conformally flat, real, Riemannian mani- 
folds of any dimension, using tools from the theory of overdetermined systems of partial differen- 


tial equations. YN 


Differential Geometry, T?(17: 2), S*. Differential Manifolds. Serge Lang. Springer-Verlag, 1985, 
ix + 230 pp, $19.80 (P). [ISBN: 0-387-96113-5] Introduces sprays and vector bundles over infinite 
dimensional manifolds. Also proves such classical results as the Morse-Palais lemma and Stokes” 
theorem with singularities. Yet students might fail to appreciate the depth of such statements as 
"Hironaka tells me that by using the resolution of singularities [...]" (p. 204). No exercises. 
Certainly original, but perhaps not an introductory text. (1972 Addison-Wesley hardcover edition, 
TR, August-September 1972.) YN 


Probability, T(16-18: 3), S, P, L. Probability. A.N. Shiryayev. Transl: R.P. Boas. Springer- 
Verlag, 1984, xi + 577 pp, $48. [ISBN: 0-387-90898-6] This textbook is based on a three-semester 
course given by the author at Moscow State University. The first part is devoted to the elementary 
theory; the second part to the modern theory (o-algebras, measures, Lebesgue integral, characteris-— 
tic functions); the third part to random processes with discrete parameters (Markov chains, theory 
of martingales). Thorough treatment with a large number of examples and exercises. LCL 


Probability, P*. Classical Potential Theory and Its Probabilistic Counterpart. J.L. Doob. Grund. 
der Math., V. 262. Springer-Verlag, 1984, xxv + 846 pp, $58. [ISBN: 0-387-90881-1] "The purpose of 
this book is to develop (the) correspondence between potential theory and probability theory by exa- 
mining in detail classical potential theory, that is, the potential theory of Laplace”s equation, 
together with the corresponding probability theory, that is Martingale theory." No knowledge of 
potential theory is presupposed. Reader should be familiar with basic probability concepts through 
conditional expectations. The necessary lattice theory, set theory and capacity theory are covered 
in Appendices. JK 


Statistics, T(17: 1), S. The Modern Forecaster: The Forecasting Process Through Data Analysis. Hans 
Levenbach, James P. Cleary. Lifetime Learning, 1984, xiv + 537 pp, $31.50. [ISBN: 0-534-03361-X] 
The authors present forecasting as a process rather than as a set of disjoint techniques. There are 
five parts to the book: an introduction to the forecasting process, exploratory data analysis tech- 
niques, smoothing and regression techniques, the econometric approach to forecasting, and time 
series modeling using Box-Jenkins methodology. MT 


Statistics, S?7(14-17). Experimentation and Statistical Validation. Norbert L. Emrick. Robert 
Krieger, 1983, 121 pp, $14.50. [ISBN: 0-89874-445-8] Based on a series of articles which appeared in 
the Journal of Neurological and Orthopaedic Surgery. Describes various statistical procedures, 
using illustrations from the health and related sciences, in an attempt to bring an understanding 
and appreciation of the contributions that can be made by statistics. RSK 


Statistics, T(17: 1), P*. Analysis of Survival Data. D.R. Cox, D. Oakes. Mono. on Stat. & Appl. 
Prob. Chapman & Hall, 1984, ix + 201 pp, $25. [ISBN: 0-412-24490-X] This book gives a thorough 
introduction to the analysis of survival data, also referred to as failure-time data. It can be 
used either as an introductory text for the subject or as a reference for applied statisticians. 
Includes discussion of parametric and nonparametric analysis techniques for both univariate data 
sets and data sets with explanatory variables. MT 


Statistics, P. Nonparametric Functional Estimation. B.L.S. Prakasa Rao. Prob. & Math. Stat. 
Academic Pr, 1983, xiv + 522 pp, $70. [ISBN: O6-12-564020-X] Attempts to bring together literature 
about nonparametric estimation of density functions, distribution functions, regression functions, 
failure rates, etc. Most of the results have not appeared in a book before; there are bibliographic 
notes at the end of each chapter. MT 


Computer Literacy, T(13-14), S. Computer Information: A Modular System. Marjorie M. Leeson. SRA, 
1985, xvi + 591 pp, (P). [ISBN: 0-574-21720-7] A very complete, flexible computer literacy text. 
Contains interesting commentary on issues and trends as well as a brief introduction to BASIC for 
selected micros. Each of four modules can be used independently from others--a nice feature. TR 
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Computer Programming, T(15), S. BASIC, A Modular Approach, Second Edition. Robert G. Thompson. 
Charles E. Merrill, 1985, xi + 366 pp, (P). [ISBN: 0-675-20280-9] Programming in BASIC with good 
style. Concepts presented in a clear, organized way; examples good. Two possible problems: pro- 
gramming is presented too early (no algorithm design) and extended BASIC has not been used, with 
resultant lower code readability. FA 


Computer Programming, T(15: 1), S. FORTRAN 77 for Scientists and Engineers, Second Edition. J.N.P. 
Hume, R.C. Holt. Reston, 1985, xvi + 364 pp, (P). [ISBN: 0-8359-2065-8] Structured programming with 
examples mainly from science and engineering. Includes design of algorithms using "structured 
English" and Fortran-77. Builds up to some fairly complex algorithms and data structures. Appendix 


presents an introduction to UNIX and VAX/VMS. FA 


Computer Programming, T(13-18: 1). BASIC Programming for the Apple Computer. Robert M. Bateson, 
Robin D. Raygor, Gregory Bitz. West, 1985, xx + 278 pp, $14.95 (P) [ISBN: 0-314-85290-5]; 
Instructor’s Manual to Accompany, x + 199 pp, (P). [ISBN: 0-314~87247-7] A good beginning text with 
a well-written instructor’s manual. Contains many examples and problems for each topic. Includes 
comments on possible errors and how to correct them. TR 


Computer Programming, T(13-14: 1), S. Applied Structured BASIC. Roy Ageloff, Richard Mojena. Ser. 
in Computer Information Systems. Wadsworth, 1985, xvi + 550 pp, (P). [ISBN: 0-534-04740-8] Stan- 
dard introductory text for BASIC. Does contain a section on modules and matrix operations. Allows 
for system variations. Contains applications geared to many fields. TR 


Applications (Actuarial Science), P. Loss Distributions. Robert V. Hogg, Stuart A. Klugman. Wiley, 
1984, x + 235 pp, $29.95. [ISBN: 0-471-87929-0] There are five chapters in this book: the first 
chapter is an introduction; the second and third chapters discuss probabilistic models and methods 
of fitting these models to data; and the fourth and fifth chapters discuss choosing and applying 
distributions of the size of insurance losses. MT 


Applications (Biology), P. Lecture Notes in Biomathematics-56: Gonorrhea, Transmission Dynamics, and 
Control. Herbert W. Hethcote, James A. Yorke. Springer-Verlag, 1984, ix + 105 pp, $9 (P). [ISBN: 
0-387-13870-6] Presentation of mathematical models for the incidence and transmission of gonorrhea 


in several populations, and evaluation of disease control procedures. JRG 


Applications (Communication Theory), P. Applications of Walsh and Related Functions: With an Intro- 
duction to Sequency Theory. K.G. Beauchamp. Academic Pr, 1984, xvi + 308 pp, $55. [ISBN: 0-12- 
084180-0] Update on developments in signal theory in the past decade. Background chapters in 
sequence theory are followed by applications to signal processing in one and two dimensions, to com- 
munications, and in the logical description of digital designs. Extensive list of references accom- 


panies each chapter. JRG 
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brium Model of Real and Financial Accumulation in an Open Economy. Giancarlo Gandolfo, Pietro Carlo 
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Padoan. Springer-Verlag, 1984, vi + 172 pp, $11.50 (P). [ISBN: 0-387-13889-7] 
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with Applications to Engineering Sciences. Ed: V. Komkov. Springer-Verlag, 1984, v + 130 pp, $7.50 
(P). [ISBN: 0-387-13871-4] The functionals in question weakly formulate boundary value problems, 
e.g., buckling beams, and involve design parameters such as shape (as opposed to control parameters 
such as loads). Design modifications, e.g., boundary shapes altered by vector fields, change the 
functionals and weak solutions. Finding stationary functionals leads to optimal designs, e.g., 
strongest cross-sectional shapes; mathematically, the unknown is the boundary value problem itself, 
rather than its solution. YN 


Applications (Information Theory), T**(16-17: 1), S*, P*¥, L. Mathematics of Kalman-Bucy Filtering. 
P.A. Ruymgaart, T.T. Soong. Ser. in Inform. Sci., V. 14. Springer-Verlag, 1985, x + 170 pp, $29. 
[ISBN: 0-387-13508-1] This superb, mathematically sound and accessible exposition covers stochastic 
differential equations and processes, with mean-square calculus in Hilbert spaces. Begins with 
essentials of functional analysis, measure, and integration theory. Ends with Kalman-Bucy filters: 
recursions for reconstructing estimators of random vectors observed with random noise. Gives exer- 
cises and answers. Outstanding text, both for engineers and for "pure" mathematicians. YN 


Reviewers 


MA: Melissa Anderson, St. Olaf; FA: Fahrad Anklesaria, Macalester; DA: David Appleyard, Carleton; 
RB: Richard Brown, Carleton; JNC: Judith N. Cederberg, St. Olaf; CEC: Clifton E. Corzatt, St. Olaf; 
JD-B: John Dyer-Bennet, Carleton; JRG: Jennifer R. Galovich, St. Olaf; SG: Steven Galovich, Carle~ 
ton; BH: Bruce Hanson, St. Olaf; PH: Paul Humke, St. Olaf; RSK: Richard 8S. Kleber, St. Olaf; JK: 
Joseph Konhauser, Macalester; LCL: Loren C. Larson, St. Olaf; RM: Richard Molnar, Macalester; RWN: 
Richard W. Nau, Carleton; LN: Linda Ness, Carleton; YN: Yves Nievergelt, St. Olaf; AO: Arnold 
Ostebee, St. Olaf; TR: Teresa Reardon, St. Olaf; AWR: A. Wayne Roberts, Macalester; MS: Michael 
Schneider, Macalester; JS: John Schue, Macalester; SS: Seymour Schuster, Carleton; JAS: J. Arthur 
Seebach, Jr., St. Olaf; KS: Kay Smith, St. Olaf; LAS: Lynn Arthur Steen, St. Olaf; MT: Michael 
Tveite, St. Olaf; MU: Milton Ulmer, Carleton; TAV: Theodore A. Vessey, St. Olaf; MW: Martha Wallace, 
St. Olaf; FLW: Frank L. Wolf, Carleton; PZ: Paul Zorn, St. Olaf. 
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Section Reports 


An asterisk (*) by the title of a paper indicates that copies of the paper are available from 
the author. Papers presented under special sponsorship as part of joint meetings are so noted in 
parentheses. 


Seaway Section 


The spring meeting of the Seaway Section held April 26-27, 1985 was a joint meeting with the 
New York State Mathematics Association of Two Year Colleges. Approximately 250 attended the meet- 
ings. 


Invited Addresses: 


"Is Discrete Mathematics the New Math of the 80°s?" by Anthony Ralston, SUNY Center at Buffalo. 
"A Vector Approach to Euclidean Geometry," (The Gehman Lecture) by Wolfgang Jurkat, Syracuse 
University. 


Short Presentations: 


"Alternatives to Least Square Estimators," by Marvin Gruber, Rochester Institute of Technology. 
"Left, Right, and Other Things," by Saleh I. Assad, Erie Community College. 

"Derivations and Commutativity," by Howard Bell, Brock University. 

"Models Based on the Mean and the Median," by David Farnsworth, Rochester Institute of Technology. 
"Mathematics for Computer Science--A First Course," by Eric R. Muller, Brock University. 


Student Papers: 


"Function Spaces," by Jay Robartes, State University College at Potsdam. 

"There’s More Than One Way to Slice a Space," by Dwight Tuinstra, State University College at 
Pot sdam. 

"Extensions of the Heisenberg-Weyl Inequality," by Martin Smith, McMaster University. 

"A Predictive Model for the Relationship Between Age and Memory Span," by Wendy Witt, Rochester 
Institute of Technology. 

"Sabermetrics," by James Johnston, St. Bonaventure University. 

"Analysis of Antithetic Variates Technique Applied to Cosage," by Thomas Johnson, Rochester Insti- 
tute of Technology. 


Michigan Section 


The annual spring meeting of the Michigan Section was held on May 3-4, 1985 at Western Michigan 
University in Kalamazoo, Michigan. Meeting attendance was approximately 140. 


Invited Lectures: 


"Another Approach to Riemann-Stielt jes Integrals," by Ken Ross, University of Oregon. 

"Some Recent Problems and Results in Combinatorial Geometry," by Paul Erdos, Hungarian Academy of 
Sciences, 

"Mathematics Programs in Russia," by Richard Dahlke, University of Michigan. 

"Statistical Analysis of the Love Canal Environmental Study," by Michael Stoline, Western Michigan 
University. 

"Editing the Monthly," by Mary and Robert Wardrop, Central Michigan University. 

"Bernoulli’s Polynomials and Periodic Splines," by Dennis Pence, Western Michigan University. 

"The Recent Solution of the Bieberbach Conjecture," by Peter Duren, University of Michigan. 

"On the Napoleon Triangle Theorem," by John Wetzel, University of Illinois. 

"Recent Results in Linear Programming," by Michael Saks, Rutgers University and Bell Communication 
Research. 


Short Presentations: 


"Discrete Mathematics," by Karen Sharp, Mott Community College. 

"The Joint ACM/MAA Retraining Program in Computer Science," by Philip Demavois, Northwestern 
Michigan College. 

"From Computer Science to Topology,” by Linda Brinn, University of Michigan, Dearborn. 

"Curriculum: A Need for Change," by Terry Wesner, Henry Ford Community College. 

"A Survey of K-12 Curriculum," by Harry Nustad, Henry Ford Community College. 

"New Directions in Two-year Mathematics," by Keith Shuert, Oakland Community College; Karen Sharp, 
Mott Community College. 

"On the Distribution of First Digits of Linear Sequences," by Peter Shive, Saginaw Valley State 
College. 

"Toward a Michigan Mathematics Early Placement Test: A Progress Report," by John Kiltinen, North- 
ern Michigan University. 
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Student Papers: 


"Karmarkar’s New Polynomial-time Algorithm for Linear Programming," by Jack Snoeyink, Calvin Col- 
lege. 

"The Logistic Distribution as a Model to Describe the Joint Action of Drugs," by T.J. Widmar, 
Western Michigan University. 

"Fractals and Computer Graphics," by Eric Jones, Michigan Technological University. 

"Tarsky Geometry and New Theorem Proving Techniques," by Jon Mills, Western Michigan University. 

"Development and Utilization of Logic Machine Control Language," by Jennifer Arbanas and Rich 
Stevens, Western Michigan University. 


Panel Discussion: 


International Congress of Mathematics Education. 


New Jersey Section 


The spring meeting of the New Jersey Section was held as a joint meeting with the Mathematics 
Association of Two-Year Colleges of New Jersey on April 27, 1985 at Monmouth College, West Long 
Branch, New Jersey. Approximately 50 people attended. 


Invited Addresses: 


"Graph Theory: An Application of Path-length Distributions," by Edward Boyne, Montclair State. 

"Mathematical Research and Artificial Intelligence," by Susan Epstein, Hunter College. 

"Some Applications of Pure Mathematics at the Census Bureau," by Mary-Mulry Liggan, U.S. Census 
Bureau. 

"Highlights of the 1985 International Congress of Mathematics Educators in Australia," by Gloria 
Gilmer, Copin State College. 


Northeastern Section 


The spring meeting of the Northeastern Section was held on June 14-15, 1985 at Norwich College 
in Northfield, Vermont. Approximately 80 participants attended the meeting. 


Invited Addresses: 


"Why Every Manager Should Be a Mathematician," by Alfred Willcox, Executive Director of MAA, Wash- 
ington, D.C. 

"The Role of Mathematics and Problem Solving in Business Decision Making," by Gordon Prichett, 
Babson College. 

"Public-Key Cryptosystems," by Dennis Luciano, Western New England College. 

"A Discrete/Continuous Mathematics Course for Non-math Non-computer Science Majors," by Margaret 
Cozzens, Northeastern University. 


Contributed Paper: 


"Purely Geometric Analog of Lorentz-Einstein Equations Associated with the Unit Circle," by Her- 
bert Nichol, Drexel University. 


Student Papers: 


"Discrete Prolate Spheroidal Sequences and Their Applications to Secure Voice Communications," by 
Gwyn J. Crouch, Worcester Polytechnic Institute. 

"Estimating Age-specific Fecundity for Modeling Population Dynamics of Clams," by Ann Kochen- 
dorfer, Fairfield University. 

"Estimating Optimal Transformations for Regression," by Skip Olson, Bowdoin College. 

"Periodicity Within Pascal’s Triangle Modulo k," by David A. Sawtelle, Colby College. 


Workshop Presentations: 


"Spreadsheets for Approximation Methods in Numerical Analysis," by Steve Wiitala, Norwich Univer- 
sity. 

"The International Conference on Mathematical Education of 1984," by Joan Mundy, University of New 
Hampshire. 

"Management Information Systems," by Cynthia Lane, Bedford Research. 


Panel Discussion: 
"Undergraduate Mathematical Training and Its Use in an Industrial Setting," by Leonard J. Borucki 


(Moderator), IBM; Frederick Brown, Union Mutual Insurance Company; Joseph P. Noonan, Bedford 
Research; Karl Seer of Pratt-Whitney Aircraft. 
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Texas Section 


The annual meeting of the Texas Section was held at Texas A&M University at Galveston, Texas on 
April 11-13, 1985. There were 209 registrants for the meeting. 


Invited Addresses: 
"“Lacunary Fourier Series, from Sidon to Pisier," by Kenneth Ross, University of Oregon. 
"The Examinations of the MAA Committee on American Mathematics Competitions," by Walter Mientka, 
University of Nebraska. 
"Approximation of Matrices," by Paul R. Halmos, University of Santa Clara. 


Contributed Papers: 


"A Theorem on Planar Continua and an Application to Automorphisms of the Field of Complex 
Numbers," by Robert R. Kallman, North Texas State University. 

"Banach Algebras Satisfying Ax“ = Ax," by Victor A. Belfi, Texas Christian University. 

"Evaluating Special Limits Using Riemann Sums," by Sudhir K. Goel and Dennis Rodriquez, University 
of Houston, Downtown. 

“New Characterizations Using Feebly Open and Regular Open Sets," by Charles Dorsett, Louisiana 
Tech University. 

"Integration Techniques Using Parametric Integrals," by Aurel J. Zajta and Sudhir K. Goel, Univer- 
sity of Houston, Downtown. 

"The Cauchy-Bunyakovsky-Swartz Equality~-1821 to the Present," by Robert S. Doran, Texas Christian 
University. 

"An Interesting Subset of QR@ ," by Elizabeth M. Bator, North Texas State University. 

"An Alternate Proof of the Mean Mean Value Theorem," by Vincent P. Schielack, Jr., Texas A&M 
University. 

"The Problem of Non-T--Special Spaces," by Joseph GC. Warndof, Midwestern State University. 

"Kaplansky”’s Theorem on C(X) Revisited," by Don E. Edmondson, University of Texas at Austin. 

"On Infinite Periodic Rings," by Effraim P. Armendariz, University of Texas at Austin. 

"The Norm-closed Convex Hull of the Orbit of an Automorphism Group of a W*-algebra," by Joseph M. 
Szucs, Texas A&M University, Galveston. 

"SDP Transforms of the Quartic Diophantine Equation of Euler," by Aurel J. Zajta, University of 
Houston, Downtown. 

"On Equivalent Endomorphisms of Finite Groups," by John R. Durbin, University of Texas at Austin. 

"Independence Spaces," by Wynne Johnson, University of Texas, Arlington. 

"Generalized Absolute Neighborhood Retracts," by Stuart Anderson, East Texas State University. 

"Painting a Fence Derives the Fundamental Theorem of Calculus," by Laurence Maher, North Texas 
State University. 

"A Spiral Approach to Discrete and Continuous Mathematics Instruction," by Richard A. Alo and 
Michael G. Murphy, University of Houston, Downtown. 

"Accreditation of Undergraduate Computer Science Programs," by Ashok Kumar and Sudhir K. Goel, 
University of Houston, Downtown. 

"A New Mathematics Degree--Bachelor of Science in Quantitative Methods," by Ron Barnes, University 
of Houston, Downtown. 

"In Search of Excellence--A Comparative Study in the Mathematics Teacher Program Between the Pro- 
vince of Taiwan, Republic of China, and the United States," by Ping-Tung Chang, Laredo State 
University. 

"Technical Mathematics Program at Eastfield College," by Vivian Dennis, Eastfield College. 

"Galculus Teaching and the Role of Testing," by Gregory D. Foley, Austin Community College. 

"A Plea for Nothing," by Tammie Ann Hill, Prairie View A&M University. 

"Projectionally Exposed Cones in Convex Programming,” by George D. Poole, Lamar University. 

"Uses of the Microcomputer for Mathematics Classroom Demonstrations," by Timothy P. Donovan, East 
Texas State University. 

"An Interactive Graphics Package for Spline Approximation," by Susan Hunt, Baylor University. 

"Third Order Iteration Techniques," by Donald F. Bailey, Trinity University. 

"On the Oscillation of Solutions of Certain Third Order Difference Equations," by Beverly Smith, 
Texas Southern University. 

"Solving Polynomial and Rational Function Inequalities," by Bella Wiener, Pan American University. 

"The Microcomputer as an Aid in Teaching Business Math," by Carol Ghomi and Dennis Rodriquez, 
University of Houston, Downtown. 

"Searching for Narcissistic Numbers by Computer," by John F. Lamb, Jr., East Texas State Univer- 
sity. 

"A Classroom Note on Numerical Singularity," by John Dennis, Rice University. 


Special Sessions: 


"Forum on Funding Higher Education in Texas: Problems, Proposed Solutions, and Portents of the 
Future." 


Special Reports: 


"Progress Report on New Mathematics Curriculum and New Standards for Mathematics Teacher Certifi- 
cation," by Barbara Montalto, Texas Education Agency. 
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Panel Discussion: 


"Mathematics Course Work--Who’s Calling the Shots?" by Rose Marie Smith (Moderator), Texas Women’s 
University; John Allen, North Texas State University; Don Bailey, Trinity University; Don 
Edmondson, University of Texas, Austin; Peter Lindstrom, Northlake College. 


Wisconsin Section 


The fifty-third annual meeting of the Wisconsin section was held at Marquette University in 
Milwaukee, Wisconsin, April 26-27, 1984. Approximately 175 people attended. 


Invited Addresses: 


"Some Unexpected Results in Elementary Mathematics," by Ivan Niven, University of Oregon. 
"Peculiarities of Rational and Irrational Numbers," by Ivan Niven, University of Oregon. 


Short Presentations: 


"Algorithm for Computer Graphics," by Andrew Matchett, University of Wisconsin, LaCrosse. 

"How Does One Treat an I11-Posed Problem," by Robert H. Moore, University of Wisconsin, Milwaukee. 

"A Sabermetric Model for Predicting the 1985 Al East Baseball Standings and Analysis of Results 
from 1984," by Steve Krevisky, University of Wisconsin, Washington County. 

"Zero-Player Games: A Look at Cellular Automata," by Mike Slattery, Marquette University. 

"On Arbitrary Functions of Matrices with a Unique and Closed Representation with Applications," by 
N.L. Petrakopoulos, University of Wisconsin, Green Bay. 

"The Optimal Number of Traces of the Product of Simplices," by Hasien S. Moghadam, University of 
Wisconsin, Oshkosh. 

"Inflating Balloons--Deflating a Misunderstanding," by Nicholas Passell, University of Wisconsin, 
Eau Claire. ; 

"Some Things the Numerical Analysis Texts Forget to Mention," by John Oman, University of Wiscon~ 
sin, Oshkosh. 

"On F-structure Manifolds," by Lovejoy Das, University of Wisconsin, Sheboygan. 

"Finite State Automata: Linking Theory and Application," by Thomas L. Naps, Lawrence University. 

"Periodic Functions on the Apple Without Trigonometric Functions," by Edward Gade, III, University 
of Wisconsin, Oshkosh. 

"A Nonparametric Test for Comparing Two Multivariate Populations," by John Bachhuber, University 
of Wisconsin, Oshkosh. 

"Stirling’s Formula and Sums of Powers of Integers," by Eric Key, University of Wisconsin, 
Milwaukee. 

"Subgroups Satisfying a Certain Distributive Law," by Roger Erickson, University of Wisconsin, 
LaCrosse. 

"Building Fundamental Statistical Concepts: Tukey’s Exploratory Data Analysis and Its Impact on 
the Secondary School Mathematics Curriculum," by Gail Burrill and Henry Kepner, Whitnall High 
School and University of Wisconsin, Milwaukee. 

"Mathematics Education in Third World Countries," by Virginia Pillsbury, Milwaukee Area Technical 
College. 

"A More or Less Simple Approach to Limits," by Richard Menzel, University of Wisconsin, Oshkosh. 

"An HGR Calculus Tutorial," by Michael Lucas, Cardinal Stritch College. 

"Conic Sections in Taxicab Geometry," by Barbara E. Reynolds, Cardinal Stritch College. 

"Free Subgroups in Linear Groups over Some Skew Fields," by Alexander Lichtman, University of 
Wisconsin, Parkside. 

"An Algorithm for Minimizing a Non-linear Objective Function," by Tzu-Cheq Kao, University of 
Wisconsin, Oshkosh. 

"An Actuarially Useful Implementation of the Maximum Likelihood Principle," by Nasser Hadidi, 
University of Wisconsin, Stout. 

"Numerical Simulation of Fluid Flow in a Heated Enclosed Cavity," by William Shay, University of 
Wisconsin, Green Bay. 


Panel Discussion: 


"Curriculum Planning in Mathematics" (a preliminary report of the Mathematics Curriculum Task 
Force writing for the new Guide to Curriculum Planning in Mathematics to be issued by the 
Wisconsin Department of Public Instruction), by Paul Campbell (Moderator), Beloit College; 
LeRoy Dalton, Wauwatosa West High School; Robert Hall, University of Wisconsin, Milwaukee; John 
Moyer, Marquette University; Virginia Pillsbury, Milwaukee Area Technical College. 


Student Presentations: 


"Complex Numbers and Synthetic Geometry," by Susan Kelly, University of Wisconsin, Eau Claire. 

"Taxicab Geometry," by Lou Mertens, Cardinal Stritch College. 

"On the Simultaneous Numerical Computation of the Characteristic Polynomial, Adjoint Matrix, 
Inverse Matrix, Determinant, and the Complex Characteristic Values Associated with an Arbitrary 
Nonsingular Matrix," by Dana M. Johnson, University of Wisconsin, Green Bay. 

"Infinitesimals Within Standard Analysis," by Mark Hopkins, University of Wisconsin, Milwaukee. 
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F(¢)M = 0, it follows that 
F(~) M = 0mod(¢ — Wy) M. 


The theorem above is thus an immediate consequence of this observation and Theorem 1.1. 


References 


1. M. Braun, Differential Equations and Their Applications, 3rd Ed., Springer-Verlag, 1983. 
2. J. M. Cohen, A spectral sequence automorphism theorem; applications to fibre spaces and stable homotopy, 
Topology, 7 (1968) 173-177. 
3. , Clarification to a result, Topology, 9 (1970) 299-300. 
4. Marvin Jay Greenberg, Note on the Cayley-Hamilton Theorem, this MONTHLY, 91 (1984) 193-195. 
5. Peter Hilton, A note on finitary automorphisms, Aspects of Mathematics and its Applications, North 
Holland, 1985 (to appear). 
6. Peter Hilton and Joseph Roitberg, On pseudo-identities I, Archiv der Mathematik, 41 (1983) 204-214. 
7. Kenneth Hoffman and Roy Kunze, Linear Algebra, 2nd ed., Prentice-Hall. 
8. Saunders Mac Lane and Garrett Birkhoff, Algebra, 2nd Ed., Macmillan, 1979. 
9. Maurice Orzech, Onto endomorphisms are isomorphisms, this MONTHLY, 78 (1971) 357-362. 
10. Joseph Roitberg, Finitary automorphisms and integral homology, Topological Topics, London Mathematical 
Society Lecture Notes, 86 (1983) 164-168. 
11. Gilbert Strang, Linear Algebra and its Applications, 2nd Ed., Academic Press, 1980. 


NOTES 


EDITED BY SABRA S. ANDERSON, SHELDON AXLER, AND J. ARTHUR SEEBACH, JR. 
For instructions about submitting Notes for publication in this department see the inside front cover. 


GENERALIZED LEAST SQUARES AND EIGENVALUES 


ALAN SHUCHAT 
Department of Mathematics, Wellesley College, Wellesley, MA 02181 


In a recent note in this section of the MONTHLY [2], Hung C. Li discusses the generalized least 
squares problem of finding the best-fitting line L to a set of points in the plane. Here 
“generalized” means that the sum S of squares of the perpendicular distances from the points to L 
should be minimized, as opposed to ordinary least squares where distances are measured vertically 
or horizontally. This forms the basis of orthogonal regression [3] and principal component analysis 
[4]. Li’s approach is to treat S as a function of the two parameters that determine L and to 
minimize it by partial differentiation. However, we can obtain additional geometric insight and 
extend the solution more easily to higher dimensions by using linear algebra, as in [4]. The 
presentation given here is simpler and accessible to undergraduates. 

If L is aline and x a point in R”, we let d(x, L) denote the orthogonal distance from x to L. 
Given points x',...,x’ in R”, we seek a line L for which the sum S = L,d(x*, L)* is minimal. 
We will show that Z must pass through the centroid of the given points and that its direction can 
be given by any eigenvector for the largest eigenvalue of a certain symmetric matrix. More 
generally, the d largest eigenvalues determine a best-fitting affine subspace of dimension d. 

We first show that the desired line must pass through the centroid, which for convenience we 
assume to be at the origin, i.e.,,x* = 0. Let L’ be any line x = w + tv in R”, where v is a unit 
vector, and let L be its translation x = tv to the origin. For each x in R”, 


2 4 2 
d(x,L)" = ||x\" —(x- vo), 
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SO 
d(x,L’)’ = d(x —w,L)’ = d(x,L)° + d(w, L) —2x-[w—-(w-v)v]. 
This implies 
y d(x", L’)” = d(x*,L) + md(w, L)” 
k k 


> Yi d(x*, L)’. 
k 


So we need only consider lines L through the origin. 
Now we must minimize 


(1) S=La(xt LY = Liles? L(x vy 


over ||v|| = 1, i.e., maximize f(v) = X,(x* - v)*. Let X be the m X n matrix with rows x* and A 
the n X n symmetric matrix X'X. Then f(v) = ||Xv||? = Av - v, so f is a quadratic form. 

We now apply the spectral theorem for symmetric matrices and diagonalize A. In fact, 
maximizing a quadratic form such as f is one way to prove the spectral theorem [1]. The matrix A 
is orthogonally similar to the diagonal matrix D of its eigenvalues A,, i.e., D = P’AP for an 
orthogonal matrix P. We may arrange the eigenvalues, which are real, in nonincreasing order. The 
quadratic form corresponding to D is then 


g(u) = Du-u=))),u?. 
Since f(v) = g(P’v) and P is an isometry, 
max{ f(v): llol] = 1} = max{ g(u): [lull] = 1}. 


But g(z) lies in the interval spanned by the A,, so its maximum value is the largest eigenvalue A, 
and occurs when u = e,, the first standard basis vector. So the maximum of f occurs when 
v = Pe,, which is the first column of P and a unit eigenvector for A,. This vector v gives the 
direction of LZ. Any unit eigenvector v, of A with eigenvalue A, will give the same value 
f(v,) = A, for f, so if A, is a repeated eigenvalue, then LZ will not be uniquely determined. 

Similarly, the worst-fitting line through the origin, for which S is maximized, is the line for 
which f is minimized and its direction is given by any unit eigenvector v, for the smallest 
eigenvalue A,. This is essentially Rayleigh’s principle [5]. 

Moreover, we can view maximizing f as finding the farthest points from the origin on an 
ellipsoid in R”. The quadratic form Av - v is positive semi-definite, so each A, > 0. This quadratic 
form is positive-definite if and only if the vectors x* span R”. If they do (as in the example 
below), then each A, > 0 and A has a positive-definite symmetric square root VA , which can be 
constructed as follows: Let YD be the diagonal matrix with diagonal entries JA; and let VA = 
PVD P', so that (VA )* = A. Then f(v) = |\VA v||* and the image of the unit circle ||v|| = 1 under 
VA is the ellipsoid A~1x - x = 1, where x = VA v (A™! is also positive-definite and symmetric). 
By diagonalizing A~', D~! = P‘A~'P, we see that our problem amounts to maximizing || y|] such 
that ©,A;+y? = 1, where y = P'x. Since Ly? < A, for all such y, the maximum occurs at y 
= Ay e,. So the maximum on the ellipsoid occurs where x = Jy U,, 1.e., the best-fitting line lies 
along the major axis of the ellipsoid, in the direction of v,, and the semi-major axis has length 
Ay . Similarly, the worst-fitting line lies along the minor axis, in the direction of v,, and the 
semi-minor axis has length )rn ; 

Finally, in (1), the sum ©, ||x“||* is just the trace tr A of A, the sum of the diagonal entries of A. 
Since the trace is also the sum of the eigenvalues, the value of S for the best line is the sum of the 
eigenvalues except for A,, while the value for the worst line is the sum except for X,,. 
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We illustrate these ideas with the example given by Li [2]. We wish to fit a line to the seven 
points (0, — 2), (4, — 1), (6,0), (6, 1), (4, 2), (0,3), and (— 6,4) in R*. We first translate these points 
so that the centroid, now at (2,1), moves to the origin. This gives 


and 


The eigenvalues of A are 


A, = 70 + 14713 ~ 120.48 
and 
A, = 70 — 14713 = 19.52. 


Corresponding unit eigenvectors are 


v, = (3 + v13, —2)/V¥26 + 6/13 = (.96, —.29) 


and 


v, = —(3 — ¥13, —2)/V¥26 — 6V13 = (.29,.96), 


which is orthogonal to v,. 
The matrix A is positive-definite, and A~*x - x = 1 describes an ellipse in R* with equation 


xp + 2x,x, + 4x} = 84. 


ss 
| 
| 


TT = tr —_ 


J 
/ oo) ive, 
/ 


L, 


| 


Fic. 1 
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Its axes lie along v, and v,, with semi-axis lengths Ay and |r2 , respectively. After translation 
back to the original centroid (2,1), the best-fitting line L, becomes 


x = (2,1) + 4(3 + y13, —2) 


and the worst-fitting line L, becomes 
x = (2,1) + 2(3 — y13, —2), 


which are just the lines found by Li in [2]. The sum S is A, for L, and i, for L,, which explains 
the numerical values found by Li. Fig. 1 summarizes the example. 


Acknowledgment. I would like to thank the MIT Operations Research Center, where I was on leave while 
preparing this note, for its hospitality. 
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FIXED POINTS AND EXAM TAKING STRATEGY 


THOMAS O’BRIEN 
Department of Mathematics and Statistics, Bowling Green State University, Bowling Green, OH 43403 


In certain mathematics exams, a student is given a large number of problems and a limited 
time in which to answer correctly a number sufficient to achieve a passing score. A good example 
is provided by the early examinations in the examination program for attaining fellowship in the 
Society of Actuaries. The student’s problem is, “How much time should be spent on a question 
before giving up and moving on to the next?” Our answer is, “If it is going to take longer than 
average, move on to the next question.” In this note we explain what this answer means, why it is 
so, and how the question arose. 

In an article in a recent newsletter of the Society of Actuaries [1], Chan provided the following 
statistical model for solving the above problem. 


Chan model. Each problem is screened for S minutes. After screening, the working time W is 
known. Total time is then given by JT = S + W. Here, S and W are random variables and they 
are assumed to have exponential distributions with E(S) = 1/A and E(W) = 1/8. Thus S and 
W have densities f(x) = Ae~** and g(x) = Be **, x > 0. Further, 

1 1 
E ( T) = 7 + B . 


The strategy is then: Screen each problem to determine W; if W > 7, go on to the next problem. 
We seek the best moving-on time, 7. The expected working time for a problem worked becomes 


“xBe7P* dx Br 
E(WIW <1) = 8, = 41 - 8 —_|. 
B “Bs 
[ Be® dx l—-e 
0 


Not every question screened will be worked out and thus the expected screening time per question 
worked out will increase to 

1 1 

Ali —e 4] 
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The + we seek is that which minimizes the total time per worked question, 
1 1 Bre~* | 
7) = —{|——_ | + —] 1 -— ———— ] . 
o( ) rv | 1 _ =a B 


1—e 
This function is easily seen to have a unique critical point at which its minimum value is attained. 
At the critical point, 


11 A eB 

- 4 eae 

A 8B B 

In [1] the B in the denominator of the exponential term was left out. We thus undertook to 

redo the computations of Chan’s note and in solving for the critical point using Newton’s method 
discovered that the minimum occurs at a fixed point of ¢. Indeed, solving the last equation for 
e~ ®" and substituting into the expression for (7), we get (7) = 7. We then inquire whether this 
is merely an idiosyncrasy of the chosen model. 


Alternative model. Here, S, W and TJ are as above but the densities of S and W are 
respectively 


f(x) =xMe~** and g(x) = xB2e7®. 
Then, 
2 2 
In this model, the modes of the distributions are at 1/A and 1/8 rather than at 0 as in the Chan 
model. Calculating as in the first model we obtain 


9) 7 B?e 8 
E(WIW< T) jy! Eo | 


and the expected screening time per problem worked becomes 


2) 

A} 1 — Bef" — e At] 
Summing the last two expressions yields 
2—7°\Be 8 


g(r) = 54 — 2-7 Abe 
B A(1 - TBe 8" — e Br) ) 


Again, there is a unique critical point yielding a minimum and it is the solution to the equation 


2 2 2 
tT+[ t+ tle =—4+—. 
er ae 
Solving for e~ 47 and substituting into the expression for ¢, we again obtain (7) = 7 and we are 
led to consider a general explanation for this phenomenon. 


General model. Suppose that S and W, again with the same meaning as above, have 
distributions with continuous positive valued densities f and g, respectively, defined on (0, 00) 
and that 


E(S)=y and E(W)=a 
[x8(2) dx 


fal») dx 


E(W\|W <7) = 


1985] NOTES 661 


The screening time per question worked becomes __t—_. Thus 


I g(x) dx 


0 


[fa ax] 


(787) — 6(7)8(7))/f a(x) ax 


$¢(1) = 


o'(1) = (—8(7)($(1) - “Vf "g( x) dx. 


Since y > 0, lim, _,9¢(T) = +00 and lim, _,,.¢(7) = a + y. Also the graph of @ cannot cross the 
diagonal from below. It follows that @ has a unique fixed point and it occurs at the min of 9. 

We now restate our answer to the question posed at the beginning. If, after screening, the 
estimate of the time to work the problem is longer than the average time spent per problem 
worked, then move on to the next problem. This answer may not be terribly helpful to the test 
taker and so we present in a table the results of the numerical calculations for the two models 
above for a 60 question three-hour exam with E(T) = 6. 


po E(S) E(W) Best 7 Answers Screens 


5 3.534 
4.23 4.561 
4.793 
5.524 
5.895 


Alternative 
model 


Thus, the first model suggests that, on this test, a problem should be screened for 1.77 minutes 
and then if it is estimated that the time needed to work the problem is more than 4.561 minutes, 
go on to the next. The second suggests screening for 1.4 minutes with a moving-on time of 4.855 
minutes. 
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ANSWER TO PHOTO ON PAGE 630 


Alfred Haar (1885-1933). 
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A COMBINATORIAL PROOF OF A CRITERION FOR NORMALITY 


JAMES C. OWINGS, JR. 
Department of Mathematics, University of Maryland, College Park, MD 20742 


It is well known that if G is a finite group, p is the smallest prime that divides |G|, and H isa 
subgroup of G of index p, then H is normal in G. This is usually proved by representing G as a 
group of permutations on the left cosets of H, along the lines of Herstein [2, pp. 72—73] (although 
Herstein does not explicitly mention this result). We present here a simple theorem relating right 
and left cosets from which the results easily follows. We also discuss a related theorem of Marshall 
Hall, Jr. 


THEOREM. If H and K are subgroups of a group G, L is a left coset of H, and R,, R, are right 
cosets of K such that L\ R, # @ and LO R, # S, then LM R, has the same cardinality as 
LOR). 


Proof. Let w, € LO R,, wy. © LO R,. Then 
L=w,H = w,H, R, = Kw,, and R, = Kn,. 
So 
LOR, =wHo kw, = (w, Hw? ‘a K)w, 
and 
LOR, =wH O Ky, = (w, Hwy? ‘a K)w,. 
But since w, H = w,H, 
w, Hw) = w,Hw,! = w,(w,H) = w,(w,H)” = w, Awe}. 


Hence, both LM R, and LM R, are right cosets of w, Hw, * M K, a subgroup of G. This proves 
the theorem. 


COROLLARY. Let G be a finite group, let p be the smallest prime that divides \G|, and let H be a 
subgroup of G of index p. Then H is normal in G. 


Proof. Let g © G\ H; then gH \ H = ©. Hence, if k is the number of right cosets of H that 
gH intersects, k < p — 1. By our theorem, the size |gH 1 R,| of these intersections is constant, 
say c. Since 


IH|=|gH|= d,s ISH OR | = ck, 
sHAR, + © 


k is a divisor of |H| and, by Lagrange’s Theorem, a divisor of |G|. Therefore k = 1. So gH = Hg. 


Suppose H and K are subgroups of G. Marshall Hall, Jr. [1, p. 14, Lemma 1.7.1] has shown 
that if ZL, and L, are left cosets of H, then the sets 


{ R|R is aright coset of K and L; 1 R ¢ @} 


and 


{ R|R is aright coset of K and L, 1 R# @} 


are either equal or disjoint. Obviously, both our result and Hall’s result remain true if the words 
“right” and “left” are interchanged. By putting the two results together, we get a nice picture of 
the interaction between right and left cosets. The left cosets of H and the nght cosets of K can be 
partitioned into a finite number of blocks B,,...,B, and Bj,..., By, respectively, such that if 
Le B, and R © B’, then LO R # @ if and only if i =j. Our theorem then shows that, if 
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B, = {l,,...,L,} and B/ = {R,,...,R,}, then all the intersections L; 9 R,1l<i<s,l<j< 
t, have the same size. 
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ON THE DIFFERENTIABILITY OF FUNCTIONS OF SEVERAL VARIABLES 


MICHAEL W. BOTSKO AND RICHARD A. GOSSER 
Department of Mathematics, Saint Vincent College, Latrobe, PA 15650 


Let f be a real-valued function defined on D a neighborhood of a in R”. Then f is said to be 
differentiable at a if there is b © R” and y: D — R such that for all x € D 


(1) f(x) — fla) =b-(x— a) +(x)||x — al] and (x) > 0asx >a, 
where a - b is scalar product. 

The purpose of this note is to present an equivalent condition for differentiability of functions 
of several variables, which is often easier to apply than the definition itself. The condition is 
similar to the one used by Carathéodory for functions of one variable, see [1]. Our main result is 
the following: 


THEOREM. Let f: D — R where D C R” is a neighborhood of a. Then f is differentiable at a if 
and only if there exists h: D — R" continuous at a and satisfying 


(2) f(x) — f(a) =h(x)-(x-— a) forallxin D. 


Proof. Suppose (1) holds, and for i = 1,2,...,n, let a;, x,, and b, denote the ith component 
of a, x and b. Also for each i define 

X: — Qa: 

b, + n(x)7———, 

h(x) = MT all 

b 


ifx #a, 


ifx =a. 


Clearly each h,: D — R is continuous at a, and for x # a 
f(x) — fla) = b-(x— a) + (x) |lx — all 


= Li bi( x; — a;) +A) yx, ~ a,)” 


ll — all 


_ n(x)( x; — a;) a 
“2 b; + Ix — all (x, i) 


= LA (x) x; _ a;) 
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=h(x)-(x-— a), 


where h(x) = (A,(x), h)(x),...,4,(x)) and h is continuous at a. Since the equality obviously 
holds for x = a, we have 


f(x) —f(a) =h(x) +(x — a) forall x in D. 
Hence (1) > (2). 
Conversely, suppose there exists h: D — R” continuous at a and satisfying (2). For each 
i=1,2,...,n, define y,; D—> R by 
n(x) =h,(x) —h,(a) 
and y: D > R” by 
y(x) = (m(x),2(x),---,0,(x)). 
Hence 
f(x) — f(a) = h(x) (x — a) = (h(a) + y(x)) ‘(x — a). 
Put b = h(a) and define 7 so that, 
y(x) -(x = a) 
n(x) = |x — al 
0 ifx =a. 


ifx Fa, 


For all x € D we have 


f(x) — fla) = b-(x— a) + y(x) “(x — a) 
=b-(x—a) +n(x)|lx — all. 


Since h(x) > h(a) as x > a, n(x) > 0 as x — a and the proof is complete. 


If f is differentiable at a, it is easy to show that h(a) is the gradient vector of f at a. 
As an example of using our criterion, let us show that the following function is differentiable at 
0. Define f: R” — R such that 


1 
x? sin—— if x + 0, 
fx) = 2 r 


0 if x = 0, 
where a, > 1 for each i. Now define h: R" > R” so that 


1 
x? sin | e, if x + 0, 
h(x) = E[ = El r 


(0,0,...,0) ifx =0, 


where b, = a, — 1 and e, is the unit vector with 1 in the ith position and 0 elsewhere. Obviously 
f(x) = h(x)-x for all x in R” and h is continuous at 0. Thus f is differentiable at 0. 

It is easy to prove that sums, products and quotients of differentiable functions are differentia- 
ble using the criterion in this paper. In addition the usual chain rule and directional derivative 
formulas are readily established using this technique. The reader is invited to work through some 
of these or even to formulate others. 

As an example we give a short proof of the following well-known result. 


THEOREM. Let f and g be real-valued functions defined on a neighborhood of a in R”. If f is 
differentiable at a, f(a) = 0, and g is continuous at a, then fg is differentiable at a. 


Proof. Since f is differentiable at a, there exists D a neighborhood of a and h: D — R” 
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continuous at a such that 
f(x) =h(x)-(x-—a) forall x in D. 
Obviously D can be chosen so that g is defined on D. Multiplying by g(x) yields 
f(x) g(x) = g(x)h(x) -(x— a) forall x in D. 


Since gh: D — R” which is clearly continuous at a, it follows at once that fg is differentiable 
at a. 


Reference 


1. Maurice Heins, Complex Function Theory, Academic Press, New York (1968) 58. 
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E 3111. Proposed by Themistocles M. Rassias, Athens, Greece. 
Prove that 


E 3112. Proposed by M. J. Pelling, London, England. 


Let Q, be the set of rationals greater than 1. Does Q, contain a minimal set B such that every 
gq € Q, is a product, not necessarily unique, of elements in B? 
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FE 3113. Proposed by Frank Schwellinger (student), Karlsruhe, West Germany. 


Prove that there is a continuous, strictly decreasing function g(x) of the real line R onto R 
such that g(g(x)) = 2x + 2, but that there is no such function satisfying g(g(x)) = x + 1. 


FE 3114. Proposed by M. J. Pelling, London, England. 


Find the largest cube that can be inscribed in some tetrahedron of volume 1. 


FE 3115. Proposed by I. J. Schoenberg, University of Wisconsin-Madison. 
(a) Let z, (j = 1,...,m) be the zeros of the polynomial 


(1) P(z) =z" +a,z" ? +--+ +4 


n 


with complex coefficients. (Note 1'z, = 0.) Show that 
dL lz)l* > 2layl, 
j=l 


with the equality sign if and only if the z, all lie on a straight line passing through 0 in the 
complex plane. 
(b)* Let w, (j =1,...,m — 1) be the zeros of the polynomial P/(z), from (1). Show that 
n—-1l n—-2 n 
ee ae 
j=l j=l 


with equality if and only if all the z, lie on a straight line passing through 0 in the complex plane. 


FE 3116. Proposed by M. Laub, Jerusalem, Israel. 


(a) If x and y are nonnegative real numbers, prove that x* + y” > x” + y*. 
(b) If {x,,x,,...,x,,} 18 a sequence of nonnegative real numbers and { y,, y),..., y, } 18 any 
permutation of this sequence, prove or disprove that 


Xqt + x52 + 00+ +xpM DS xP + xZ2 + 0+ txFn, 


SOLUTIONS OF ELEMENTARY PROBLEMS 


Properties of the Matrix of a Rosette 


E 2898 [1981, 538]. Proposed by U. Abel, Heidelberg DKFZ, West Germany. 


A rosette is a directed graph G: (V = {xo,...,x, }, E) with nonzero arc-weights, consisting of 
cycles which all pass through the central point xo. 

Let A be the matrix corresponding to G, 1e., a,, = weight on (x;,x;), and let m; and n, 
denote the number of cycles in G of length > i and = /, respectively. Assume that the maximal 
cycle length / is greater than 1. 

Prove the following statements: 

k 


(a) dimkerA* = )) (m,-1), k < J; 


i=1 
l 


(b) dimker A! = dimker AU} =1 —14+ Yo n,(k— 1). 
k=2 


Solution by the proposer. Inductively define 
M,={x€V|x#x, and (x, xq) isan edge of G}; 
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Thus 

[-1 

y m, =(/-1)n,+(/-2)n,_, +--+: +n, 

p=1 

/ 
= di n,(p-1). 
p=2 

Therefore 

[-1 [-1 
(5) Le (m,-1)=GA-)+ Vin (p-). 

p=1 p=l 


Together, (a), (4) and (5) yield (b). 
An Application of Fubini’s Theorem 
E 2965 [1982, 594]. Proposed by Bu Qi-yue, Shanghai Jiao-tung University, People’s Republic 


of China. 


Let 0 < h, A C [a,b], A measurable. Prove 
= [’m[4 A(x —h,x+h)] dx < mA. 


Solution by K. F. Andersen, University of Alberta. If x ,(t) denotes the characteristic function 
of A, then we have 


[om{ 4 A(x —h,x+h)] dx = [Pax xa dt. 


Applying Fubini’s Theorem to the double integral on the right, we see that this is clearly less than 
or equal to 


fo xa) af, ® = (mA)(2h), 


as required. 


Also solved by 26 other readers and the proposer. 
An Irreducibility Criterion 


E 2969 [1982,697]. Proposed by John Brillhart, The University of Arizona, and Constantin 
Sevici, University of Michigan. 


Let f(x) =a yx" + +-: +pa, be a polynomial with integer coefficients such that aja, # 
0,(ao,41,---, pa,) = 1, and let p bea prime such that p > XL"=3Ia,||a,|"1~*. Prove that f(x) is 
irreducible in Z[ x]. 


Editorial note. The original problem was slightly misstated, as the example (due to W. Janous, 
Austria) n = 2,da) = 1,a, = 2,a, = 1/p, p any odd prime, shows. The intention was to assume 
that a,, not merely pa,, is an integer. With this amendment we have the following. 


Solution by E. R. Gentile, Argentina. Let a € C be a root of f(x). We claim that |a| > |a,|. 
Otherwise, f(a) = 0 implies 
n-l 


_ -1- 
|pa,,| = |aga” + +++ +a,_10| < |a,| L lag|lanl" °° < lanl ipl, 
s=0 
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a contradiction. Assume that f(x) is reducible with respect to Z[x]. Then we have in Z[x]: 
f(x) = (box’ +++ +bp)(egox' +++: +e), 1<t<(n-1), 


with bocy = ao, 5c, = a,. Let a,,...,a, be the roots of the second factor. We have 


Therefore 


which is absurd. Hence f(x) must be irreducible with respect to Z[x]. 


Also solved by O. P. Lossers (The Netherlands), D. P. Mehendale and M. R. Modak (India), B. L. Osofsky, 
University of South Alabama Problem Group, and the proposers. 


Simplices, Circumspheres, and Centroids 


E 2987 [1983, 133]. Proposed by George Tsintsifas, Thessaloniki, Greece. 


Let S, = A,A,...A,,, be an n-simplex in R” and M a point inside its circumsphere S: 
(0,R). The straight line A;M intersects the sphere (0, R) at the point A’. We denote K = 
Lt |A, M|/|MA‘|. Let G be the centroid of S,. Prove: 


(a) K>n+1_ ifand onlyif M lies outside the sphere (s) with diameter OG. 
(b) K=n+1_ if and only if M lies on the sphere (s). 

(c) K<n+1_ ifand onlyif M lies in the interior of the sphere (s). 

Solution by Leon Gerber, St. John’s University, Jamaica, New York. 


cn LAM? ay 
1 [M4 |4,M| 2, R? —(Mo)? 
For any point P let p = OP and p = |OP|. Then 


n+1 n+1 


K(R-m’)= ¥ |lm- 4,’ = XY (m? - 2m-G,; + a?) 
i=] i=] 
=(n+1)m* -—2(n+1)m-¢+(n4+1)R?’ 
implies 
A4n+1) _, n+1 \’ 2(n + 1) 
2_ “AY Tt) es ers 2 p2_ “AAT 4) p2 
me Kint+1” +(e sq) s R°- Kinvi’ 
n+1 2 2 
(gE 4) 06 =f, 
or 
| n+1  _|/ 5 
m— =f*. 
K+n4+1 


That is, for a fixed value of K, M lies on the sphere with radius f and center F given by 


ra K _ n+1 
f= Ryn? * Kent 


> 
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ADVANCED PROBLEMS 


Solutions of these Advanced Problems should be mailed in duplicate to Professor D. H. Mugler, Department 
of Mathematics, University of Santa Clara, Santa Clara, CA 95053, by March 31, 1986. The solver’s full 
post-office address should be on each sheet. 


6503. Proposed by Gérard Letac, Universite Paul-Sabatier, Toulouse, France. 


The Markov chain (X,,)?_ 9 on the interval (—1,1) is such that, given X,, X,,, is uniformly 
distributed on (X,,,1) if X, < 0, and on (—1, X,,) if X, > 0. Find the stationary distribution. 


6504. Proposed by Michael S. Perkins, Stanford University. 


Find each function f(z) which is analytic at z = 0 and for which nf, £ converges uniformly 


to f(z) in some neighborhood of z = 0, where f, = f and f, is the composition of f with itself n 
times. 


SOLUTIONS OF ADVANCED PROBLEMS 
The Constant Term in an Expansion 


6458 [1984, 259]. Proposed by Mark Haiman and David Richman, MIT. 
Determine the constant term f(p) of 
( x + yt x! +y')*, 
where x and y are noncommutative variables. 


Solution by Allen J. Schwenk, Office of Naval Research, Arlington, Virginia. The expansion of 
this expression produces 4” terms which we shall interpret as walks of length p in an infinite tree 
that has four edges incident at each vertex. Each edge contributes one of the four factors x, x~', 
y, or y ' as it is traversed. The same edge traversed in the opposite direction contributes its 
inverse factor. Now a term reduces to the constant 1 if and only if it represents a closed walk, that 
is, it terminates at its origin. Of course this means p = 2n must be even, and f(2n) is the number 
of possible closed walks of length 2n. 

We shall use generating functions to enumerate closed walks. Let A(z) = X%_,a,,,z°" denote 
the generating function for closed walks of length 2 with a prescribed first step uv and returning 
to the origin u exactly once. Now if v is visited k + 1 times before returning to u, between each 
pair of occurrences of v there must be a closed walk using one of three available first steps. Thus, 
a closed walk returning k + 1 times to v before returning to u is counted by z*3*A*(z). Summing 
for all k > O we find 


oe) 2 
Zz 
A(z) = 22 3*AK(z) = ———. 
Kn 1 — 3A(z) 
Observing that A(0) = 0 we conclude that 


1 —(1 — 1222)'” 
(ey Lae)? 


Now let B(z) = L%,b,,z7" be the series for all closed walks starting at u with repeated 
returns allowed. A walk that returns exactly k times is counted by 4*A*(z). Thus, 


Bz)= ¥ aa(2) = aay. 
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from which it follows that 

2(1 — 12z2)'7* -1 
1 — 162? 

From this we easily derive the recurrence relation 


B(z) = 


2 n 
bay = 16Ban-2 — 3G (7) 


with the initial condition b) = 1, and from the recurrence relation we obtain the formula 
: 2 2k \ 2kyen-k 
f(2n) =b,, =16"- > soa )3 16” ". 
np 2k - AN k 
This approach extends easily to handle m noncommuting variables. The constant term of 
(2, (x; + x7 +))*” is found to be 


(2m)2" - x eel \(em ~1)*(2my2"2*, 


Also solved by L. E. Clarke (England), S. V. Kanetkar, O. P. Lossers (The Netherlands), J. G. Mauldon, 
R. W. K. Odoni (England), James Propp, B. E. Sagan, Lajos Takacs, A. Tissier (France) and the proposers. Partially 
solved by Richard Staum and D. H. Wiedemann (Canada). 


On Infinite Values of a Derivative 


6459 [1984, 314]. Proposed by Mark Bowron (student), University of Wisconsin. 


Let C be Cantor’s ternary set, and let g:[0,1] — [0,1] be the Cantor-Lebesgue function. 
Define a measure p on C by setting 1(.S) = mg(S) for S C C, where m is Lebesgue measure. We 
know that g’(x) = 0 ae. (m) in [0,1]. Show that 29’(x) = o0 ae. (m) in C. 


Solution by F. S. Cater, Portland State University, Portland Oregon. The result is an im- 
mediate consequence of Theorem 4.6 on page 271 of S. Saks, Theory of the Integral, second 
revised edition. This theorem states that if a nondecreasing function f has a finite Dini derivative 
at each point of a set E with mE = 0, then mf(E) = 0. 


Ivan Netuka and Jifi Vesely (Czechoslovakia) and Richard B. Tucker observed that the result follows easily from 
Theorem 8.10 on page 169 of W. Rudin, Real and Complex Analysis, second edition. Also solved by David Moews, 
Alexander V. Stanoyevitch, Wm. Douglas Withers, and the proposer. 


A Pathological Function 


6460 [1984, 371]. Proposed by L. Richard Duffy (student), University of Chicago. 


It is well known that a discontinuous /inear real function (1.e., one satisfying f(x + y) = f(x) 
+ f(y), all x, y € R) must be unbounded on any open interval. Is there a linear function which is 
so pathological that it actually assumes a// real values on any open interval? 


Solution by Detlef Laugwitz, Technische Hochschule Darmstadt, West Germany. The answer is 
“yes”. Let B be a Hamel basis of R over @ with 1 € B. Since B has the same cardinality as R, 
we may write B = {b,|x € R} with b) = 1. Define a Q-linear function f by setting f(b.) = x. 
For each x there exists a rational number qg such that b, + q lies in a given interval, and 


f(b, + 9) = f(b) + af(1) = f(b) = x. 


Also solved by the proposer and twenty-four others. A number of solvers pointed out that a solution of the 
problem is contained in a paper by F. B. Jones, Connected and disconnected plane sets and the functional equation 
f(x) + f(y) = f(x + y), Bull. Amer. Math. Soc., 48 (1942), 115-120. 
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An Exponential Sum Inequality 


6461 [1984, 371]. Proposed by L. E. Mattics, University of South Alabama. 
Let p be a prime and suppose 1 < h < p where the order of h modulo p is ( p — 1)/v. Prove 
that 


p-1 
y e2mih /P 
m= 


(v-1)~p+1> 


Solution by Harald Niederreiter, Austrian Academy of Sciences, Vienna, Austria. More general 
results are available in the literature. Let M> 2, b, and A be integers with gcd(b, M) = 
gcd(h, M) = 1 and with the order of h modulo M being + = $(M)/v, where ¢ is Euler’s totient 
function. Then it is shown in [1, Theorem 8.3] that 


< VM - —(/M -1), 


t 
> e2mibh /M 
m=1 


The desired result follows immediately by putting M = p, b = 1, and noting that modulo p the 
powers h”,m = 1,2,...,p — 1, run through »p full periods. 


Reference 


1. H. Niederreiter, Quasi-Monte Carlo methods and pseudo-random numbers, Bull. Amer. Math. Soc., 84 
(1978), 957-1041. 


Also solved by Karl Dilcher (Canada), Ronald J. Evans, S. V. Kanetkar, O. P. Lossers (The Netherlands), 
R. W. K. Odoni (England), James C. Smith, and the proposer. 


REVIEWS 


EDITED BY ALLAN L. EDMONDS AND JOHN H. EWING 
COLLABORATING EDITOR FOR FILMS: SEYMOUR SCHUSTER 


The Lore of Prime Numbers. By George P. Loweke. Vantage Press, New York, 1982. vii + 259 pp. 
$17.95. 


CLAUDIA SPIRO 
Mathematics Department, State University of New York at Buffalo, Buffalo, NY 14214 


The ancient Greeks defined a prime as an integer p greater than 1, which cannot be factored 
into a pair of integers, both of which exceed 1. Integers greater than 1 which are not primes are 
called composite numbers. Every integer n > 1 can be written as a product of prime numbers, and 
this representation of n is unique, except for the order of the factors. Thus, the primes freely 
generate the semigroup of positive integers under multiplication. This result, which was known to 
Euclid, is called the Fundamental Theorem of Arithmetic. 

Euclid also knew that there are infinitely many primes; however, there is still no known 
“simple rule” for finding the nth prime. For example, it is natural to consider a nonconstant 
polynomial in n with integer coefficients. But, it can readily be shown (see Theorem 21 on p. 18 of 
[9]) that any such polynomial takes infinitely many composite values. As an interesting and 
well-known example (see pp. 106-7 of [16]), we remark that m* — m+ 41 is prime for m = 
1,2,...,40, but that 41 evenly divides m* — m + 41 whenever m is an integer multiple of 41. It is 
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important to mention that there are nonconstant polynomials with integer coefficients in several 
variables such that when they are evaluated at positive integers, and the resulting value is positive, 
it is prime. For a good, accessible account of these results, we refer the reader to [5]. It is also 
known [4] that no nonconstant rational function of one variable, with real coefficients, can assume 
only prime values at the integers. W. H. Mills [17] has shown the existence of a real number A > 0 
such that [A*’] is prime for all positive integers n, where [x] denotes the greatest integer not 
exceeding x. His proof relies on the result of Ingham [11] that there is a positive constant c, such 
that for all real x > 1, there is a prime between x and x + c,x°/*. Unfortunately, to find the A in 
Mills’ result (or at least to approximate it), the very primes that his rule gives have to be found 
first. 

Although it is not easy to predict the terms of the sequence of primes, this sequence behaves 
well on the average. To indicate what we mean by “on the average”, we first write 7(x) for the 
number of primes not exceeding the positive real number x. In addition, if f(x), g(x) are 
real-valued functions of x, we write f(x) = o(g(x)) to mean that 


mm 16%) - 
ae g(x) ° 


On the basis of numerical tables, Carl Friedrich Gauss conjectured that 


(1) lim 212) 


xo X/log x 


> 


where log x signifies the natural logarithm of x. An account of his reasoning is given on pp. 4—5 
of [14]. Equation (1), which can be restated as 


(2) n(x) G7 = lies): 


has come to be called the Prime Number Theorem. It was first proved, independently, by J. 
Hadamard [7] and by Charles de la Vallée Poussin [20] in 1896. Their proofs required complex 
variable theory. In 1899, de la Vallée Poussin [21] improved the estimate (2) by showing that there 
are positive constants c, and c, such that 


(3) 


for x > 2. We remark that 


(6-5 
2 logt log x (log x)’ (log x)” 
and that the right side of (3) grows more slowly than x(log x)~*, so that (3) implies that 


n(x) esa). 


logx (Jog x)? (log x)’ 


In fact, (3) can be made to yield an asymptotic expansion for 7(x)/x, descending by powers of 
log x. 

Much effort has been exerted to learn what the smallest function is with which the nght side of 
(3) can be replaced. In 1958, in independent work, I. M. Vinogradov [22] and N. M. Korobov [13] 
showed that the right side of (3) can be replaced by 


_ b 
C4 xe C5 (log x) > 


where b is any constant less than 3/5, and c, and c, are positive constants which may depend on 
b. It has been conjectured that in fact, the right side of (3) can be reduced to the quantity 
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c<¥x logx, for some positive constant c,. This improved version of (3) was shown by von Koch in 
1901 [12] to be equivalent to the so-called Riemann Hypothesis. To state this hypothesis, we 
define the Riemann zeta-function by 


for the real part of s exceeding 1. This function has an analytic continuation to {s complex: 
s #1}, with a simple pole of residue 1 at s = 1. Its connection with primes can be seen in the 
beautiful formula due to Euler [6], which states that 


&(s)= TT Q-p vy”. 
Pp prime 
Its proof is given on p. 231 of [1]. The Riemann Hypothesis is the conjecture that §(s) has no 
zeroes with real part exceeding 1/2. It is supported by a very large amount of evidence [15], [2], 
[3]. 

Since there are roughly x/logx primes p < x, the average distance between consecutive 
primes not exceeding x is logx. If p > 3, then the next prime must be at least p + 2. It is 
conjectured that p + 2 is prime infinitely often. Pairs of primes of the form p, p + 2, like 3, 5 and 
101, 103, are called twin primes, and the as yet unproven conjecture about the existence of 
infinitely many such pairs is called the Twin Prime Conjecture. J. R. Chen (see p. 4 and Chapter 
11 of [8]) has shown that there are infinitely many p for which p + 2 has at most two prime 
factors. If p > 5, then p, p + 2, p + 4 cannot all be prime, since 3 evenly divides one of them. 
However, it is conjectured that there are arbitrarily large p for which p, p + 2, p + 6 are all 
prime. These conjectures can be generalized considerably (see Chapter 1 of [8)). 

Ir the other direction, we can ask how large the gap between consecutive primes can be. There 
is conjectured to be a prime between any two squares of positive integers. Let p* denote the next 
prime after p. Then this conjecture would imply that p* < p + 4p'”? for all p, and would follow 
if we could show that p* < p + 2p'/? + 2 for all p. In [10], D. R. Heath-Brown and H. Iwaniec 
showed that for any real number @ > 11/20, there is a constant c,, possibly depending on @, so 
that p* < p + c,p”. In particular, there is a prime between any sufficiently large cubes of integers 
(this fact was already known to Ingham [11] in 1937). 

The ancient Greeks defined a perfect number to be a positive integer n which is the sum of all 
positive integers d < n which evenly divide n. Thus, the first few perfect numbers are 6, 28, 496, 
8128, 33550336. The ancient Greeks proved that every even perfect number has the form 
2?-1(2? — 1), where p and 2? — 1 are primes. It is still unknown whether any odd perfect 
numbers exist. The numbers 2” — 1 are called Mersenne numbers, after a seventeenth century 
Frenck mathematician, Marin Mersenne, who listed values of p for which he thought 2? — 1 to 
be prime. There are 29 values of p now known for which 2” — 1 is prime, the largest of which is 
p = 132,049. The corresponding Mersenne prime—which is also the largest known prime—has 
slightly less than 40,000 digits, and the corresponding perfect number has slightly less than 80,000 
digits. The second largest known value of p which gives a Mersenne prime is 86,243. For a table 
of the others, see [18]. The values 132,049 and 86,243 were found by David Slowinski, with the aid 
of supercomputers made by Cray Research in Chippewa Falls, Wisconsin. 

Finding large primes used to be of merely theoretical interest, until 1977, when Robert L. 
Rivest, Adi Shamir, and Leonard M. Adleman discovered an application to national security. In 
their system, two large primes, p and gq, are multiplied together. It follows from Euler’s 
generalization of Fermat’s Little Theorem that for any positive integer a, we have 


(4) q*e—WN4q-)+1 = a(mod pq). 


Hence, if we choose any two positive integers b and c with be = (p — 1)(q — 1) + 1, then we can 
use the following code: 


To send the information “a”, for 0 < a < pq, find the remainder of a’ on division by 
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pq, and communicate that. To decode the information received, raise it to the cth power, 
and find the remainder on division by pg. That remainder is a. 


For example, if p = 13, g = 19, then 
(p-—1)(q-1) +1 =217 =7- 31, 


so that we can take b = 31, c = 7, pq = 247. To send “2”, we observe that 27! = 193 (mod 247), 
and communicate “193”. The receiver then notes that 193’ = 2 (mod 247), to decode the message. 
The elegant thing about this type of code is that even if the enemy learns how to encode messages, 
it is much easier to find two primes with—say—200 digits each, and multiply them together, than 
to factor the resulting 400 digit number. It will shortly be possible to determine if a 200 digit 
number is prime in 8 minutes [19]. To factor even a 100 digit number can currently take decades. 
For a fuller account, we refer the reader to [18]. 

Although the book on which this article is based was not written by a pure mathematician, it 
contains a great amount of historical information about the lore of prime numbers. The book 
contains few, if any, proofs. Except for a few sections, all that a student needs to read it is a good 
understanding of the first year of high school algebra, although it should also be enjoyable reading 
material for the undergraduate student who has had a semester of elementary number theory. 
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Problem-Solving through Problems. By Loren C. Larson. Problem Books in Mathematics. 
Springer-Verlag, New York, 1983. xi + 332 pp. $34.00. 


HARLEY FLANDERS 
Department of Mathematics, Florida Atlantic University, Boca Raton, FL 33431 


Springer-Verlag, the most important mathematics publishing house of the twentieth century, 
the house that brought us the incomparable Polya and Szeg6, has now, in its new series Problem 
Books in Mathematics, published the volume under review. The stated purpose of the book is “to 
isolate and draw attention to the most important problem-solving techniques. ..undergraduate 
mathematics. ..interesting examples and problems not easily found in other sources... material is 
especially appropriate for students preparing for mathematical competitions.” 

When I received the volume, I looked forward to finding some interesting new problems, some 
excellent solutions, some really useful text material for students. I shall try to share my 
disappointments. 

Chapter 1 has twelve heuristics: Search for a Pattern, Exploit Symmetry, Choose Effective 
Notation, etc., each illustrated by several, often inappropriate, worked examples and some 
problems. (I can’t understand the lack of an answer section—what good are the roughly 470 
exercises with solutions scattered in the literature or not even referenced?) 

Under Search for a Pattern, the author wisely gives four examples in which to search for a 
pattern means to examine low cases. Then follows his fifth example (1.1.5, page 7): Let S be a set 
and * be (a) binary operation on S satisfying the two laws x * x = x forall x in S,(x* y)*z = 
(y *z)*x forall x, y,z in S. Show that x * y = y * x for all x, y in S. In spite of what the author 
says, solving this problem really seems a different matter, and the author gives essentially no 
insight into how to solve such problems. (Using a lower case x next to * is a model for confusing 
notation.) 

I think that in a book like this, the author usually ought to tell his readers what are hard 
problems and what are not so hard problems. In the heuristic Consider Extreme Cases, the very 
first example is Problem 1.11.1 (page 50): Given a finite number of points in the plane, not all 
collinear, prove there is a straight line which passes through exactly two of them. That this yields to 
minimizing the distance between a point P of the set and a line through two other points of the 
set, but not through P, does not detract from the fact that it is a very hard problem that took 40 
years from Sylvester’s 1893 conjecture to Gallai’s solution. The author could tell us this story. 

Nowhere in the heuristics chapter, or anywhere else, is there a mention of George Polya. This is 
unforgiveable. 

The chapters after the first take up specific topics and apply the material to problem solving. 
For instance there is a chapter on inequalities, a chapter on summation of series, a chapter on 
geometry, etc. I find time and again that the examples are misplaced and have better solutions by 
other techniques, discussed in this very book. What I find particularly annoying are long, cloddish, 
computational solutions. Frequently the author gives the distinct impression that grinding out 
computation is the real heart and difficulty of a solution, whereas the essential idea just drops 
effortlessly out of “heuristics” and is passed off with “Notice that...”. Surely every single 
solution in a book whose purpose is to teach problem solving should emphasize the key ideas and 
never snow the reader with calculations. No solution should be published if it is clumsy or there 
remains any hope of replacing computation by concepts. (It is possible that no mathematician 
besides the author checked the manuscript before publication—no one’s help is acknowledged in 
the Preface. The large number of misprints suggests this and a careless proofreading job.) 

I happened to be working with some students on preparation for the Putnam Competition 
when Larson’s book appeared. I used a number of his solutions as models of how not to solve 
problems. The book is a rich vein for such models; perhaps this is the most appropriate use of the 
text. Some examples follow; they are not exceptional, but typical, cases. Consider Problem 3.5.5, 
page 117: 
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Let G, = x"sinnA + y"sinnB + z"sinnC, where x, y,z, A,B,C are realand A + B+ Cisan 
integral multiple of 7. Prove that if G, = G, = 0, then G,, = 0 for all positive integral n. (Author’s 
source: 1980 USA Olympiad.) 


The author’s solution starts with the reasonable observation that G,, is the imaginary part of 
xreind + ynein® 4 7"e!"©, but then gives a horrendous, brute-force, unmotivated computation 
that in no way shows where the result is coming from. Yet, if we set 


a = xe!4, B = ye'®, y = ze”, 

it is natural to consider 
f(T) =(T- a)\(T- B)\(T-y) =T*? — oT’ + oT - 6, 

and the sums of powers s, = a” + B” + y”. Now we are on well-trodden ground, and in case we 
have forgotten Newton’s relations 

Sn+3 — U5n42 + S5n41 — O35, = O, 
then multiplying f(T) by T” and summing on T = a, 8, y will jog our memories. The hypotheses 
easily imply with almost no computation that o, = 5, is real, o, is real, s, is real, so 
206, = s} — s» is real. So is s, = 3, and then by induction 50 is s,. 

Even worse is the author’s solution to Problem 5.3.3, page 172: 


Express 


M83 


1/[ m?n + mn* + 2mn] 
1 


oe) 
du 
n=1 


m 


as a rational number. (Author’s source: 1978 Putnam competition, B2. With all due respect to the 
Putnam, “Evaluate” without “rational number” would have been a better statement.) 


The author’s solution is a full page orgy of numerical computation, starting with a partial 
fraction decomposition of the denominator mn(m+n-+ 2). The final answer seems like a 
miracle. Aside from the obvious ugliness of an unbroken string of complicated equalities 
stretching over 17 lines, with at least one misleading misprint, all of which could have been 
organized into something far simpler and intelligible, how on earth would the author’s method 
have applied to denominator mn(m+n+ 7) or mnp(m+n-+ p + 3)? (Heuristic 12: Gener- 
alize!) 

Now if we, very naturally, set 


f(x) = x x x™tntk*) 7 mn(mtnt+k-+1)], 


m=l1 n=1 


then one differentiation clears away most of the foliage, and we have 


S, = > > ! ['x*[in( — x)]° dx 


mn(m+n+k-+1) 7 

= fa — x)“(In x)’ dx. 

0 

It is a routine binomial expansion and integration from here to 

k J 

—1 
jeo(jt iy VV 
Now let’s look at Problem 6.6.4, page 219: 


Let f be differentiable with f continuous on [a,b]. Show that (if) there is a number c in (a, b| 
such that f’(c) = 0, then we can find a number & in (a, b) such that f’(€) = [f(€) — f(a)|/(6 — a). 


678 HARLEY FLANDERS 


(Author’s source: S. Penner, Math. Mag., 49, p. 150.) 


The solution fills two full pages, is repetitious and I think misses the point. First of all, the 
denominator b — a can be anything. Second of all, € can be found in (a, c]. Thus (Heuristic 4: 
Modify the Problem) the problem should be: 


Let f(x) be continuous in [a,b], differentiable on (a, b\, with f’(b) = 0, and let K > 0. Prove 
there is an x in (a,b) such that f’ (x) = K[ f(x) — f(a)}. 


Clearly there is a simplification of notation (Heuristic 5) and no loss of generality in taking 
a= 0 and f(a) = 0. Set g(x) = e **f(x), a device the author mentioned a few pages earlier. If 
f’(x) > Kf(x) on (a, b), then g’(x) > 0 so g(b) > g(a) = 0, hence f(b) > 0. By the Darboux 
property, f’(b) > Kf(b) > 0, which is impossible. Similarly, f’(x) < Kf(x) on (a, b) is impossi- 
ble, so the conclusion follows. 

In the sections on geometry, the author outdoes himself with clumsy notation and computa- 
tional instead of conceptual solutions. Imagine using IPO} for the length of PO. and next to it IPO 
for the vector obtained by rotating PO about P through 90°. See p. 306 where such awkward 
notation almost forces misprints. Let’s look at a nice little problem from the 1977 USA Olympiad, 
Problem 8.3.8, page 308: 


Prove that if the opposite sides of a skew quadrilateral have the same lengths, then the line joining 
the midpoints of the two diagonals is perpendicular to these diagonals. 


The author’s start is good: Let A, B,C, D denote the vertices of the quadrilateral (in order), 
and let P and Q be the midpoints of AC and BD respectively. Now Pll replace 1 1/2 pages of 
computation by a conceptual solution (Heuristic 6: Exploit Symmetry): Let p denote the affine 
reflection in the plane BPD that transposes A and C. Let o denote the affine reflection in the 
plane AQC that transposes B and D. Set A = op. Then 


A: 4 >C, CoA, BoD, D->B, P-P, O-@. 


By hypothesis, AB = CD and BC = DA, so it follows that A preserves the lengths of the six sides 
of the tetrahedron ABCD. Therefore » is a rigid motion, clearly not a reflection. Since A 
preserves P and Q, it is a rotation about PQ. Since A transposes B and D, BD is perpendicular 
to PQ. Similarly so is AC. 


I conclude that the author has failed to meet his goals. Some of his problems, particularly the 
many taken from Putnam competitions and Olympiads are excellent, but not at all hard to find in 
the literature. As for problems from the periodical literature and other sources: some are good and 
some are bad; it is not an outstanding collection. As for the solutions: The long ones can usually 
be shortened; the short ones can be incomplete, even incorrect. Consider Problem 2.5.4, page 76: 


Prove that any positive rational number can be expressed as a finite sum of distinct terms of the 
harmonic series. (Author’s source: 1954 Putnam, B6.) 


This famous theorem on Egyptian fractions is a tricky proposition to prove, and the author 
could at least have consulted the Putnam Competition book, p. 398, if not one of the many 
number theory books that supply a proof. Not at all; he polishes it off in a few lines: “Let m/n 
be any positive rational. Then m/n =1/n+1/n +--+ +1/n is a sum of harmonic terms with 
n — 1 (sic) duplications. Recursively expand all duplicates by the identity 1/n = 1/(n + 1) + 
1/n(n + 1) until all terms are distinct.” Aside from the misprint, this is no proof at all, and I see 
no way to make it into a proof. The very last problem on a Putnam competition is seldom trivial 
at all, let alone that trivial. New heuristic: Check Your Work. 


(The author is grateful to Murray S. Klamkin and Stephen C. Locke for helpful suggestions.) 


LETTERS TO THE EDITOR 


Material for this department should be prepared exactly the same way as submitted manuscripts (see the inside 
front cover) and sent to Professor P. R. Halmos, Department of Mathematics, University of Santa Clara, Santa 
Clara, CA 95053. 


Editor: 


While I found the approach taken by Professor Meyer Dwass in his note on the convolution of 
Cauchy distributions [1] both novel and interesting, I am not convinced that it is simpler 
algebraically than the direct method to which he refers; namely the partial fraction decomposition 
of the integrand in that convolution. I, like Professor Dwass, could not find the details worked out 
anywhere; and so perhaps this would be the time and place to present them. 

The convolution integral to be evaluated is 


h(u) =f" fsa) flu — x,b)dx, 
a a ey 
mo [a> + x?2]| b? +(u— x) 


The partial fraction decomposition of the integrand is 
1 _2Ax +B, 2C(u—x)+D 
[a2 + x?]| B +(u—x)| a? + x? b? +(u— x) 
with 
A=C=u/A, B=(u?-a’*+b*)/A, and D=(wia—B)/, 
where the denominator in each coefficient is 
A= l(a +b)’ + u?|[(a —b) + u?|. 
Integration now yields 


a~ + x? 


h(u) ~ ,eonn| 8 


—a(u? + a* — b*) tan™! 


+ b(u* — a* + b*) tan“? = 


u— x 


b 


== [o(u? ~a? +b?) +a(u2 + a? — b?)| 
a+b 


= ———_————_ = f(u,a+t+b), as desired. 
m|(a +b)? + u? | 


Reference 
1. Meyer Dwass, On the convolution of Cauchy distributions, this MONTHLY, 92 (1985) 55-57. 


Roger B. Nelsen 
Department of Mathematics 
Lewis and Clark College 
Portland, OR 97219 
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156. MISCELLANEA 


Nothing is more repellent to normal human beings than the clinical succession of definitions, 
axioms, and theorems generated by the labours of pure mathematicians. The logical rigour 
achieved by such investigations is of the highest value, but can seldom come before we have 
grasped the idea in itself. Geometry existed before Euclid, and analysis before Cauchy; the 
significance of an irreducible representation of a group can be understood without a proof of 
Schur’s Lemma in all its generality. 


—J. M. Ziman, Elements of Advanced Quantum Theory (Cambridge, 1969), p. vii. 


157. MISCELLANEA 


I maintain that we must be careful of a problem’s first contact with our minds. We should be 
careful of the first words a question utters in our minds. A new question arising in us is in a state 
of infancy; it stammers; it finds only strange terms, loaded with adventitious values and 
associations; it is forced to borrow these. But it thereby insensibly deflects our true need. Without 
realizing it we desert our original problem, and in the end we shall come to believe that we have 
chosen an opinion wholly our own, forgetting that our choice was exercised only on a mass of 
opinions that are the more or less blind work of other men and of chance. 


—Paul Valéry, “Poetry and Abstract Thought,” 
Paul Valery, an Anthology, Princeton University 
Press, 1977, p. 138. 


158. MISCELLANEA 


Clerihew Corner 


George Boole 

was nobody’s fool: 

but never forget— 

his mathematical legacy is the empty set. 


Robin Harte 

Department of Mathematics 
University College 

Cork, Ireland 
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returning to the starting point when d = 3. The authors interpret this theorem as 
a statement about electric networks, and then prove the theorem using tech- 
niques from classical electrical theory. The techniques referred to go back to 
Lord Rayleigh who introduced them in connection with an investigation of 
musical instruments. 

In Part I the authors restrict themselves to the study of random walks on finite 
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LOCAL SYMMETRY OF PLANE CURVES 


P. J. GIBLIN AND THE LATE S. A. BRASSETT 
Department of Pure Mathematics, The University of Liverpool, P.O. Box 147, Liverpool L69 3BX, England 


1. How can we perceive, recognize, describe or represent the shape of an object? These 
questions are of great interest to workers in theoretical biology and in the perception of patterns 
—whether by people or by computers. Organisms have simple shapes which are not usually 
equilateral triangles, squares or other basic mathematical shapes, so new concepts are needed to 
capture their simplicity. Organisms move and grow: what is preserved as their shapes change in 
these ways? Shapes that we all immediately recognize as the same (such as all the instances of the 
letter m in this journal) are in fact subtly different: what do they have in common? 

In order to tackle questions such as these, a number of ways have been suggested for reducing 
the huge amount of information carried by a shape down to a “skeleton” of crucial, possibly 
discrete, information which can be more readily assimilated. For example, H. Blum [1] suggested 
that a planar shape could be studied by fitting circular disks inside it and noting the locus of their 
centres, which he called the “sym-ax” or symmetric axis transform (Fig. 1, left). A wiggling worm 
can be described, perhaps, by keeping the radii fixed and flexing the sym-ax; a growing worm by 
changing the radii but leaving the sym-ax unaltered. 
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The mathematical questions raised by such an imaginative idea are very interesting. Suppose 
we take a planar shape, bounded, let us say, by a simple closed curve. What forms will the locus of 
centres take? 

Whenever a circle is bitangent to a curve, that is tangent to the curve at two different places 
(Fig. 1, right), there is an “infinitesimal axis of symmetry” which bisects the angle between the 
tangents to the curve (or the circle) at the two places. The centre c of the circle lies on Blum’s 
sym-ax, at any rate if the circle lies inside the curve close to the points of contact. For 
mathematical purposes we prefer to drop the latter requirement, that is we allow all bitangent 
circles including circles such as those in Fig. 5. We also change the name to symmetry set. Thus: 


The symmetry set of a plane curve is the locus of centres c of circles bitangent to the curve. 


Of course in special cases the circle could be tangent to the curve at three (or more) different 
places. Taking three places in pairs, such a point would lie on three “branches” of the symmetry 
set (and for four places, six branches). 


Peter Giblin joined the faculty at Liverpool University in 1967 and has been there ever since, apart from a year 
as Visiting Professor at the University of North Carolina at Chapel Hill. His special interests are in the geometrical 
applications of singularity theory, many of which can be effectively illustrated, and even aided, by the use of 
computer graphics. 

Stephen Brassett was a final year undergraduate at Liverpool during the time that the work described in this 
article was done. He wrote a project, under Giblin’s supervision, in which he used computer graphics to study 
symmetry sets and midloci. Besides computer programming, Stephen had an intense interest in butterflies. From the 
time of his graduation in 1984 until his tragically early death in January 1985 he worked in industry in the field of 
computer aided design. 
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Because the symmetry set, or the sym-ax, suffers from various technical shortcomings (for 
example it reflects poorly the perceived symmetry of a rectangle), M. Brady in [2] suggested a 
simple variant: instead of the centre of the circle we take the mid-point m of the chord of contact. 
(The reader will find many more references to follow up in [1] and [2].) He called the resulting 
locus the “smoothed local symmetry”, but we shall adopt a more directly descriptive term. 


The midpoint locus (midlocus for short) of a plane curve is the locus of midpoints m of chords of 
contact of bitangent circles. 


Fic. 2. Symmetry set and midlocus for an ellipse. 


Let us consider a very simple example, when the curve is an ellipse (Fig. 2), say given by 
1x? + y? = 1. Evidently—by symmetry!—we need only consider circles centred on one or other 
coordinate axis; let us take the x-axis for which the contact is internal to the ellipse. The circle 
(x — a)? + y? =r?’ has slope —(x — a)/y at (x, y) and the ellipse has slope —x/4y at (x, y), 
so if they meet at a point (x, y), where the tangents coincide, then x = 4a/3. From the equation 
of the ellipse we find y* = 1 — 4a’/9, so that |a| < 3/2, and the circle touches at (44/3, 
+(1 — 4a?/9)'/*). Thus the portion of the x-axis between — 3/2 and 3/2 is in the symmetry set. 
Since the midpoint of the chord of contact is at (4a/3, 0), the midlocus contains the interval from 
—2 to 2 on the x-axis, that is precisely, and not surprisingly, the major axis of the ellipse. The full 
symmetry set and midlocus are the solid lines in Fig. 2. The crossover point on the symmetry set is 
the centre of two concentric circles each bitangent to the ellipse. Notice also that, as a > 3/2, the 
points of contact of circle and ellipse tend to coincidence at (2, 0): the circle centre (3/2, 0) has all 
four points of intersection with the ellipse coincident at (2,0). This four-point contact results in an 
abrupt end to the symmetry set and midlocus. Thus the four points e,; in Fig. 2 are strictly in the 
closure of the symmetry set and the corresponding points v,, the vertices of the ellipse, in the 
closure of the midlocus, as these are defined above. 

The dashed curve in Fig. 2 is the evolute of the ellipse, that is the locus of centres of circles for 
which three of their intersections with the ellipse have coincided at a single point. These are circles 
with three-point contact, or osculating circles, or circles of curvature. Note that the evolute has 
cusps at the points c; corresponding to vertices v,. 

Fig. 3 shows a more complex example, but with the same “cross” shape to the symmetry set 
and midlocus. Again the v, are vertices, where a certain circle has four-point contact with the 
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Fic. 3. Symmetry set and midlocus for a cubic oval (x — 0.9y)? = y — y?. 


curve; the centre of this circle is at c; on the evolute. 

Because we have to consider pairs of points of the curve, the constructions of the symmetry set 
and midlocus are altogether more complicated than that of, say, the evolute. They give rise not 
only to interesting mathematical problems but also to computational problems, and we say a little 
about the latter in §2. 

Suppose we are given a smooth simple closed plane curve and a point p on it. Is there always a 
circle or straight line tangent to the curve at p and at another point p’ # p? In fact the answer is 
yes, and we invite the reader to find a simple proof. This shows that to every point of such a curve 
there corresponds either a point of the symmetry set or a limit point “at infinity” (corresponding 
to a bitangent /ine). The point is always finite for the midlocus, for we can take the midpoint of 
the chord of contact in the case of a bitangent line (compare Fig. 18, right). 

In [5] an elaborate mathematical tool is used to study the symmetry set for curves in the plane 
and also surfaces in 3-space (where circles become spheres). One of the results is the following 
(see also [3] p. 137). Consider a point x in the plane, on the symmetry set of a generic plane curve 
(i.e., practically any curve). Then, after a diffeomorphism of a neighbourhood of x in the plane, 
the symmetry set looks like one of the five pictures of Fig. 4. Note that the triple crossing, 
corresponding to a circle tangent at three places to the curve, cannot be removed by a small 
perturbation of the curve. Note also that the last picture shows a smooth piece of the symmetry 
set coming to an end (as with the ellipse and cubic oval above). For another theoretical 
investigation of the related concept of central set see [11]. 


rd 


Fic. 4. The five local pictures for the symmetry set of a generic plane curve. 


Our purpose here is much less technical. We apply some techniques from calculus, notably the 
implicit function theorem, to describe the symmetry set and midlocus locally in a variety of 
situations, illustrating by means of computer drawn examples. The midlocus is not covered at all 
in [5] and also we consider here a number of “non-generic’”’ examples of the symmetry set. For 
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example if there is a “biosculating circle”, that is, one having three-point contact with the curve at 
two separate places, then both midlocus and symmetry set can have an isolated point (see (3.1), 
(6.3) below). This occurs naturally when the curve is undergoing a deformation and we give an 
example of this in §7, where one part of the symmetry set shrinks to a point and vanishes 
(Fig. 26). 

However, the midlocus has the pleasant property that it is very often smooth (free from cusps), 
though it can have endpoints and crossings as in Figs. 2 and 3. See (3.1). 

One topic not covered here but treated in detail in [5] is the recovery of the curve from its 
symmetry set, by means of an envelope of circles. Another interesting idea is to look directly at 
the infinitesimal axes of symmetry, perhaps as a locus in the dual plane whose points are the lines 
of R?. For example, one can show that this dual locus acquires a cusp precisely when the points of 
contact of bitangent circles are points of equal curvature on the original curve—a pleasantly 
symmetrical condition. See (4.2) (1) below. 

The plane of the paper is as follows. In §2 we review some results on the contact of circles with 
curves and set up various functions and maps with which we shall examine the symmetry set and 
midlocus. In §3 we deduce that the midlocus, and in §4 that the symmetry set, are smooth curves 
in suitable circumstances. In §5 we introduce the fold map from catastrophe theory and apply this 
to the cases where the midlocus has cusps. In §6 we look similarly at the singularities of the 
symmetry set. Finally in §7 we look briefly at families of curves and their evolving symmetry sets 
and midloci. 

The computer pictures were all done using an IBM4341 and CalComp incremental plotter at 
Liverpool University. 


2. Contact of circles with curves. A general reference for this section is [3]. Let y: J > R* bea 
parametrization of a smooth plane curve, that is, y(t) = (x(t), y(t)) where x and y are smooth 
functions and x’(t), y’(t) are never zero for the same ¢t. Here J is an interval of real numbers, or 
possibly a circle for closed curves, but below we stick to intervals. The circle, with centre uy in the 
plane R’, and passing through the point y(7,), has equation 


IIx — uoll? — lly(to) — woll* = 9, 
double bars denoting length and x here denoting the vector previously called (x, y). This circle 
meets the curve y at points with parameter ¢ where 


F(t, uy) = |ly(t) — uoll* — IIly(to) — “oll? = 9. 


Of course F(t), uo) = 0 by construction. If also (d'F/dt')(ty, uo) = 0 for i=1,...,k, but #0 
for i = k + 1, then we say that the circle has (k + 1)-point contact with the curve at y(t, ), or has 
A, contact with the curve there. (The function t > F(t,u 9) has at t) an “A, singularity” in 
standard terminology; compare [3], p. 43.) Dropping the “ # 0” condition above we say there is 
A. , contact. The following will be assumed in this paper—see for example [3], Chapter 2. 


(2.1) Write T, N,« for the unit tangent, unit normal and curvature of y. Then, between the curve 
y and the circle with centre ug passing through y(t,) there is: 

A, contact if and only if ug lies on the normal to y at ty (i.€., Uy = ¥(to) + AN(to), for some 
real number 2). 


A. , contact if and only if uy is the centre of curvature of y at ty (i.€., Uy = (to) + N(to)/K (to ))- 
A. contact if and only if there is A... contact and y has a vertex at to (i.e., &’(ty) = 0). 


A. 4 contact if and only if there is A.., contact and y has a higher vertex at to (i.e., k’’(tg) = 0 
too). 


Now the evolute of y is the locus of centres of curvature, that is the locus of points uy which 
are A., points for y, meaning points for which the above circle has A, , contact with y for some 
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ty. These circles with A. , contact somewhere are the osculating circles or circles of curvature of y. 

Both the symmetry set and the midlocus are concerned with circles not merely touching y (i.e., 
being tangent to y) at one point but at two points or more: bitangent, or tritangent or... . But we 
can adapt the A, notation easily. A circle, with centre u having 2-point contact (A, contact) with 
y at each of two separate places is said to have A? contact, or A, A, contact, and u can be called 
an A? point for y. Likewise an A,A, point u is the centre of a circle having A, contact 
somewhere and A, contact somewhere else: the circle is the osculating circle at the latter point, 
which is not a vertex of y. The centre u of a “bi-osculating circle” is an A} point, or in fact an 
A.A, point, being exactly 45 when neither point of contact is a vertex of y. 

Suppose that a circle Cy), with centre u, and radius 7) > 0, is bitangent to the simple curve y 
(simple: without self-intersections). We write y, and y, for small pieces of y close to the points of 
contact, and parametrize each y, by arc-length ¢; in such a way that, for each i, ¢, =0 
corresponds to the point of contact of C). Orient y, as in Fig. 5, both anticlockwise around Cp. (It 
is possible that one or both of the y, is oriented oppositely to y; thus y,(¢,) = y(+t+4+ k,) for 
constants k, when y is parametrized by arc-length.) Let 7; = y/ be the unit tangent to y,, and N, 
the unit normal, 90° anticlockwise from T;. 


Y2 


2 (0) 


¥, (0) 


Fic. $ 


Consider the function g: R X R, (0,0) — R, 0 defined as follows. (The notation indicates that 
g is defined on some neighbourhood of (0,0) in the source R X R, and that g(0,0) = 0.) 
(t,t) = (1 (4) — ¥2(t2)) ‘(T,(4) ~ T,(t,)), 
the dot being scalar product. Because of orientations, 7, — T, cannot be zero. 
Thus g = Oif andonlyif y, — y, is perpendicular to T, — 7,, 

if and onlyif yy, — y, is parallel to N, — N,, 
ifandonlyif y, — y,+7r(M, — N,) =0, for some real number r + 0, 
ifandonlyif y, +rN, =y,+7rN,, forsome r # 0. 

When this happens, the common value u of y, + rN, is the centre of a circle of radius |r| touching 


y, at y,(¢,) and y, at y,(¢,). Thus ¢t, = t, = 0 gives u = uy and r = 74. Conversely, circles close 
to C, touching y, at y,(t,) and y, at y,(t,) will give 


M1 + rN, = ¥2 + ry, 
rather than, say 
¥, + rN, = ¥ — Ny. 
Thus g = 0 picks out just those pairs (t,, t,) close to (0,0) giving bitangent circles close to C. 


In fact the computational technique used to plot the symmetry sets and midloci in this paper is 
precisely to look for the zeros of a function such as g, globally defined for a particular curve y. 


694 P. J. GIBLIN AND THE LATE S. A. BRASSETT [December 


Thus we fix say ¢, and note changes of sign in g(t,,¢,) as ft, varies. 

The equation g = 0 itself defines a curve through the origin in the (4, t,) plane. If dg/dt, # 0 
at (0,0), then this curve is smooth, and ¢, can be used as a local coordinate on the curve; that is, 
t, is a smooth function of t,, by the Implicit Function Theorem. The same holds with 1 and 2 
interchanged. Using the standard formulas for plane curves (see e.g. [3], Chap. 2 [9]), we find that, 
when g = 0, giving 

M1 — % = —7(N, — Np) at(t,,t), 

dg/dt; = (1 — T, - T,)(1 — re,). 
Now 1 — T, : J, = 0 if and only if 7, = T, (unit vectors!), so the condition for dg/dt, = 0 is 
rk, = 1, 1.e., that the circle centre u is the osculating circle at y,(t,), the radius r being equal to 
the radius of curvature 1/k, at f,. 

Assuming thatdg/dt, is nonzero at (0,0), i.e., that Cy is not the osculating circle at y,(0), we 
can write t, = f,(t,) on the curve g ‘(0), the equation of this curve being g = 0. If t5 = dt,/dt, 
# 0 at t, = 0, then the curve g '(0) is not tangent to the ¢,-axis. Differentiating g(t,,t,(t,)) = 0 
with respect to ¢, we find that 


th = —( dg/dt,)/( dg/dt,). 
Thus the condition t5 # 0 becomes 1 # 7«,(0), which says that C, is not the osculating circle at 
y,(0) either. Examining 1£4/(0), which comes to 79«{(0)/(1 — 7k(0)) when £5 (0) = 0, we find the 
following. 


ty ty ) ty 
y hy = hy ty yy, \ hy 
(i) 


(ii) (iti) (iv) 


Fic. 6. The curve with equation g = 0. 


(2.2) Assume that the circle Cy is not osculating at y,(0), i.e., assume 1 # r%,(0). Then the local 
picture of g~'(0) is a smooth curve in the (t,,t,) plane, as follows (Fig. 6): 


(i) If C, is not osculating at y,(0) and 1/1 is between «,(0) and ,(0). 


A? contact 
1 Oe’ (3 If Cy is not osculating at y,(0) and 1/1) is not between k,(0) and «5(0). 


(iii) If Cy is osculating at y,(0) and «{(0) (1 — m«,(0)) > 0. 


A,A, contact (| 
(iv) If C, is osculating at y,(0) and «{(0) (1 — m«,(0)) < 0. 


This tells us how the points of contact move along y, and y,, taking (say) t, to move steadily 
through 0 from negative to positive values. For instance, in case (iii) the point of contact y,(t,) 
moves from beyond y, (0), back to y,(0) and then it turns round (Fig. 7). Perhaps this leads us to 


2 (0) 


Y2 


Fic. 7. The point of contact turns round. 


1985] LOCAL SYMMETRY OF PLANE CURVES 695 


expect some strange behaviour on the symmetry set or the midlocus. In fact, we shall see that 
there is (generally) a cusp on the symmetry set but that the midlocus scarcely notices the 
turnaround; see (6.2)(ii) and (3.1)(i) below. 

What if the circle C, is bi-osculating, ie., 1 = m%«,(0) = m«,(0)? Then the local picture of 
g _'(0) depends on the second partial derivatives of g at (0,0). In fact, calculation shows that the 
Taylor expansion of g up to degree 2 terms is, when 1/7 = k,(0) = &,(0) = k, Say, 

Bo( t,t) = («i /e)( Ny «Ny — 1)t? +(«5/u)(N, > Ny — 1723, 
where the coefficients of t? are evaluated at t, = t, = 0. Now N, - N, # 1, since N,(0) # N,(0), 
SO g, iS a nondegenerate quadratic form provided «{(0), «5(0) are both nonzero, i.e., provided 
neither y,(0) nor y, (0) is a vertex. Note that this says that the contact of Cy with the curve y is 
exactly AZ. It is a standard fact of differential topology that a function, such as g, with Taylor 
expansion having no linear terms and nondegenerate quadratic terms (a “Morse function’) can be 
reduced, by a change of variables in R*, to its quadratic part g,. (See for example [7], p. 123.) 
That is to say there is a local diffeomorphism 
| @:R?,0 > R?,0 

with 


8($(t,t2)) = g(t, tr); 
the “higher terms” of g have been eliminated by a mere change of variables. Since (g; '(0)) = 
g 1(0), the set g~'(0) is thrown onto g;1(0) by the diffeomorphism $~'. This gives the following. 


ty 
ty 


(i) (il) 
Fic. 8. Curve g = 0 in the bi-osculating case. 


(2.3) In the bi-osculating (A5) case the local picture of g~'(0) is as follows (Fig. 8): 
(1) an isolated point if x{(0)K5(0) > 0, 
(ii) two smooth curves crossing at (0,0) if «{(0)«4(0) < 0. 


Thus in case (i) there are no nearby pairs (¢,,¢,) giving a bitangent circle, while in case (ii) 
there are two values of t, for each t, near 0 (and vice versa). 


3. The structure of the midpoint locus. How can we use the information of §2 to describe the 
local structure of the symmetry set or the midlocus? For the midlocus it is easy. Let 


m:R XR, (0,0) > R? 
be defined by 
1 
m(t,,t,) = 5(1(4) + ¥(t2)). 


(The notation indicates that m is defined on some neighbourhood of (0,0) in R <X R but does not 
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specify m(0,0).) Then m(g~*(0)) is precisely the midlocus. If m is actually a local diffeomor- 
phism, then g™‘(0) and the midlocus have exactly the same local structure up to a local 
diffeomorphism, that is up to a smooth change of coordinates in the plane. By the Inverse 
Function Theorem m will be a local diffeomorphism if and only if the 2 x 2 Jacobian matrix of 
m at (0,0) is nonsingular, 1e., if and only if 7,(0), 7,(0) are independent vectors. This can only 
fail if T,(0) = T,(0), excluded above, or 7,(0) = — T,(0), Fig. 9. 


T, (0) 


Y2 


me 
T, (0) 
Fic. 9. m is not a local diffeomorphism here. 


Consider the A? and A, A, cases, so that g~ (0) is smooth by (2.2). Then for the midlocus to be 
smooth it is not actually necessary that m be a local diffeomorphism. We merely need m restricted 
to g—'(0) to give a smooth curve in R*, and this requires only that m|g~1(0) be an immersion at 
(0, 0), that is that its derivative at (0,0) be nonzero. There are various ways of rephrasing this; for 
example parametrizing g '(0) by t, we can require that 


8(4) = 5 (nt) + n(4(4))) 


have 6’(0) # 0. (Alternatively, Kernel Dg(0,0) intersects Kernel Dm(0,0) precisely in (0,0), 
where D stands for derivative.) This shows that m|g~'(0) is an immersion in the A? and 4,A, 
cases provided: 


(*) T,(0) # —T,(0) or 24 7(x,(0) + k»(0)). 


When this is satisfied, the tangent direction to the curve g  ‘(0) in the (t,, ¢,) plane is taken by m 
(or more precisely by its derivative Dm(0,0)) to the tangent direction to the midlocus. We find: 


(3.1) (i) The midlocus is a smooth curve, for (t,,t,) close to (0,0), in the A? and A,A, cases 
(that is, where C, is not bi-osculating), provided (*) above holds. The tangent to the midlocus has 
direction T,(1 — rk,) — T,(1 — rk,). 

(ii) In the A case (where C, is bi-osculating, but neither y,(0) nor y,(0) is a vertex) and 
assuming T,(0) # — T,(0), the midlocus is locally an isolated point when «{(0)«5(0) > 0, and a pair 
of smooth crossing arcs when «1(0)«5(0) < 0. In the latter case with x{(0) = a? > 0, «5(0) = —B? 
< 0, the tangent directions to the midlocus are BT,(0) + aT, (0). 


Before going on there is one more case for the midlocus worth mentioning. Suppose there is a 
straight line tangent to y, at y,(0) and y, at y,(0)—see Fig. 10, where the curves could be on 
opposite sides of the line but are to be oriented left to right. Clearly there will be bitangent circles 
touching close to y,(0) and y,(0); the centres go off to infinity as the circles tend to the bitangent 


Y1 Y2 


y, (0) A Y2 (0) 


Fic. 10. Bitangent line. 
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line, but the midpoints of the chords of contact have for their limit the midpoint A of the segment 
from y,(0) to y,(0). So we can ask for the local structure of the midlocus near A. Instead of g we 
use the following function: 


h:R X R, (0,0) > R,0, 


where 


h(t,,t2) = (mt) — w2(4%2)) (Mi (4) + (4)). 
Then, provided N, + N, # 0 (which is true at t+, = ¢, = 0, and therefore nearby), h(t,,¢,) = 0if 
and only if there is a circle or line touching y, at y,(¢,) and y, at y,(t,). Using this and similar 
arguments to the above we find the following. 


(3.2) (i) The curve h~‘(0) in the (t,,t,) plane is smooth at (0,0) provided «,(0) #0 or 
k,(0) # 0, i.e., provided the bitangent line is not bi-inflexional. The local picture of h~*(0) is shown 
in Fig. 11, where we assume k,(0) # 0. 

(ii) The midlocus is smooth at A provided y,(0) # y,(0), and its tangent direction is parallel to the 
bitangent line. (See the right-hand half of Fig. 20.) 


ty 
hy 
KK, <0 K\K, > 0 


Fic. 11. The curve h = 0. 


It is of course possible to find the local form of the midlocus as a curve parametrized by f,. It 
comes to 
(((k. — «,)/2k,)t, + higher terms, («,(«, + «,)/2«,)t? + higher terms), 


where «,,«, are at 0. Thus when x, # 0 and «, + «, # O (as well as k, # «,) at t; = ¢, = 0 the 
local picture of the midlocus is as in Fig. 12, where we take x,(0) > 0. (When «,(0) = O the 
midlocus has an inflexion; can the reader discover which way it flexes?) 


ML gi ML "2 ay 
N 1 ML 
Kk, > 0 Kk, < —K, —Ky< Kk, <0 


Fic. 12. Midlocus for a bitangent line. 


(3.3) REMARK. There are various circumstances, detailed by 3.1 and 3.2, in which the midlocus 
is not smooth, namely: (i) the bi-osculating case A,,A, >, (ii) the Aj and A,A, cases when 
T, = —T, and 2 =r(k, + k,), (iii) the bitangent line case when x, = «,. All these are “non- 
generic” or exceptional in a precise sense (compare [4]). If a closed curve y has any two points 
where one of the above three things is true, then almost any sufficiently small perturbation of y 
will have no such points. So the midlocus is, for a “general” curve, always smooth (though it can 
have end-points and self-crossings). However, when we consider general families of curves, a 
non-smooth midlocus can very well occur for certain special curves of the family. We say 
something about non-smooth midloci in §5. 


4. The smooth part of the symmetry set. We now turn to the symmetry set. This is not quite so 
easy, since it depends not only on the points y,(¢,), y,(t,) where a circle is tangent to two pieces 
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Y1> Y2 Of y (as in Fig. 5), but also on the radius r. We start with a circle C) as in §2 above. Let 
f:R XR XR, (0,0,%) — R’,0 
be defined by 
f(t.t57) = 14) — n(t2) + r(Ni(t) — M(t). 


Then f(t,, t,,7) = 0 precisely when a circle of radius r is tangent to y, at y,(t,) and y, at y>(f,). 
We expect f (0) to be a curve in R? (coordinates (t,,t,,7r)), and the “centre” map 


c:R°,(0,0,7%) > R? 
be defined by 


C(4,%,7) =1(t) + 7N,(4) 
takes f~'(0) to the symmetry set. Thus the symmetry set can be regarded as the “projection” 
c( f—'(0)) by the map c of the space curve given by f = 0 to the plane. 

The Implicit Function Theorem can be applied to f to find out when f~‘(0) is a smooth curve 
near (0,0, 7%): we require the 2 by 3 Jacobian matrix of f to have rank 2 at (0,0, 7 )—that is, we 
require f to be a submersion at (0,0,%). (The Jacobian matrix of f has column vectors 
T,(1 — «,r), —T,(1 — «,r), N, — N,.) Then we can write down the condition for this curve to 
project under c to a smooth curve in R? (i.e., for c|f~*(0) to be an immersion): the tangent vector 
to f-1(0) at (0,0,%) must not go to 0. (An equivalent condition is that Kernel Df(0, 0, 7%) 
intersects Kernel Dc(0,0, 7) in precisely (0, 0, 0).) 


(4.1) (i) The space curve f—'(0) is a smooth curve parametrized by t, provided 1 # «,(0)%, i.e., 
provided C, is not osculating at y,(0). The same holds with 1 and 2 interchanged. Thus f—*(0) is 
smooth in the A? and A,A, cases. 

(ii) The symmetry set, namely c( f—‘(0)), is locally a smooth curve if Cy is osculating at neither 
point y,(0), y> (0) (the Aj case). In that case either t, or t, can be used as a local parameter on the 
symmetry set. Its tangent vector is along the direction T, — T,, the infinitesimal axis of symmetry 
(Fig. 13). 


Y2 
Y2 (0) 


infinitesimal axis 
of symmetry 


Nh 


y, (0) 


Fic. 13. Symmetry set touches the infinitesimal axis. 


(4.2) REMARKS. (1) In the A? case it is not difficult to determine whether the symmetry set is 
concave upwards, as in Fig. 13, or downwards. For example, we can parametrize by ¢,, writing say 
5(t,) for the symmetry set, and then the condition for upwards concavity is 


8”"(0) « (N,(0) — Np (0)) > 0. 
Calculation shows that this is equivalent to the following (all at (0,0, %)): 


k, > «, When 1/r is between k, and «,; «,<«, when 1/r isnot between k, and x5. 
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(Compare (2.2) (i) and (ii).) When «,(0) = «,(0) the symmetry set has an inflexion at uy, the 
centre of C,. (Since by (4.1)(ii) the tangent to the symmetry set is the infinitesimal axis of 
symmetry, we can also consider the curve in the dual plane given by these axes—it is the dual of 
the symmetry set. It has a cusp when the symmetry set has an inflexion, that is when the 
curvatures agree at the points of contact, as above.) Note that, by (3.1)(i), the midlocus has its 
tangent along the infinitesimal axis of symmetry when «,(0) = «,(0). 

(2) At the cost of some complication we can eliminate r and describe the symmetry set by 1¢, 
and ¢t, alone. For the equation f = 0 gives 


r=(11— %)°(M -— M)/AM Ny - 1), 
and this can be substituted in the formula for the map c. 


5. More on the midpoint locus. We now turn to some of the cases (besides (3.1)(ii)) where the 
symmetry set or the midpoint locus is not smooth—even though, in the latter case, this happens 
only for nongeneric curves (but in generic families). First, here is an entertaining example. 


(5.1) EXAMPLE. Consider the quartic curve y = x* — ax? + bx (Fig. 14). Up to rigid motion in 
the plane every graph of a monic quartic polynomial has this form (can the reader prove this?). 
Provided a> 0 there is a bitangent line, namely y = bx — 4a’, touching the curve where 
x = +(a/2)'”*. Is the midlocus smooth at the midpoint A of the bitangent? We have two 
parameters a and 6 available—surely we can adjust them so that a smooth midlocus results? 
According to (3.2)(1i) we only have to check whether the curvatures of the quartic curve are 
different at the points where x = +(a/2)'/”’. Alas! the curvatures always coincide, both equalling 
4a(1 + b*)~3/*. So the midlocus is never smooth. (Actually, it is worse than that. See (5.4)(2) 
below, and the left-hand half of Fig. 18.) 


Fic. 14. Bitangent line of a quartic curve. 


How can we get at the structure of a singular curve? In [5] we showed how to use the theory of 
unfoldings and discriminants to do just that for the symmetry set (see also [3] p. 137). However, 
the midlocus does not (so far as we know) arise in that way so that theory does not help with the 
midlocus. It has already been noted above (§3) that the map m is a local diffeomorphism precisely 
when 7, # +7,. What if this condition fails at (0,0)? There is a whole hierarchy of maps 
R’*,0 — R* which are more degenerate than a local diffeomorphism; here we shall consider only 
one of these, the fold map; the cusp map or Whitney pleat is mentioned very briefly in (5.4) (2) 
below. The fold and cusp are of great significance in the study of maps from the plane to the 
plane: in fact singularity theory can be said to have begun with H. Whitney’s proof in 1955 that, 
besides diffeomorphisms, these are the only stable maps—essentially unaltered by small perturba- 
tions. See for example [6] and [10]. 


Let ¢:R’*,0 > R’,0 be defined by $(t,,t,) = (t,,t3). This is a standard model (“normal 
form’’) for the fold map. To see what is going on it is helpful to factor the map through R°. Thus 
¢@ can be factored 
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$1 cal 


R? > R? > R? 


(t,t) > (1), 13, ty) 
(u,,U,,U3) > (m4, 4). 
Here $, maps the plane into R® as a surface and 7, projects the surface “vertically” to the plane 
again (Fig. 15). The geometry of the fold map (and the cusp or Whitney pleat) is described in [6]. 
Given a map @:R7’,(0,0) — R? it is possible to decide whether, by smooth changes of coordi- 
nates in the two copies of R*, the map @ can be converted into the normal form @ above. When 
this can be done we call @ a fold map. Because such a @ is “equivalent” to ¢ by merely changing 
coordinates, many properties of 8 can be read off from the corresponding properties of ¢. 


>, (2) 


7 VA *2 


hy 


$(2) 


xy 


Fic. 15. Fold map. 


There are two features of fold maps @ which are of special significance. The first is the kernel 
line, the line through (0,0) in the direction of the tangent vector at (0,0) sent to zero by @. For 
6 = this is clearly the ¢,-axis, in the direction (0,1). For the map m of §3 we need 
T,(0) = +7, (0) for there to be such a line at all, and then it is the line of points (4, ¢, ) satisfying 
t, + t, = 0, respectively. A curve through the origin whose tangent there happens to lie along the 
kernel line of a map @ will be sent by @ to a singular curve (Fig. 16,a) since its tangent vector at 
the origin will be collapsed to zero. Other curves will have (locally) smooth images under @ (Fig. 
16,b). In terms of the pictures of Fig. 15 for @, the kernel line is in the direction which goes to a 
vertical direction by ¢,. Curves with tangent vectors in the kernel direction go to curves which are 
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vertical at (0,0, 0), and, when the latter curves are projected by 7,, they become singular. 

The other feature of interest to us is the fold curve = in the source R’ of 6. This consists of 
points (t,,¢,) at which the Jacobian matrix of @ is singular. For 0 = $, > is the t,-axis; for 0 = m 
it is the set of (¢,,¢,) for which T,(t,) = +7,(t,). In the first case it is clear that = is a smooth 
curve; for 6 = m this needs proof. (To prove it consider the function k(t,,t,) = T,(t,) - Ny (t2). 


6(b) 


O(a) 


Fic. 16. Curves touching or not touching the kernel line. 


It is easy to check that the derivative of k is nonzero at (0,0) provided «,(0) and «,(0) are not 
both zero; clearly > = k~'(0) so this shows > is (locally) smooth provided we exclude the case 
k,(0) = «,(0) = 0. Note incidentally that k(t,,t,) = T,; - T, — 1 looks promising (or we might 
need T, - T, + 1) but fails disastrously to prove = = k~'(0) is smooth!) The image 6(2) is of 
geometrical significance since it separates points (x,,x,) in R* which have different numbers of 
inverse images under 0. For 6 = ¢, and hence for any fold map, the image @(2) is a smooth curve 
which separates those (x,, x,) with 0 inverse images from those with 2 (these have x, < 0, x, > 0, 
respectively). 

For 8 = m we need to know when m is a fold map. We shall not go into the details of how this 
is checked (see [8], p. 146) but merely state the result. 


(5.2) The map m is, at (0,0), 
(i) a local diffeomorphism if and only if T, # +T,; 
(ii) a fold map if and only if T, = +T, and x, # +k,, respectively. 


Here T.,«, are all at t, = t, = 0, of course. In (ii) the kernel line of m is the line t, = Ft, in R?. 


Let us look more closely at curves tangent to the kernel line at (0,0) of the standard fold 
map @. 

Consider in fact a curve t, = a,t} + a3t3 + --- in R? tangent to the ¢,-axis, which is the 
kernel line of ¢. The image under ¢ of this curve is the curve consisting of points 


(u,,U>) = (a, t3 + a,t3 + os ,t3). 


A smooth change of coordinates (u,,u,) — (u, — a,u,,u,) turns this into the set of points 
(a,t3 + ---,t3). Such a set of points in R* is said to have an ordinary cusp (Fig. 17) at (0,0) 
provided a, # 0. An ordinary cusp is the simplest kind of singularity a curve can have and, as the 
argument above shows, the standard fold map will take almost any smooth curve through (0, 0) to 
a curve with an ordinary cusp at the corresponding point. The latter result holds for any fold map, 
for such a map can be reduced to ¢ by changes of coordinates. (Formally, the condition for 
5:1R,0 — R?’ to have an ordinary cusp at 6(0) is that 6(0) = 0 and 6’’(0), 6’’’(0) are linearly 
independent. This can be used to show that, if @ is equivalent to @ by smooth changes of 
coordinates, then an ordinary cusp occurs on @ o 6 if and only if it occurs on ¢ ° 6.) 

Applying this to the curve g 1(0) or h~'(0) and the map m of §3, we find the following 
(compare (3.1), (3.2)). 
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us 


uy 


Fic. 17. Ordinary cusp. 


(5.3) (i) In the A? or A, A, cases (k,r #1 or kr #1), with T, = —T, and 2=1n(k, + k5) at 
t, = t, = 0, the midlocus generally has an ordinary cusp and the map m is always a fold map. 

(11) Suppose there is a bitangent line which is not bi-inflexional (kx, # 0 or x, # 0), but that 
k,(0) = (0). Then the midlocus generally has an ordinary cusp and the map m is always a fold map. 


(5.4) REMARKS. (1) It is, of course, possible to find the precise conditions for an ordinary cusp, 
but they do not seem to be very illuminating. To convince the reader of this, here is the condition 
in (11): 


(i + K,) (4K + 2A( Ki — x)) # Ai ( 17’ — Ky), 


Here, as usual, primes denote derivatives with respect to t, or t, aS appropriate, derivatives being 
evaluated at t, = t, = 0, and A is the length of the bitangent while « = «,(0) = «,(0). It is a 
curious fact that when we consider the case of the recalcitrant quartic curve (5.1), when the 
curvatures at the points of contact of the bitangent line are always equal, then, in fact, the 
non-equality condition above always fails: the midlocus never has an ordinary cusp! So from this 
point of view the family of quartic curves which are graphs of functions is extremely non-generic. 


) \ double tangent 


Fic. 18. Midlocus for two quartic curves. 
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See the left-hand half of Fig. 18. By contrast, the right-hand half shows a better behaved curve 
where the midlocus is smooth at the midpoint of the chord of contact of the double tangent. In 
fact this curve is also a quartic curve (it is an orthotomic of an ellipse) but not, of course, the 
graph of a function since it is a closed curve. 


(2) For readers familiar with the cusp map (Whitney pleat) we mention that m is a cusp map at 
(0,0) if and only if 7, = +75, «, = +«,+#0 and xj # «45 at 0. These conditions are not 
compatible with the hypotheses of (5.3). 


Suppose that 7,(0) = — 7,(0), but «,(0) # «,(0), so that by (5.2)(ii) the map m is a fold map. 
The fold curve = is those (¢,,¢,) for which 7,(t,) = —7,(t,), and m(%) is the set of midpoints 
of the chords joining y,(t,) to y,(¢t,) for such t,,t,. Note that there are not necessarily any 
bitangent circles in sight at present; for that we need also y,(t,) — y,(t,) to be perpendicular to 
T,(t,). This locus of midpoints of chords joining points of parallel tangency is then a smooth 
curve and it separates points (x,,x,) of the plane which are midpoints of two chords (joining 
points close to t, = t, = 0) from those which are midpoints of none. See Fig. 19. 


Fic. 19. Midpoints of chords of contact of parallel tangents. 


6. More on the symmetry set. When we try to apply the ideas of fold maps to symmetry sets, 
we are faced with a difficulty. For the symmetry set is the image c(f 1(0)), as in §4, where c goes 
from R? to R?. Where is there a map from R* to R7? Let 


&:R XR XR,(0,0,,) ~-R 
be defined by 
8(t.0.r) =8(t4.b)=(n-n): (1 - qT). 
Then we have: if f = 0, then g = 0; hence 
M = f—'(0) is contained in S = & *(0). 

In general S$ is a surface; when it is smooth and M is a smooth curve in S we can parametrize S$ 
locally by two variables and so treat S very much like a piece of R*. Then we are back to a curve 
in the plane. 

In fact suppose rm «,(0) # 1, while f(0,0, 7) = 0 (thus the circle Cy of radius 7% is tangent to 
y, and y, at t, = t, = 0 and is not osculating at t, = 0). Applying the arguments leading to (2.2) 


to & instead of to g, we can parametrize S locally by ¢, and r. Then using ¢, and r as local 
coordinates on S, and writing ¢ for the restriction of c to S, we have: 


E(t4),7) =11(4) + 7N,(4). 
The curve M can be thought of as a curve in the (t,,r) plane and ¢(M) is the symmetry set 
(Fig. 20). 
Notice that é takes lines parallel to the r-axis to normals to y,, and the t,-axis to the curve y, 


itself. The fold set = of @ is found by the condition that the 2 by 2 Jacobian matrix of ¢ is 
singular, i.e., that the columns 
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C(t,-axis) = y, 


local coordinates 


on S 


C(r-axis) 


Fic. 20. Symmetry set via the map ¢. 


T,(1 —_ rk, ) and N, 
are dependent. This requires 1 = rk,, so 
x= {(t,,r):rk,(t,) = 1}, 
and ¢(2) is precisely the evolute of y,. We find: 


(6.1) The map ¢ is, att, = 0, r=n, 

(i) a local diffeomorphism if and only if «,r # 1 if and only if Cy is not osculating at t, (A? case), 

(ii) a fold map if and only if x, = 1, x, #0 if and only if C, is osculating at t,, not a vertex 
(A, 4). 


We now want to consider the symmetry set, which is ¢(M). When does the tangent to M lie 
along the kernel direction of ¢, assuming we are in case (ii) or (iii) above? The kernel direction is 
(1,0) since at (0,7%) the Jacobian matrix of ¢ has columns 0, N,(0). The tangent direction at 
(0,0, 7%) to M in (t,, t,,7)-space is (1, 0,0) and of course this corresponds exactly to the direction 
(1,0) in the (7,, 7) plane. Thus M is always tangent to the kernel direction of ¢. This shows: 


(6.2) The symmetry set 
(i) is smooth at an A? point (compare (4.1) (ii); 
(ii) generally has an ordinary cusp at an A,A, point. 
See Fig. 21, where we assume «,(0) and «{(0) are > 0, and the evolute of y, is drawn dashed. In 


fact, more delicate methods show that the symmetry set a/ways has an ordinary cusp at an 4,A, 
point. 


Y2 


A,-contact 


Y2 


or 


A,-contact 
Fic. 21. Cusp on the symmetry set. 


In the bi-osculating (42) case neither M nor S above is smooth, But in fact we can 
“recognize” f to be a map rather like a fold map. 
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(6.3) Assume that, at (0,0,7%), we have 1 = rk, =rk,, «i #0, «5 #0 (AS case). Then by 
smooth changes of coordinates in R? and R? the map f can be written as 
(x1, Xz, X3) > (x1, x3 + x3), 
where the sign is + or — according as x{(0)k5(0) is > 0 or <0. 
Such an f is a “generalized fold map”, and from (6.3) we deduce that M =f 1(0) is, near 
(0,0, 7), 


(i) an isolated point when «/(0)«3(0) > 0; 
(ii) two smooth curves crossing when x{(0)«4(0) < 0. 


FIG. 22 


In fact, S = g ‘(0) is in these cases, respectively, a line (the r-axis) or a pair of smooth surfaces 
intersecting in the r-axis (compare (2.2)). See Fig. 22. In case (i) the symmetry set, which is ¢(M), 
has an isolated point. By carefully analyzing case (ii) we find that the symmetry set consists of two 
cusps (generally ordinary cusps) pointing the same way, as in Fig. 23, where we take k,,,,«{ all 


> 0 and «4 < 0. The evolutes of y, and y, both pass through the centre of C) now; they are 
drawn dashed. 


Y2 


A, contact 


or 


A, contact 


Fic. 23. Symmetry set: one bi-osculating case. 


7. Families of curves. Finally we say a brief word about families of curves—that is, we 
suppose our starting curve y to be varying in a one-parameter family of smooth simple curves. 
Suppose we have a bi-osculating circle (A5 case) as in (3.1)(ii) and (6.3). What effect will a small 
perturbation of the curves y, and y, have on the midlocus and the symmetry set? We merely draw 


P. J. GIBLIN AND THE LATE S. A. BRASSETT [December 


bf —=— ee —-- nothing 


Fic. 24. Transitions on midloci (above) and symmetry sets (below) in one bi-osculating case. 


x = dC 
A 


Fic. 25. More transitions: the other bi-osculating case. 


NY _. 
/™ 


Fic. 26. A moth in the act of disappearing. 


some pictures here, for a full discussion is beyond the scope of this article. In Fig. 24, top, a 
transition is shown on the midlocus whereby a closed curve shrinks and vanishes (the closed curve 
may not be simple—it could be more like a figure eight for example). In Fig. 24, bottom, we have 
what we call a “moth transition” on the symmetry sets: the moth shape shrivels to a point as in an 
intense candle flame and disappears! These assume «jx > 0, as in (6.3)(i), which gives the 
isolated point in each picture. For «j«4 < 0 the situation is illustrated in Fig. 25, the centre picture 
in each row coming from the A} case. Finally Fig. 26 shows an explicit example of a moth 
transition on a family of quartic curves. For clarity only part of the symmetry set is drawn in this 
diagram; as usual the evolute is drawn as a dashed curve. 
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R. VOLPE 
Via O. Morisani 13, Napoli, Italy 


A number of authors have addressed “the well-known conviction that geometry may be built 
without points”’ ({5]; see also the other references). In this paper we propose an approach to metric 
spaces, and thence to geometry, based on the primitives: solid, inclusion, and distance between 
solids. 


DEFINITION. A space of solids is a set S of elements called solids, together with a partial order 
relation < on S, called inclusion, and a function 6 from S X S to the nonnegative real numbers, 
called distance, such that, for all solids u,v, w: 

(1) if u < w, then 6(u,v) > (vu, w), 

(2) the diameter A(v) = sup{8(p,g): p < vu and g < v} is finite, and 


5(u,v) + 8(v,w) + A(v) > 8(u,w), 


(3) u < v if and onlyif 8(u’,v) = 0 for every wu’ included in u, 
(4) if e > 0, then there are solids u’, v’ such that 
u’<u, v’<v, A(uw’)<e, Al(v’)<e, and 8(u’,v’) = 8(u,v). 
A point in a space of solids (S, < , 6) is a minimal element of (S, <). An s-point, where ¢ > 0, is 
a solid u such that A(u) < «. 


The following statements are easy consequences of the definition. 
(a) For all u and v, 8(u,v) = 8(v, u). 
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in Naples (1600-1700). 
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(b) The solid uw is a point if and only if A(uv) = 0 

(c) The set of all points in S$ is a metric space with respect to 8. 

(d) If u’ < wu and vw’ < v, then 8(u’, v’) > 8(u, v). 

(©) For all u,u’, vv’, 

5(u,v) — 6(u’,v’) < 8(u,u’) + 8(v, v0’) + A(u’) + A(v’). 
Hence if these solids are e-points, then 
|[8( u,v) — 8(u’,v’)| < 8(u,u’) + 8(v,v’) + 2e. 

Obviously a metric space (M,d) is a space of solids with S = M, 6=d, and <_ simple 

equality. In this trivial example, every solid is a point. For a more interesting example obtained 


from (M, d), let S be the set of all nonempty compact subsets u of M such that u°~ = u, where ° 
denotes interior and ~ denotes closure; define 


d(u,v) = min{d(P,Q):PEeu,Qeov}, 


and let <_ be set inclusion. We show that (S, < ,&) is a space of solids, called the space 
associated with (M, d). 

Axiom (1) is clear. Since v is compact, A(v) < oo. To complete the proof of (2), choose U € u, 
V, © v, W Ew, V, © v such that 


5(u,v) = d(U,V,) and &(w,v) = d(W,V,). 
For e > 0, let 
D(V,) = {P €v°: d(P,V,) < e} and D(V,) = {P € v°: d(P,V,) < e}. 
Then v, = D(V,)” and v, = D(V,) are solids contained in v. Choose V/ © v, and V; € v, such 
that d(v,,v,) = d(V/, V7). Then 
d(V,,V,) < dV, Vi) + d(VYi,Vz) + d(Vi,%,) <e + 8(,,%) +¢ 
for all e > 0; so d(V,,V,) < 8(v,, v,) < A(v). Hence, 
5(u,w) < d(U,W) < d(U,V,) + d(V,,V,) +d(V,W) < &(u,v) + &(v,w) + A(v). 


For (3), it is obvious that u < v implies 6(u’, v) = 0 for all u’ < u. Conversely, suppose that 
5(u’,v) = 0 for all u’ < u but that u ¢ v. Choose P in u° \ v and e > 0 such that C= {X€ 
M: d(X, P) < €} is disjoint from v. Let B= {X € u°: d(X, P) < e}. Then u’ = B” is a solid 
and u’ (1 v = ©. Then 8(u’,v) > 0, a contradiction; hence, u < v. 

For (4), given u, v, and e, choose U and V such that (u,v) = d(U,V), let DOU) = {xX € 
u°:d(X,U) < e} and D(V) = {X € v°: d(X,V) < e}. Then u’ = D(U) and v’ = D(V) are 
e-points and 8(u’, v’) = d(u, v). 

Recall that a metric space (M, d) is perfect if it has no isolated points. 

It is easy to see that 


(f) a metric space is perfect if and only if its associated space of solids has no points. 


The association of a space of solids with a metric space is characteristic of all spaces of solids, 
in the following sense. 


THEOREM. Every space of solids (S, < ,8) is isomorphic to a space of solids (S’, C , 8’), where 
S’ is a family of nonempty subsets of a metric space (M,d) and 8’[u, v] = inf{d( P,Q): P © uand 
Q€vd}. 

Proof. Let M be the set of all sequences P = (p,) of solids such that p,,, <p, for all n and 
lim A(p,) = 0. For P and Q = (q,) in M, define d(P,Q) = lim8(p,, q,). Since (6(p,, 4,)) is 
increasing and bounded above by 8( p,, q,) + A(p,) + A(q); d is defined on M X M. It is cay 
to show that (M, d) is a pseudometric space. Let M = M/, where PZO means d( P,Q) = 
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and for P,Q € M let d(p,Q) = d(P,Q), where P is in the equivalence class P and Q is in the 
equivalence class Q. Then (M, d) is a metric space, called the space associated with (S, < , 8). 
Define the relation belongs to, B C M X S, as follows: PBu means there is a sequence (p,,) in P 
with p, = u. Define a function f from S§ into the set of all subsets of M by 


f(u) ={P eM: PBu}. 
Let S’ be the image of f. We must prove, for all u,v in S: G@) f(u) # @, (ii) u< v => f(u) C f(v), 
(1) d(u,v) = 8 f(u), f(v)], Gv) f is injective. 
(1) By repeated use of (4) we may build a sequence (p,,) such that p, = u, p,,1 < p,, and 
A( p,,) = 1/(n + 1). Then the equivalence class of this sequence is in f(1). 
(i) If u < v, then any sequence u = p, > p, > -:: may be extended tov>u>p,> ::- 


(iii) Suppose PBu and QBv. Choose sequences P = (p,) and QO = (q,) with p, = u, gq, =v. 
Then 


(u,v) = 6(p,,%) < d(P,Q) = d(P,Q); 
SO 
d(u,v) < inf{ d( P,Q): PBu and QBv} = 8 | f(u), f(v)]. 
Now by (4) we may build ( p/),(g/,) such that 
Pi = Us = Phar SPs Mra < Gs MC mm) <1 /n, AC gn) <1/n, 


and 


8( Pits Qv1) = 8( ph, 9%) = (u,v). 
Then these sequences represent equivalence classes P’,Q’ such that 6(u, v) = d(P’, Q’). Thus 
5(u,v) > inf{ d( P,Q): PBu and QBv}. 


Hence d(u,v) = 87 f(u), f(v)]. 
(iv) If u + v, then we may assume u <¢ v. By (3) there exists u’ < u such that 8(u’,v) > 0. 
Then by (iii), 


inf{d( P,Q): PBu’ and QBv} > 0; 
hence f(u’) < f(v). But by (ai), f(u’) < f(u); hence f(u) # f(v). 


FURTHER REMARKS. Since euclidean, elliptic, and hyperbolic geometries may be defined solely 
in terms of axioms about metric spaces [1], the definition above leads to axiomatizations of 
geometries without the primitives “point”. It remains to reformulate the resulting axioms in a 
more natural, less convoluted way. As one example, define three e-points u,v,w in a space of 
solids (S, < ,8) to be e-collinear if one of the following holds: 


5(u,v) + 8(v,w) — &(u,w) < 3e, 
5(u,w) + 8(w,v) — &(u,v) < 3e, 
5(v,u) + 6(u,w) — 8(v,w) < 3e. 


Then the axiom 

(A) there exist three noncollinear points 
holds in the associated metric space (M, d) if and only if the axiom 

(A’) for some positive ¢ there exist three &-points that are not «-collinear, 
holds in (S, < ,8). 


Proof. Suppose that P, Q and T are noncollinear elements of M; let (p,), (q,,) and (¢,,) be in 
P, QO and T. Then there exists e > 0 such that 
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d(P,T) + d(T,Q) — d(P,Q) > 3e, d(P,Q) + d(Q,T) — d(P,T) > 3e 
and d(Q,P) + d(P,T) —d(Q,T) > 3e. 


It follows that there exists m © N such that 


A( pi) < €, MC gn) < @, ACtn) < €, 8C Pas In) + BC Gns tn) — 8 Pas tn) > 38, 
B( Piotr) + 8( ts dn) — 8( Pas dn) > 3e and 8( 4, Pr) + 8( Pasta) — 8Cdusty) > 3e 
for every n > m. This proves that p,,, qg,, and ¢,, are not e-collinear e-points. 


Conversely, suppose that there exist three e-points u, v and w that are not e-collinear. Thus, if 
PBu, QOBv and TBw, we have that 


d(P,T) + d(T,Q) —d(P,Q) > 0. 
Indeed, observe that 
5(u,w) + 8(w,v) — 8(u,v) —2e re 


Now, by (iii), there exist UBu, U’Bu, VBv, V’Bv, WBw, W’Bw such that 
5(u,w) = d(U,W), 8(w,v) = d(W’,V), 8(u,v) = d(U’,V’). 


Then 
d(P,T) + d(T,Q) —d(P,Q) > 8(u,w) + 8(w,v) — d( P,Q) 


> 8(u,w) + 8(w,v) —[d(P,U’) + d(U’,V’) + d(V’,Q)]| 


> 8(u,w) + 6(w,v) —[e + 8(u,v) +e] = 8(u,w) + 8(w,v) — 8(u,v) —2e >e> 0. 
In the same manner one proves that 
d(P,Q)+4d(Q,T) —d(P,T)>0 
and that 
d(Q,P)+d(P,T) —d(Q,T) > 0. 


In conclusion, P, QO and T are noncollinear. 
Axiom A can also be reformulated with no mention of points at all. Define three solids u,v, w 
to be collinear if 


5(u,v) + 6(v,w) — &(u,w) < 3m 
or 

8(u,w) + 8(w,v) — (u,v) < 3m 
or 

5(v,u) + 6(u,w) — 8(v,w) < 3m, 
whete m = max[A(u), A(v), A(w)]. Then A holds in (M, d) if and only if 

(A”) there exist three noncollinear solids 
holds in (S, < , 4). 
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CENTROSYMMETRIC (CROSS-SYMMETRIC) MATRICES, 
THEIR BASIC PROPERTIES, EIGENVALUES, AND EIGENVECTORS 


JAMES R. WEAVER 
Department of Mathematics / Statistics, The University of West Florida, Pensacola, FL 32514 


1. Introduction. For a number of decades symmetric matrices over the real field have been 
studied intently by every beginning linear algebra student as a class of matrices which are defined 
by the property of being symmetric about their main diagonal or by the fact that you can 
interchange the rows and columns of a given symmetric matrix and it remains unchanged. In [1] 
(p. 124) A. C. Aitken defines a centrosymmetric matrix P and this definition coincides with the 
definition given by Graybill of a cross-symmetric matrix [6] (p. 361). 


DEFINITION 1. An n X n matrix P over the real field is centrosymmetric if 


Pi = Pyeitijn-js4» forl <i,j <n. 


A closer look at Definition 1 reveals that a centrosymmetric matrix is nothing more than a square 
matrix which is symmetric about its center. 


b a be 
(? ) and dea 
boa 

c boa 


are examples of centrosymmetric matrices. This type of matrix is very important in the study of 
certain types of Markov Processes because they turn out to be transition matrices for the process. 
As we look at centrosymmetric matrices we will find that they have many interesting properties, 
comparable in some ways with symmetric matrices, but yet in some ways they are very different 
from symmetric matrices. For instance; it will be shown that they form an algebra. 


2. Historical Notes. M. Iosifescu points out in [7] that “The concept of Markov dependence 
appears in an explicit form in 1906 in a paper [12] of the Russian mathematician A. A. Markov 
(1856-1922). In a series of papers starting with [12] he studied various properties of sequences of 
dependent random variables which in his honor are nowadays called finite Markov chains. 

Almost at the same time, studying the problem of shuffling the card deck, the French 
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mathematician Henri Poincaré came across sequences of random variables which are in fact 
Markov chains. Nevertheless, Poincaré did not undertake a systematic study of those sequences so 
that the recognition accorded to Markov is entirely justified.” 

S. Wright [15] introduced the idea of random genetic drift in 1931 and the mathematical model 
of this physical phenomenon is a Markov process and was developed by G. Malécot [11] in 1944. 
M. Iosifescu [7] examines a special case of this phenomenon, which consists of a population 
consisting of individuals that are able to produce a single type of gamete. The transition matrix Q 
for this Markov chain is an (n + 1) X (n + 1) centrosymmetric matrix where 


n\(n—i\" /(i\? , 
Ory = (4) (=) ) O<i,j<n. 


n n 


It should be noted that the entries of this transition matrix are binomial probabilities. R. G. 
Khazanie [8], R. G. Khazanie and H. E. McKean [9], and F. Barnett, J. S. Frame, and J. Weaver 
[3] give an explicit formula for the eigenvalues of this matrix and in [3] an explicit formula for the 
eigenvectors is found. 

M. Kimura [10] studied a Markov process in his article “Some Problems of Stochastic 
Processes in Genetics” whose transition matrix R is an (n+ 1) X (n+ 1) centrosymmetric 
matrix. The matrix R is given when a = 2 by 


n—i)a ia — 
Raa = (CO \('*) [es O<i,j<n. 


It should be noted that the entries of R are hypergeometric probabilities. 

I. Schensted [13] studied the same centrosymmetric matrix R as M. Kimura [10] in her paper 
entitled “Model of Subnuclear Segregation in the Macronucleus of Ciliates”. A formula for the 
eigenvalues of this centrosymmetric matrix is given when a = 2 by both Kimura and Schensted 
but neither gives a formula for the eigenvectors. In [14] J. Weaver finds a formula for the 
eigenvalues and eigenvectors of R for any positive integer a > 1. Fora =1,R =I. 

I. J. Good points out in [5] that a symmetric Toeplitz matrix is an example of a matrix which is 
both symmetric and centrosymmetric. He also states in [5] that “Toeplitz matrices arise as discrete 
approximations to kernels k(x,t) of integral equations when these kernels are functions of 
|x — t|. Similarly if a kernel is an even function of its vector argument (x, 7), that is k(x,t) = 
k(—x, —t), then it can be discretely approximated by a centrosymmetric matrix.” 

With this historical background as a backdrop, we pursue a more detailed study of centrosym- 
metric matrices. 


3. Eigenvalues and Eigenvectors of Small Centrosymmetric Matrices with Their Row Sums 
Equal. Note that the 2 x 2 centrosymmetric matrix (‘ ’) has a + b and a — Db as its eigenval- 


a b c 
ues with eigenvectors (1,1)' and (1, —1)‘, respectively. The centrosymmetric matrix | ¢ | 

c b a 
whose row sums are equal has eigenvalues a+ b+ c, a-—c, and e— b with eigenvectors 


(1,1,1)’, 1,0, —1)’, and (b, —2d, b)’, respectively, if b or d is not equal to zero. It is regrettable, 
but these nice formulas cannot be extended to the 4 x 4 case. 

Once some basic information about centrosymmetric matrices is obtained, some theorems are 
given which are very useful when it comes to finding the eigenvalues and eigenvectors of any 
centrosymmetric matrix. 


4. Basic Properties of Centrosymmetric Matrices. As opposed to the definition of centrosym- 
metric matrices given by A. C. Aitken [1] and F. Graybill [6], a slightly different definition will 
now be given. 


DEFINITION 2. If P is an m X n matrix, then it is centrosymmetric if 
P =P 


7 m-itin-j+i1 forl<i<gm and l<j<n. 
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It is not unusual to talk about a symmetric matrix as a matrix which is equal to its transpose. 
After the “reflection” of a matrix is defined we will be able to say a matrix is centrosymmetric if it 
is equal to its reflection. 


DEFINITION 3. If P is a m X n matrix, then the reflection of P is P*® and 


[Pj = Pa-istn—yj41 forl<i<m and 1l<j<n. 
If P =(‘ ° ;) then P*® -(/ , ‘| and an immediate consequence of Definition 3 is 


PROPOSITION 4. An m X n matrix P is centrosymmetric if and only if P® = P. 


One observation that simplifies work with centrosymmetric matrices is given once the matrix J 
is defined. 


DEFINITION 5. Let J be the n X n symmetric elementary matrix defined by J, ; = 6, ,_,41, for 


1 <i, j <n, where 6, , is the Kronecker delta. 


Note that J has ones along the secondary diagonal and zeros elsewhere. The order of J will 
differ in what is to follow, but its order will usually be clear from the context in which it is used. J 
is an orthogonal matrix which is symmetric and hence J? = J, where J is the identity matrix. 
Premultiplication of a matrix by J will reverse the order of the rows and postmultiplication of a 
matrix by J will reverse the order of the columns. Using the matrix J one obtains 


PROPOSITION 6. An m X n matrix P is centrosymmetric if and only if JP = PJ. 
Proof. If JP = PJ, then P = JPJ and 
P= [JPI]i,; =[ PJ] m-141,) = Pn-i+i,n—-j41 forl<i<m and l<j<n. 


Therefore P is centrosymmetric. 
Conversely, if P is centrosymmetric, then it is easy to show that JP = PJ. 
Using Proposition 6, one is able to obtain 


THEOREM 7. The set of n X n real centrosymmetric matrices form an algebra. 


Proof. If P and Q are n X n real centrosymmetric matrices, then 
J(P+Q)J=P+Q, JPOJ = PQ, 


and J(cP)J = cP for a scalar c. It then follows that the set of n Xn real centrosymmetric 
matrices form an algebra. 

Two easy consequences of Proposition 6 are that if P is a nonsingular n X n centrosymmetric 
matrix, then P~' is a centrosymmetric matrix, and if P is an n X n centrosymmetric matrix, then 
P’ is also a centrosymmetric matrix. 

In [5] I. J. Good shows how to find the inverse of an n X n nonsingular centrosymmetric 
matrix and in [1] A. C. Aitken shows how to find the determinant of an n X n centrosymmetric 
matrix. In [4] A. Cantoni and P. Butler present information about n X n matrices which are 
symmetric and centrosymmetric. Many of the results established by A. C. Aitken [1], A. Canton 
and P. Butler [4], and I. J. Good [5] make extensive use of the following orthogonal transforma- 
tion Q. 


DEFINITION 8. If m = 25s is a positive even integer, then 


IT — 
o-(3(7 ~%), 
and if n = 2s + 1 is a positive odd integer, then 
IT OO -J 
Q=\s10 v2 of, 
J 0 I 
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where 7 and J are s X s matrices. 


In either case it is easy to see that Q is an n X n orthogonal matrix which diagonalizes J. 
Using the appropriate Q on an n X n centrosymmetric matrix one is able to prove 


THEOREM 9. (a) If P -(4 4 is ann X n centrosymmetric matrix with n = 2s and A, B, C, 
and Ds X s matrices, then P is orthogonally similar to 


oro - (4 + JC 0 ) 


0 A-—JC 
A x B 
(b) IfP=|» @ yw isann X ncentrosymmetric matrix withn = 2s +1, A, B, C, and D 
Cc Jx D 


s X S matrices, x is as X 1 matrix, y is a1 Xs matrix, and q is a scalar, then P is orthogonally 
similar to 


A+JC y2x 0 
Q'PQ = y2y q 0 
0 0 A-—JC 


Proof. The proof is given for part (a) and may also be found in [4]. Since P -(4 > | is 


centrosymmetric, 
(2 10 3-00 ae 3) 
C D})\J O J O;\C DY} 


Hence BJ = JC and DJ = JA. It then follows that B = JCJ and D = JAJ. Thus P=(* 7°") 
and multiplication gives 


A — ° ic) 
JCI 
yt 
JAS 


For part (b) one shows that P =| » 


Cc 


q and multiplication gives 
Jx 


A+JC y2x 0 
Q'PQ=| y2y — q 0 
0 


0 A-JC 
Theorem 9 is demonstrated for the 3 X 3 and 4 X 4 cases. 
v2 0 1f2\fa b c\{[1f2 0 -1~2 atc y2b 0 
0 1 0 ded 0 1 0 |=| 2d e 0 |, 
-1/fV2 0 1f2}\e b af\1f2 0 12 0 0 a-c 
1/y2 0 0 1/2\{a b c« d\{[1f2 0 0 -1//2 
0 1/v2 1f2 ~—sO0 e f gih 0 1/2 -1/2 0 
0 -1/72 1/2 0 g f e 0 1172 1/2 0 
—1//2 0 0 12/\d c b al\1fy2 0 0 1/¥2 
at+d b+ec 0 0 
et+h ft+g 0 0 
0 0 f-g e-h 


0 0 b-—c a-d 
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Using the partitioned matrix Q’PQ in the respective cases when P is either of even or odd 
order, A. C. Aitken [1] shows that in the even case, using the Laplacian expansion, that 


|P| = |O'PQ| =|A + JC||A — JC|. 


In the odd case, 


A+JC v2x 
v2y 


In [5] I. J. Good shows in the even case that if P -(4 om, is a nonsingular matrix, then since 


0 # |P|=|A + JC||A —JC|, A + JC and A — JC are nonsingular matrices, and 


|P| = |Q'PQ| = |A — JC]. 


—1 
p-l =i (A + JC) 0 | 
0 (A —JC)* 
, ; ; A+JC 2x A+JC  y2x 
In the odd case, if P is nonsingular, then 0 # |P| = |A — JC| and ( and 
y2y q v2» q 
(A — JC) are nonsingular matrices. Therefore 
A+JC y2x)\_ 0 
P'=Q'|\ y2y gq 0 Q. 
0 0 (A -—JC) 


The formulas given in [5] by I. J. Good are more detailed and make the computation of the inverse 
of a centrosymmetric matrix a lot easier. 


5. Eigenvalues and Eigenvectors of a Centrosymmetric Matrix. Using the matrix Q described 
in Definition 8 of this paper, and observing that the n X n matrix P is orthogonally similar to 
A+JC  y2x 0 


( “we 4 ’ ic) in the even case and | 2, q 0 in the odd case, one can easily see 
0 0 A—JC 
that the eigenvalues of P are the same as those of A + JC and A — JC in the even case and the 
A+JC 2 . ° ge 
same as those of ( A | and A — JC in the odd case. It needs to be noted at this time that 
y q 


I. J. Good goes into more detail in describing the eigenvalues of a centrosymmetric matrix on 
p. 928 of [5]. 

After the eigenvalues of a matrix have been found, one is confronted with the problem of 
finding the eigenvectors. In [2] A. L. Andrew presents the following definition. The two results 
that follow may be found in [2] and [4]. 


DEFINITION 10. An n dimensional vector X is symmetric if JX = aX with a= 1 and is 
skew-symmetric if a = —1. 


Using Definition 10 one easily obtains 


THEOREM 11. If P is ann X ncentrosymmetric matrix, then the eigenvectors corresponding to the 
eigenvalue X, where dim NS(AJI — P) = 1, are either symmetric or skew-symmetric. 


Proof. If X is an eigenvector of P corresponding to A, then PX = AX and PJX = JX. It 
then follows that aX = JX for some scalar a not equal to 0, and a is also an eigenvalue of J. 
Since J is an orthogonal matrix, its eigenvalues are +1, and so a= +1. Therefore JX = +X, 
and X is either symmetric or skew-symmetric. 

A corollary of Theorem 11 which is useful in computing eigetivectors of a centrosymmetric 
matrix for a given eigenvalue A, when dim NS(AJ — P) > 1, is given below. 


CoROLLARY 12. If P is a centrosymmetric matrix and X is an eigenvector of P corresponding to 
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the eigenvalue X, then JX is also an eigenvector of P corresponding to X. 


Proof. If PX =X, then PJX = AJX and JX is an eigenvector of P corresponding to the 
eigenvalue A. 


1 0 0 


6. Examples. If P =| « ©» «|, where the row sums are equal, then (0,1, 2)’ is an eigenvector 
0 0) 1 
of P corresponding to the eigenvalue 1. From Corollary 12 it is easy to see that (2,1,0)’ is also an 


eigenvector of P corresponding to the eigenvalue 1. 

In papers [3] and [14] explicit formulas are given for the eigenvalues and eigenvectors of two 
classes of centrosymmetric matrices. It is shown in [3] that A, = 1,/n', for 0 < t <n, are the 
eigenvalues of the centrosymmetric matrix P, where 


n\(n—i\"~/(i\! - 
Para = (4) n (~] ’ O<i,j<n. 


This information may also be found in [7], but Theorem 2 of [3] gives an explicit formula for the 
eigenvectors of P. For n = 8 and t=7, A, = 8,/8' = (8-7-6-5-4-3- 2)/8’ is an eigen- 
value and each (i + 1)th entry of an eigenvector X, is any convenient multiple of the determinan- 
tal expression 


1 1 —(n — 3),4n~4 0 
Ko=(5n-I(n— ai (n -i)i n—1 1 —(n—5),n72}. 
(n — i)? (n—1)° 3n — 5 


The first 6 entries of X, are computed and, after suppressing the factor 315/2'*, are found to 
be 0, 2835, — 7552, 7705, 0, and — 7705. Since dim NS(P — A,J) = 1, —7705 = a(7705) and 
a = —1. From Theorem 11 it then follows that 7552, —2835, and 0 are the last three entries 
of X,. 

In [14] a centrosymmetric matrix P is given by 


n—i)a\{j 

Staal (“, - \('*) /(). O<t,j<n. 

The eigenvalues of this matrix are given by A, = (a‘n,)/(an), for0 <t<n and a a positive 
integer greater than 1. Theorem 2 of [14] gives an explicit formula for the entries of the 
eigenvectors for each eigenvalue. In the case n = 7 and t = 4 an eigenvalue is A, = (a*7,)/(a7), 
and the first 5 terms of an eigenvector X, corresponding to X, are found to be 0, 90(a — 4), 
5(a + 1), —81(a — 8/27), and —81(a — 8/27). Since X, = aXf, a = 1, and the last three terms 
are 5(a + 1), 90(a — +), and 0. 


7. Conclusion. In conclusion it is worthwhile to note that if @ is the mapping defined by ¢: 
P — Q'PQ, where P is a centrosymmetric matrix and Q is the orthogonal transformation defined 
in Definition 8, then ¢ is an isomorphism from the algebra of centrosymmetric matrices to the 
direct sum of two full matrix algebras. 


THEOREM 13. The algebra of all real centrosymmetric matrices of order n is isomorphic to the 
direct sum M,_,(R) ® M,(R) (where k = [n/2)) of full matrix algebras. 
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UNSOLVED PROBLEMS 
EDITED BY RICHARD GUY 


In this department the MONTHLY presents easily stated unsolved problems dealing with notions ordinarily 
encountered in undergraduate mathematics. Each problem should be accompanied by relevant references (if any 
are known to the author) and by a brief description of known partial results. Manuscripts should be sent to 
Richard Guy, Department of Mathematics and Statistics, The University of Calgary, Calgary, Alberta, Canada 
T2N 1N4. 


MONTHLY UNSOLVED PROBLEMS, 1969-1985 


RICHARD K. GUY 
Department of Mathematics and Statistics, The University of Calgary, Calgary, Alberta, Canada, T2N 1N4 


As usual in these updating articles in odd-numbered years, references in brackets [1983, 684] 
are year and page numbers of this MONTHLY, while references in parentheses (1981) are to the 
bibliography at the end; items not yet published are labelled (tbp) or (wrc), according as they are 
likely to be published formally or to remain as written communications. 

In order to enhance the usefulness of the bibliography, we have added Math. Reviews numbers 
and a bracket indicating where the problem originally appeared in the MONTHLY. 

Carmichael’s problem, find » such that p(x) = 7 has a unique solution, x, where p(x) is 
Euler’s totient function, was mentioned by Klee [1969, 288]. The unitary analog (a unitary divisor, 
d, of n, is one where d and n/d are coprime) was discussed by Ismail and Subbarao (1976), a 
reference we didn’t notice earlier. 

Graceful numbering of trees and other graphs, an offshoot of the Kotzig-Ringel problem asked 
by Duke [1969, 1128] continues to generate papers. Label the m + 1 vertices of any tree with the 
integers 0,1,..., so that the differences along the n edges are 1,2,...,n. Prospective authors are 
urged to consult the existing literature. My bibliography now contains 109 items; recent additions 
are Poljak and Sura (1982), Bloom and Hsu (1983) and Grace (1982, 1983). 

Ogilvy [1970, 388; 1975, 996; 1983, 684] asked about the convergence of z,,, = c+ z, and 
partial answers were given by Gerber and by Rohde. Peter Walker (1983) notes theorems implicit 
in Burckel (1981) who quotes Macintyre (1966): 
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Theorem 1: Let f be an analytic map of the disc D = {z: |z| < 1} to itself which is not a 
rotation, and with f(0) = 0, then for any z) in D and z,,, =f(z,), n > 0, we have z, > 0 as 
n — oo. Theorem 2: Let f be an analytic mapping of a simply connected region G C C to itself 
with f(G) # G, and suppose that f has a fixed point at / in G, then for any z, in G, the sequence 
defined by z,,, = f(z,) for n > 0 converges to / as n > oo. 


We can now give the reference to Goodey’s (1984) confirmation of Peterson’s [1972, 505; 
1983, 684] conjectures. If S,C are non-coincident planar convex sets, then they have equal width 
in every direction just if a(S,C’), the number of connected components of dS \ dC’, is even or 
infinite for all translations C’ of C. . 

In confirmation of a conjecture of Erdés and Guy [1973, 52], Leighton (1981) proves that the 
crossing number of a graph with n vertices and k > 4n edges is at least k?/375n7. A similar 
theorem w&s reported [1981, 757] to be in preprints which were later superseded by Ajtai, Chvatal, 
Newborn and Szemerédi (1982) who prove “at least k*/100n* edge-crossings and fewer than 
10'°” crossing-free subgraphs.” The latter result settles a conjecture of Newborn and Moser 
(1980). 

Detlef Seese’s results on the computability of the crossing number and rectilinear crossing 
number of a graph have also been obtained by Garey and Johnson (1983), who also prove that the 
problem of calculating the crossing number of a graph is NP-complete. Seese uses this result to 
show that the rectilinear crossing problem is also NP-complete. He mentioned this at ICM, 
Warsaw, but doesn’t yet plan formal publication. 

Jan van Leeuwen sends a useful short history of Nash’s reachability problem [1973, 292]. He 
himself (1974) solved the problem for dimension < 3, and Hopcroft and Pansiot (1976) for 
dimension < 5. The complete problem, for every dimension, was settled by Ernst Mayr (1980, 
1981, 1984); and see Mayr and Meyer (1981). 

We indicated [1979, 848] that our earlier announcement [1977, 808-809] of the solution was 
premature. Grabowski (1980) obtained “a positive solution to a decision problem left open by 
Karp and Miller (1969) and Hack (1976) and posed explicitly by Landweber and Robertson 
(1978): the problem of persistence of a vector addition system (or a Petri net).” Incidentally it 
gives a positive solution to the reachability problem for persistent vector addition systems. See 
also Kleine-Biimning (1980). Kosaraju (1982) has provided a somewhat more elementary proof. 

A famous theorem of Vizing is that if the maximum valence of a graph is p, then the 
edge-chromatic number is p or p + 1. A graph of edge-chromatic number p + 1 is called critical if 
deletion of any edge reduces its edge-chromatic number. Wilson and Beineke [1976, 128] conjec- 
tured that there are no critical graphs with an even number of vertices. An infinity of counterex- 
amples was found by Goldberg in 1978; see Goldberg (1981) or Chetwynd and Wilson (1983). 

Alter [1980, 43] revived the thousand-year-old congruent number problem: which integers a 
allow simultaneous solution of the diophantine equations 


2 2 


x*+ay?=2z? and x*-ay’=??. 


He conjectured that if a = 5, 6 or 7 (mod 8), then a is congruent in this sense. We reported 
[1983, 685] that Tunnell had proved that all sixteen numbers = 1, 2, or 3 (mod 8) for which a 
blank was left in Table 1 [1981, 759] or in Table 7 of Guy (1981), were non-congruent, but we did 
not give the reference (1983). 

Jean Lagrange, in a letter dated 83:02:10, reported that the nine remaining numbers = 5 or 7 
(mod 8), of unknown status in the same table, are now known to be congruent, so the conjecture is 
verified for a < 1000. 

Gerd Hofmeister notes that it would have been more appropriate, in my comments [1981, 759] 
on the postage-stamp problem [1980, 206] to have quoted Redseth’s papers (1981, 1982) on 
h-bases, rather than his papers on the Frobenius problem. 

Classification of dendrograms is of interest to theoretical biologists, so Smith and Waterman 
asked [1980, 552]: how alike are two trees? They quoted their own paper (1978), but Jarvis, 
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Luedeman and Shier (1983, 1984) have since found counterexamples to Theorem 3. These also 
invalidate the proofs of Theorems 4 and 5: neither rigorous proofs nor counterexamples to these 
theorems have been found. See also Boland, Brown and Day (1983), Brown (1982), Culik and 
Wood (1983), Day (1983), Robinson (1971) and Moore, Goodman and Barnabas (1973). 

Emma Lehmer has kindly written us the definitive note about the next problem. 


ON SQUARE-SEPARABLE PRIMES 


EMMA LEHMER 
1180 Miller Avenue, Berkeley, CA 94708 


Jacob Beard [1980, 744] defined a 3-square-separable prime as a prime such that each pair of 
cubic residues which are not quadratic residues, is separated by a quadratic residue. Richard Guy 
[1980, 686] stated “it seems likely that 7, 19, 37 and 43 are the only 3-square-separable primes,” 
and asked the corresponding question where 3 is replaced by any odd s. 

It is the purpose of this short note to restate this question in terms of cyclotomy of order 25, to 
answer the question for s = 3 and s = 5 and to indicate how it can be done for larger values of s. 

Let p = 2sf + 1 be a prime with s odd. We partition the integers less than p into 2s sets C, 
with respect to a primitive root g, so that 


C={g°**/ (mod p),O<k<f-1}, 1<j<2s. 
The cyclotomic number (i, /)),, is the number of integers in C, which are followed by an integer 
of C,. For example, if p = 61, s = 3, f = 10, one of these sixth power residue classes is 
C, = {8, 24, 11, 33, 38, 53, 37, 50, 28, 23}. 
It contains the two consecutive pairs 23, 24 and 37, 38, so that the cyclotomic number (3, 3), 1s 2 
for this prime. 
In terms of this notation we can state at once the following 


THEOREM 1. The prime p = 2sf + 1 is not s-square-separable if (s,s) , > 0. 


Proof. (s,5)5, > 0 implies that there is at least one pair of consecutive s-power residues. 

It is well known that there is only a finite number of primes for which (i, 7),, = 0. Hence our 
problem reduces to finding and examining those primes for which (s,5),, = 0. 

Another obvious theorem 1s: 


THEOREM 2. If p = 8n +°5 and if 2 is an s-power residue, then p = 2sf +1 is not s-square 
separable. 


Proof. Under the conditions of the theorem, the middle pair ( p — 1)/2,(p + 1)/2 are both 
s-power residues and quadratic non-residues, and hence p is non-separable. 


Previous authors considered p = 7 to be 3-square separable and p = 11 to be 5-square-sep- 
arable, but since these primes, and in general the prime p = 2s + 1, have only a single s-power 
residue, such primes may be omitted. 

For s = 3 the cyclotomic numbers of order six can be expressed in terms of the quadratic 
partition [2] 

4p = L* + 27M? with L = 1(mod 3), and, if LM isodd, M = —L (mod 4). 
In particular, the value of (3,3), is given by 
p-5-2L f even, 2 a cubic residue 
p-i14+4L f odd, 2 a cubic residue 
p—5+(5L+9M)/2_ f even, 2 a cubic non-residue 
p-11-(L+9M)/2_ f odd, 2 a cubic non-residue. 


36(3,3)¢ = 
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Since L < ~4p and M < 4p/27, it follows that (3,3), = 0 implies that p < 83 in all cases. 
The following table gives the values of L, M and (3,3), for p < 83: 


p 7 13 19 31 37 43 61 67 + «=73 ~~ ~=«79 
L 1-5 7 4 -1l -8 1 -5 7 -17 
M -1 1 1 2 -1 2 3 =-3 «=3 1 

(3,3 0 0 0 1 0 0 2 2 2 2 


The primes 13, 19, 37 and 43 have to be checked for square-separability, but this is easily done 
with a table of indices. This proves that 13, 37 and 43 are indeed the only 3-square-separable 
primes. 

Similarly for s = 5 we can write down the values of (5,5),,) 1n terms of the quadratic partition 
[4] 

16p = x* + 50u? + 50u? + 125w?, x =1(mod 5) 
as follows: 
p-9+2x f even, 2 a quintic residue 
p-19+2x f odd, 2 a quintic residue 
p—9+4+(17x + 50u— 25w)/4  f even, 2 a quintic non-residue 
p—19+(7x — 50u+ 25w)/4_ f odd, 2 a quintic non-residue. 


100(5,5)10 = 


Since x < 4/p, u,v < 4/p/50, w < 4/p/125 , it follows that (5,5),) = 0 implies that p < 911. 

If we calculate (5,5), from the above formulas, we find that the only values for which (5, 5),. = 0 

are p = 11, 31, 41, 61, 71, and 101. When we examine these we find that 31, 41 and 71 are indeed 

5-square-separable, while 61 is not since it does not contain a square between the pair 29 and 32 

of quintic residues; similarly p = 101 has a pair 39 and 41 of quintic residues with the quadratic 

non-residue 40 in between. Unfortunately, Guy listed 61 and 101 as 5-square-separable primes. 
We have proved: 


THEOREM 3. The only 3-square-separable primes are p = 13, 37 and 43. The only 5-square-sep- 
arable primes are p = 31, 41 and 71. 


Cyclotomy of order 14 has been developed by Muskat [3] and of order 18 by Baumert and 
Fredricksen [1]. These could be used to make the corresponding analysis of the problem for s = 7 
and s = 9, but this would involve more cases and probably should be computer assisted. 
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Gobel’s rind-peeling problem [1982, 113; 1983, 686] remains baffling in its simplicity. Schwenk 
(tbp, 1985) considered sequences containing just two copies of each of m letters and used a 
computer to find sequences yielding the maximum number, p(m), of rinds. Had all 2m letters 
been distinct, the number would be 2?”~!. The deficit, d = 271 — p(m) for the best of these 
sequences is, for m < 14: 


3 4 5 6 7 8 9 10 11 12 13 14 
8 18 55 138 470 1164 4055 10140 35609 89782 316513 803040 


1 


m 1 2 
d 1 2 


1985] UNSOLVED PROBLEMS 721 


Schwenk conjectures that, for m > 8, the greatest number of rinds is given by the sequence 
T(k), k + T(m — k), where k =|m/2] and T(k) is the sequence k,k — 1, 
.. 94, 2,3,1,1,2,3,4,...,k, which runs from k to 1 and back to k, except for the reversal of 3, 2. 
Gibson and Slater (1982, tbp) also consider this case, and generalize the problem to peeling the 
labelled (or colored) vertices from a graph, subject to the condition that the graph remains 
connected. 

In spite of editorial admonition [1983,35] and the appearance of Jeff Lagarias’s article 
[1985, 3-23], the infamous 3x + 1 problem maintains its lure. Korec & Znam (wrc) let Y be a 
subset of P, the set of all positive integers, and write P < Y if, for every x in P, there is some y 
in Y and i and ; such that f(y) = f/(x), where f(x) = x/2 if x is even and 3x + 1 otherwise. 
The 3x + 1 problem then becomes: 

iP ~< {1}? 
They prove that if 2 is a primitive root of p*, then for every a,n in P, where p does not divide a, 
P < {a,mod p”}. 


They say that they have also proved that P < {1,mod 4} and that P < X for a set X of zero 
density. 

Somewhat to his surprise, Peter Giblin writes that moving triangles of the kind envisioned 
[1983, 121] do exist; see his paper with Kingston (tbp). This was also discovered by David Lindsay 
(wrc). 

Leonard Gilman extends the age of snowplow problems [1983, 199] to at least 43 years: such a 
problem appeared in the first edition (1942) of Agnew’s book on Differential Equations. 

Jan Mycielski writes concerning his pursuit problem [1984, 415] that fact (11) implies that 
T)(v,x) 18 not differentiable at x = 1 — v. However, problem (c): Is 75(v, x) differentiable? is 
still open for all other x € [0,1]. His paper [1984, 416, 4] is available on request. 

Thews [1984, 416] asked if the circle |z| < rv/(v — 1) was the domain of largest area which 
could be guarded by a defender with destruction radius r and maximum speed 1 against an 
invader with maximum speed v > 1. Rzymowski and Stachura (wrc) show that this is so. 

Johnson and Brooks (wrc) succeed in generating a bivariate Poisson distribution [1984, 562] 
with correlation p, for any p, —In2 < p < 0. Let U be distributed uniformly on [0,1], X, = 
—InU, X, = —In(1 — U). Then X,, X, are exponentially distributed with parameter A = 1. 
Define N,(t), i = 1,2, as a counting process with interarrival times according to X,. Then the 
vector (N,(t), N,(t)) is bivariate Poisson. For intervals of length ¢ < In2, 


Pr( N,(t) > 0, N(t)>0)=0, Pr(N,(t) = 0, N,(t) = 0) = 2e 7 -1>0 
and 
Pr( N,(t) = k, N,(t) = 0) = Pr(N,(t) = 0, M(t) =k) = tke */k!, =k =1,2,.... 
Further, 
E(N,(t)) = E(N,(1)) = 0, VAR(A,(t)) = VAR(N,(t)) = ¢ 


and correlation (N,(t), N,(t)) = —t. 

Roger Nelsen (wrc) has also generated such distributions with correlation coefficients in the 
interval —0.5 > p > —0.9585. 

Thiemann [1984, 562] asked: when is there a probability measure on a o-algebra? Grzegorz 
Plebanek (wrc) of Wroclaw sends the following result: Let A be a set and let #= {A,:a € A} be 
a semi-compact family of subsets of A, 1.e., for each subset &’ of oY with OW’ = ©, there isa 
finite subset /” of ’ such that NW” = ©. {a,:a € A} is any set of real numbers. By a(.#) 
[resp. o(.#)] we denote an algebra [resp. o-algebra] generated by the family ~. If, for every finite 
subset S of A, there exists a quasi-measure p, on a (.%) such that u.(A,) > a, for a € S, then 
there exists a measure pp On o(.W) satisfying 4(A,) > a, for every a. He proves this from results 
of Pachl (1978, lemma 3.3), Topsoe (1974) and a classical result of Marczewski (1953). 
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Charles H. Jepsen (tbp) discovered six incomparable cuboids [1984, 624] which will tile a 
3 X 5 X 9 cuboid (Fig. 1). 
He later proved that this example, 


(’x1x9)+(Qx2x7)4+(1 4x 6)4+(2x2x 5) 4+(2X4x 4) +(3 x3 x 4) = 135, 
has minimum volume. He proposes two further problems: 


If we erect prisms of height 2 on the seven pieces that tile a 13 X 22 rectangle [1984, 625, 
Fig. 2] we have a trivial incomparable tiling of a 2 x 13 X 22 cuboid. What is the 2x BX C 
cuboid of least volume that admits a non-trivial tiling (one in which the pieces don’t all share a 
common dimension)? Fig. 2 illustrates the smallest found by Jepsen. It derives from two tilings of 
a 35 x 49 rectangle, having a 16 X 21 rectangle in common. One piece is 2 X 16 X 21. The other 
ten all have thickness 1, and other dimensions 18 x 19, 17 x 21, 16 x 22, 15 x 24, 14 x 25, 
13 X 26,9 X 28,6 X 30,5 X 31 and 2 xX 36: the Ist, 3rd, 5th, 8th and 9th are in the top layer. 

Alternatively, is it possible to tile a 2 x B X C cuboid with incomparable 1 x p X q pieces? 
Le., are there two tilings of the same rectangle, with m and n pieces, say, and all m + n pieces 
incomparable? 

Later still, Jepsen sends the non-trivial incomparable tiling of a cube shown in Fig. 3: 


(1 x 10 x 10) +(2 x 5 x 10) +(3 X 6 X 9) +(4 X 4 x 10) +(4 x 5 x 8) 
+(4 6x 7) +(5 x 5 X 6) = 10°. 


He can prove that the number of pieces, seven, is minimal, and he believes that the edge-length, 
10, is minimal, too. 

My indebtedness to innumerable correspondents is clear. Thanks, too, to several readers who 
said that they find the section enjoyable as well as instructive. I will make a New Year resolution 
that it will appear more regularly. 
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A COMBINATORIAL THEOREM ON CIRCULANT MATRICES 


DEAN S. CLARK 
Department of Mathematics, University of Rhode Island, Kingston, RI 02881 


Introductory treatments frequently say that matrices are just rectangular arrays of numbers, 
but, of course, the real intent is to develop their algebraic properties. Here, we go no further than 
matrices as formal or symbolic objects. Half the time, the elements will be colors rather than 
numbers. Matrices have a capacity for storing combinatorial information, and we want to exploit 
that. The following examples extend some classical problems, and provide motivation. 


1. Problem of n queens (Gauss, ca. 1850). Can n mutually invulnerable queens be placed on 
an n X n chessboard? Surprisingly, it seems that the answer—affirmative for n > 4—has been 
known only since 1969 [1], [2]. We give a simple new proof of this fact, below. Here, however, our 
interest is in other surfaces with chessboard-like properties: the wrapped-around cylindrical or 
toroidal boards. Whether on the 8 X 8, 9 X 9, or 10 X 10 versions, the enhanced diagonals can 
make trying to place immune queens a real exercise in futility. 

When can n invulnerable queens be placed on ann X n toroidal chessboard? 


2. Ring dancers. N husbands have joined hands in a large circle, facing inward, while their N 
wives have linked arms in an inner circle, facing outward. The two groups are dancing in opposite 
directions. Now, freeze the action. Picking a husband at random, we expect that the person facing 
him is not his wife. On the other hand, it isn’t unreasonable to suspect that, somewhere on the 
circle, two spouses are facing each other. 

Ts it always possible to freeze the action at an instant when no spouses are facing each other? 


3. N pigeons, N holes. The pigeonhole principle says that it takes at least N + 1 pigeons, 
flying into N pigeonholes, to guarantee that one of the pigeonholes will contain several pigeons. 
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Here is a classic application [3], [4], [5], [6]: 

Suppose that a disk has been divided into n = 2k congruent sectors with k sectors colored 
blue and k colored red. Let the same thing be done to a smaller disk. The problem is to show that 
the smaller disk can always be positioned concentrically within the larger so that at least half the 
sectors have matching color. We comment on the solution later. For our nonstandard application, 
which removes the critical N + Ist pigeon (see below), let be arbitrary and, instead of using 
colors, label the sectors of each disk with the numbers 1, 2,..., in any order. We can place the 
disks concentrically and talk about the sum of the integer labels in a sector. 

Can the disks be positioned so that at least two sector sums are equal? 


4. A beautiful deception. To perpetrate this admitted flim-flam, one needs an accomplice 
seated anonymously in the audience. Participants are asked to draw a clockface with the numbers 
1,2,...,12 arranged in any order they please. A persuasive MC will stress the secrecy of each 
individual’s arrangement, that there are 11! or nearly 40 million possibilities, etc. This being done, 
everyone is asked to note the largest sum given by three consecutive numbers on their permuted 
clockface. This becomes a player’s secret number. Now the bidding starts. A person is to bid any 
number if they are convinced that no one else in the audience can undercut them with their secret 
number. For example, I would bid 27 if I felt certain that no one else had a secret number of 26 or 
less. The winner is either the last bidder or the first one who undercuts another bidder. It’s a game 
of nerve and reflexes. Invariably, the accomplice wins. 

What is the accomplice’s secret number, and what is the probability that someone else in the 
audience has it? 


Interestingly, all these examples boil down to questions about the elementary structure of 
circulant matrices. Recall that a circulant (retrocirculant) is a square matrix identified by its top 
row. This row is shifted successively a step to the right (left) with wraparound to produce the 
remaining rows. We use the notation Circ(a, b,c,...) (Circ. (a, b,c,...)) where a, b,c,... are 
distinct colors. Major (minor) diagonals are the monochrome sets in Circ(a, b,...) 
(Circ, (a, b,...)). The toroidal chessboard, ring dancers, and sectored disks are covered by the 
following basic result. 


THEOREM. It is possible to choose n positions in the n X n circulant matrix Circ(a, b,c,...) 
which are (i) on distinct columns, (11) on distinct rows, (ii1) of distinct colors; if and only if n > 1 is 
odd. It is possible to replace (iii) with (ii’) of distinct colors both in Circ(a,b,c,...) and 
Circ. (a, b,c,...); if and only if n > 1 is an odd integer not divisible by 3. 


Proof. Let n >1 be odd. Superimpose Circ(0,1,2,...,2—1) on Circ(a, b,...). Let (x), 
denote the nonnegative remainder after x is divided by n. The position in the kth column and 
a,th row shows diagonal number (k — a,),, k = 0,1,...,2 — 1. To locate the n positions which 
satisfy (i)—(ii1), begin circling elements starting with the upper leftmost (zero). We move from left 
to right, as the knight does in chess, over a column and down two rows circling elements as we go. 
This staircase pattern wraps around at midboard and continues to the last column. The n circled 
positions are on distinct rows because n is odd. The corresponding diagonal sequence {0,n — 1, 


n— 2,...,3,2,1} shows the circled positions have distinct color in Circ(a, b, c,...). 
To argue that n odd is necessary for (1)-(iii), suppose that in column k the a,th row element 
has been chosen, k = 0,1,...,m — 1. Since the a, and (k — a,),, are distinct, 


n—1 n—l 
0=, 0 (k- a), = Dk=5(n- In, 
k=0 k=0 


where =,, denotes congruence modulo n. This means that n must be odd. 

Now consider the stronger condition (iii’). Superimpose Circ. (0,1,2,...,2— 1) on 
Circ. (a, b,...). Circle the same n positions as before. In general, they will not lie on distinct 
minor diagonals, as exemplified by the 3 x 3 case. But if n>1 is not divisible by 3, the 
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corresponding diagonal sequence {(k + a,),}%_5 will be {0,3,6,...} without possibility of 
repetition. These same fixed positions are distinctly colored in Circ(a, b,...) and Circ. (a, b,...) 
simultaneously. 

In fact, n # 0 (mod 3) is necessary for (iii’). To see this, suppose that {a,}, {(k — a,),,} and 
{(k + a,),},0<k <n -—1, are all permutations of {0,1,...,n — 1}. Then, 


= 2 2 = — (n—1)(2n-1)n 
dL (k-a,), + XL (k+ a), = 2 Lk? implies 0 =, 2) ag =~", 
k=0 k=0 k=0 k=0 


which holds if and only if n is odd and not divisible by 3. 


COROLLARY 1. It is possible to place n invulnerable queens on the cylindrical or toroidal 
chessboard if and only if the unwrapped n X n board satisfies n > 5, n odd and not divisible by 3. 


COROLLARY 2. It is possible to place n invulnerable queens on an n X n chessboard for all n > 4. 


Corollary 1 follows by observing that the monochrome diagonals of Circ(a, b,...) 
(Circ, (a, b,...)) model the major (minor) diagonals of a wrapped-around chessboard. Queens 
placed in the circled positions on such a board remain invulnerable when it is unwrapped. Since a 
queen has been placed in the upper leftmost square, pruning the first row and column leaves the 
remaining queens in immune formation. Thus, we have already solved the classical queens 
problem of Corollary 2 for odd values of n, 3 + n, and the corresponding even values n — 1. 


Completion of proof of Corollary 2. Take n=2m+1 > 3, where 3|n. We modify the 
construction in [2] slightly and use circulant matrix arguments. As before, the queen placed in the 
upper leftmost square is deleted along with the first row and column to give the solution for 
n — 1. This being done, consider the remaining 2m X 2m submatrix in Circ. (m,m+1,..., 
n—1,0,1,...,m — 1), shown below. 


m m+ 1 cee n-3 n-2 
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Partition this submatrix into left and right halves, denoted L and R respectively. Place the 
remaining queens in positions circled as follows. Begin by circling 0 in the first column of L, then, 
moving from left to right, one column over and two rows down with w denoting a typical circled 
position in the lower half of L, w = 3h, h = 0,1,.... Wrap around to the top of L, where x is 
the typical term, x = 3i+ 1 (modn), and continue until all m columns of L are filled. We 
perform the inverse process in R: Circle n — 1 in the last column of R, then, moving from right 
to left, over one column and up two rows, with y denoting a typical circled position in the upper 
half of R, y=n— 3j —1, j =0,1,... . Wrap around to the bottom of R, where z is the typical 
term, z = n — 3k — 2 (modn), and continue until all m columns of R are filled. 

Clearly, the queens occupy distinct rows and columns. Because equations like w = y have no 
solution (3 + nm — 1,3 tm — 2), attack along minor diagonals is impossible. On the other hand, 
x = z does have solutions, and these queens would be vulnerable on the toroidal chessboard. Here 
on the unwrapped board they are safe in L and R. 

Confirming immunity along major diagonals is even easier. One checks the same positions in 
Circ(m —1,m,...,n —1,0,1,...,m— 2). 


COROLLARY 3. In Example (2) there is always an instant when none of the face-to-face pairs are 
spouses if and only if N is even. Similarly, in Example (3), there is always a superposition of disks 
with at least two equal sector sums if and only if n is even. 


Proof. Example (2) is absorbed into Example (3) by considering the disk problem with n 
distinct colors (= spouses) per disk. If the disks can be superimposed so that no sectors match, the 
ring dancers problem is solved. Let s,t,u,... denote either the color pattern or sequence of 
integer labels of the sectors of the larger disk, written linearly. Let a, b,c,... be the same for the 
smaller disk. Write Circ(a,b,c,...), the inventory of superpositions, directly underneath 
s,t,u,.... Suppose that n is even. If s,t,u,... are colors, move from left to night circling the 
element in each column of Circ(a, b,...) which matches the color of s,t,u,... heading that 
column. If s,t,u,... are integers, circlen+1—s,n+1-—t,n+1-—4,.... 

Think of the circles as pigeons and the rows of Circ(a,b,...) as the pigeonholes. In the 
classical (red-blue) disk problem, there would be k pigeons per column (half the column entries 
are color matches), giving a total of 2k* pigeons filling up 2k = n pigeonholes. One of the rows 
of Circ(a, b,...) would wind up with at least k circled elements—the desired superposition. 

But here, the standard pigeonhole principle doesn’t apply. Because the colors are distinct each 
column gets a single circle, hence, n pigeons into n pigeonholes. Enter the Theorem: The circled 
positions are in distinct columns and have distinct colors, but n is even. Thus, there must be a row 
in Circ(a, b,...) with at least two circled elements. In the sector-sums case, this means that 


(nt+1l—x)+x=n+1=(n+1-y)+y, 


where x and y label two sectors of the larger disk. 

For the ring dancers, the pigeonhole principle works in reverse. The two citcled elements in one 
row imply that there must be another row without circled elements. That is the instant when no 
spouses face each other. 


All these claims break down when 7 is odd. In that case, given colors s,t,u,..., construct 
Circ(a, b,...) by first replicating s,t,u,... from left to right up the minor main diagonal and 
then extending the colors along major diagonals. Now there is no row of Circ(a, b,...) without a 
match with an element of s,t,u,.... If s,t,u,... are integers and n is odd, it isn’t hard to label 
the sectors so that every superposition gives distinct sector sums. 

Example (4) is connected with circulants in a different way. Let x,,x,,...,x,, denote a circular 
permutation of the integers 1,2,...,.n, n>7, and s, = xX, + X41 + X40. Then, S = 


Circ(1,1,1,0,...,0)X, where X7 = (x,,...,x,), S’ = (s;,...,5,). Obviously, not every choice of 
S can result in a feasible solution X since the elements of X must be distinct integers. 
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LEMMA. Nondistinct solutions of S = Circ(1,1,1,0,...,0)X result (i) if two consecutive elements 
in S are equal; or (ii) if S contains five consecutive elements of the form abcba. 


Proof. The equation x, + X,41 + X,42 = X,41 + X,42 + X,43 implies x, = x,,3, confirming 
(i). To confirm (11), suppose, without loss of generality, that 


Xy +X. +X, = A,X, +X, +X, =O, Xp t+ X65 4+ xX =D, XS +X + XZ =A. 
These equations imply x, — x, =a— b= x, — x4, 80 x; = Xp. 


The key question regarding the flim-flam game of Example (4) is: What can a player’s secret 
number possibly be? Obviously it can’t be, say, 14 because everyone has a 12 on their clockface 
and, next to it, at least 1 and 2. The bidding should begin at 16. The bids 17,18,19,20,... pass 
uneventfully. 

Let o denote any secret number. To obtain a lower bound on o we compute 5, the average 
value of the triple-sums s;: 


Thus, o > 20 > s. In actuality, o can never equal 20. Twelve integers, the largest of them 20, 
average out to 19.5 only if at least 6 of them equal 20. If exactly 6 of the s, equal 20, then the 
others must all be 19. In that case, either two consecutive s , are the same, or else the 20’s and 19’s 
alternate. Both possibilities are violations, in view of the above Lemma. If more than 6 of the s, 
equal 20, we have the same violation because (the pigeonhole principle!) at least two consecutive 
entries in S must be 20. 

Thus, o > 21. Meanwhile, my accomplice is ready with a circular permutation for which o 
attains its lower bound. It is 1, 8, 10, 3,5, 9, 4, 6,11, 2, 7, 12. 

The only remaining question: What is the probability that someone in the audience has also 
stumbled onto a circular permutation of 1,2,...,12 for which o = 21? Frankly, I don’t know the 
probability, but I believe that it is very small. After all, my accomplice and I have never lost a 
game. 


Acknowledgment. I wish to thank Professors P. J. Davis and O. Shisha for helpful discussions. 
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Differential Equations, P. Differential Equations. Ed: Ian W. Knowles, Roger T. Lewis. Math. 
Stud., V. 92. Elsevier Sci, 1984, xx + 608 pp, $61.50 (P). [ISBN: 0-444-86875-5] Proceedings of an 
international conference held at the University of Alabama in Birmingham during March 1983. 76 
papers on ordinary and partial differential equations, from participants from 20 countries. LAS 
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Differential Equations, P. Nonlinear Singular Perturbation Phenomena: Theory and Applications. K.W. 
Chang, F.A. Howes. Appl. Math. Sci., No. 56. Springer-Verlag, 1984, viii + 180 pp, $18.80 (P). 
(ISBN: 0-387-96066-X] Recent results on the existence and asymptotic behavior of solutions of cer- 
tain classes of singularly perturbed nonlinear boundary value problems. Both scalar and vector 
problems are studied using differential inequality techniques. AO 


Partial Differential Equations, T(17-18: 1), S, P. Partial Differential Equations. Phoolan Prasad, 
Renuka Ravindran. Wiley, 1984, xi + 252 pp, $24.95. [ISBN: 0-470-20071-5] A basic introduction to 
partial differential equations. Three chapters treat first-order, linear second-order, and hyper- 
bolic partial differential equations. Physical applications and classical problems are treated as 
they arise. Good for self-study; each short section contains exercises; most contain worked exam~ 
ples. Limited bibliography. PZ 


Partial Differential Equations, P. Elliptic Problems in Nonsmooth Domains. P. Grisvard. Mono. & 
Stud. in Math., V. 24. Pitman, 1985, xiv + 410 pp, $49.95. [ISBN: 0~273-08647-2] Studies elliptic 
boundary value problems in domains with nonsmooth boundaries and problems with mixed boundary condi- 
tions by deriving modified "shift theorems." Presents results mainly in the framework of Sobolev 
spaces, but also in spaces of Holder functions. DFA 


Partial Differential Equations, $(16-18), P, L. Solving Elliptic Problems Using ELLPACK. John R. 
Rice, Ronald F. Boisvert. Ser. in Comp. Math., No. 2. Springer-Verlag, 1985, x + 497 pp, $46.50. 
[ISBN: 0-387-90910-9] ELLPACK is a software system for solving elliptic boundary value problems. 
This volume contains the user’s manual, detailed system documentation, and examples illustrating use 
of the system. AO 


Partial Differential Equations, $(18), P. Large Deviations and the Malliavin Calculus. Jean-Michel 
Bismut. Prog. in Math., V. 45. Birkhauser Boston, 1984, viii + 216 pp, $17.95. [ISBN: 0-8176- 
3220-4] The Malliavin calculus and large deviation techniques are used to study the asymptotics of 
the conditional probabilities of bridges associated with certain hypoelliptic diffusions. Includes 


a list of references. CEC 


Partial Differential Equations, P. Nonlinear Partial Differential Equations and their Applications, 
Collége de France Seminar, Volume VI. Ed: H. Brezis, J.L. Lions. Res. Notes in Math., No. 109. 


Pitman, 1984, 323 pp, $27.50 (P). [ISBN: 0-273-08646-4] Written versions of lectures held during 
the year 1982-1983 at the weekly Seminar in Applied Mathematics at the Collége de France. JAS 


es 


Verlag, 1984, v + 238 pp, $11 (P). [ISBN: 0-387~13361-5] 


Numerical Analysis, $(16-17), P. Least Squares Computations Using Orthogonalization Methods. James 
W. Longley. Lect. Notes in Pure & Appl. Math., V. 93. Dekker, 1984, xi + 308 pp, $59.75 (P). 
(ISBN: 0-8247-7232-6] Discusses the use of Gram-Schmidt orthogonalization and Givens transformations 
for least-squares computations, especially for statistical applications. Listings of FORTRAN imple- 
mentations of many of the algorithms are included. AO 


Numerical Analysis, P. Defect Correction Methods: Theory and Applications. Ed: K. Bohmer, H.J. 
Stetter. Computing Supple., V. 5. Springer-Verlag, 1984, ix + 243 pp, $20 (P). [ISBN: 0-387- 
81832-4] 13 papers on techniques which use knowledge of some measure of the error to obtain itera- 
tive improvements. Includes theoretical foundations; applications to eigenvalue problems, stiff 
ordinary differential equations, and partial differential equations; multi-grid methods and inclu- 
sion algorithms; and adaptive software. RWN 


Numerical Analysis, S(16-17), L*. Methods of Numerical Integration, Second Edition. Philip F. 
Davis, Philip Rabinowitz. Comp. Sci. & Appl. Math. Academic Pr, 1984, xiv + 612 pp, $52. [ISBN: 
0-12-206360-0] An updated and expanded edition of an important reference. Major changes have been 
made in the chapters discussing the evaluation of multi-dimensional integrals and automatic integra~ 
tion. (First Edition, TR, January 1976.) AO 


Numerical Analysis, T(16-17: 1), L. Finite Element Solution of Boundary Value Problems: Theory and 
Computation. 0. Axelsson, V.A. Barker. Comp. Sci. & Appl. Math. Academic Pr, 1984, xviii + 432 
pp, $59. [ISBN: 0-12-068780-1] An introduction to both the theoretical and computational aspects of 
the finite element method for solving boundary value problems for partial differential equations. 
Primary attention is given to second-order, linear, self-adjoint problems. AO 


Numerical Analysis, T(13-14: 1), L*. Computing in Applied Science. William J. Thompson. Wiley, 
1984, xxiv + 325 pp, $26.95. [ISBN: 0-471-09355-6] An introduction to a number of mathematical 
topics important in applications (complex numbers, finite difference methods, power series, Fourier 
expansions, differential equations, vector dynamics, and data analysis) together with related numer- 
ical methods. AO 


Functional Analysis, $(18), P. Ihe Theory of Tikhonov Regularization for Fredholm Equations of the 
First Kind. C.W. Groetsch. Research Notes in Math., V. 105. Pitman, 1984, 104 pp, $16.95 (P). 
[ISBN: 0-273-08642-1] Motivates material well with short introductory chapter on equations of the 
first kind, ill-posed problems, generalized inverses, and linear operators in Hilbert space. Then 
discusses general regularization, Tikhonov regularization and finite approximation methods for 
Fredholm equations of the first kind. BH 
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Functional Analysis, T*(18). A Course in Functional Analysis. John B. Conway. Grad. Texts in 
Math., No. 96. Springer-Verlag, 1985, xiv + 404 pp, $38. [ISBN: 0-387-96042-2] Well-motivated gra- 
duate text. Assumes measure and integration theory and point set topology and requires at least a 
concurrent course in analytic function theory. Many good problems of varying difficulty at the end 
of each section. BH 


Functional Analysis, $(18), P. Analytic Sets in Locally Convex Spaces. Pierre Mazet. Math. Stud., 
V. 89. Elsevier Sci, 1984, ix + 275 pp, $38.50 (P). [ISBN: 0-444-86867-4] This book gives defini- 
tions and basic tools for the geometry of analytic sets in the infinite dimensional case, and gen- 
eralizes the theorems on local representation of analytic spaces together with the great classical 
theorems viz. the Nullstellensatz, the direct image theorem and the theorem of Remmert-Stein. MU 


Functional Analysis, P. Fifteen Papers on Functional Analysis. Ed: Lev. J. Leifman. AMS Transl. 
Ser. 2, V. 124. AMS, 1984, viii + 183 pp, $60. [ISBN: 0-8218-3085-6] 


Analysis, T(18: 1), P. Brownian Motion and Martingales in Analysis. Richard Durrett. Wadsworth, 
1984, xi + 328 pp, $34.95. [ISBN: 0-534-03065-3] Brownian motion can be used to prove many results 
in classical analysis, but most such results are not widely known. The purpose of this book is to 
bring such results together. Chapters 1 and 2 introduce Brownian motion and the stochastic 
integral, and the final seven chapters give applications. MT 


Analysis, T(17), S, P. Lecture Notes in Mathematics-1078: Integration Theory. A.J.E.M. Janssen, P. 
van der Steen. Springer-Verlag, 1984, 224 pp, $11 (P). [ISBN: 0~-387-13386-0] Study of Lebesgue 
integration which compares the methods of Bourbaki, Daniell, and Carathéodory. Demonstrates that 
these methods have a large common ground which can be subsumed in a single theory. Clear, well~ 


motivated exposition. Many problems. BH 


Analysis, T(16-18: 1, 2), S, P, L. Singularities of Differentiable Maps, V. I: The Classification of 
Critical Points, Caustics and Wave Fronts. V.I. Arnold, §.M. Gusein-Zade, A.N. Varchenko. Mono. in 
Math., V. 82. Birkhauser Boston, 1985, x + 382 pp, $44.95. [ISBN: 0-8176-3187-9] An example-driven 
"zoology" of singularities of differentiable maps, covering basic concepts and applications with 
minimum theoretical or technical details. Adapted from courses given at Moscow State University 


from 1966 to 1978; translated from the Russian original. LAS 


Analysis, P*. n-Widths in Approximation Theory. Allan Pinkus. Erge. der Math. und ihrer Grenzge~ 
biete, Band 7. Springer-Verlag, 1985, x + 291 pp, $39. [ISBN: 0-387-13638-X] Develops basic theory 
of n-widths and applies it to Hilbert spaces, integral operators, matrices, Sobolev spaces, and 
spaces of analytic functions. Well motivated. Extensive bibliography. BH 


Analysis, P. Lecture Notes in Mathematics-1108: Global Analysis-~Studies and Applications I. Ed: 
Yu. G. Borisovich, Yu. E. Gliklikh. Springer-Verlag, 1984, 301 pp, $16 (P). [ISBN: 0~-387-13910-9] 
Translations of main articles from the series "New Developments in Global Analysis" (in Russian) 


published 1982-1984. JAS 


Analysis, P. Approximation Theory and Spline Functions. Ed: S8.P. Singh, J.W. H. Burry, B. Watson. 
NATO ASI Ser. C., V. 136. D Reidel, 1984, ix + 485 pp, $69.50. [ISBN: 90-277~-1818-0] The proceed- 
ings of a NATO Advanced Study Institute held at Memorial University of Newfoundland in 1983. Papers 
cover a wide range of topics: multivariate approximation, spline functions, rational functions, 
applications of elliptic integrals and functions, and Padé approximation. AO 


Analysis, T*(16-18: 1, 2), S, L. Curves and Singularities: A Geometrical Introduction to Singularity 
Theory. J.W. Bruce, P.J. Giblin. Cambridge U Pr, 1984, xii + 222 pp, $39.50; $15.95 (P). [ISBN: 
0-521-24945-7; 0-521-27091-X] An elementary (undergraduate) treatment of geometrical applications of 
Singularity theory that grow out of transversality and unfolding. Many examples and exercises make 
this ideally suited to class or seminar use. LAS 


Analysis, P. Lecture Notes in Mathematics-1096: Théorie du Potentiel. Ed: G. Mokobodzki, D. Pin- 


chon. Springer-Verlag, 1984, ix + 582 pp, $27.50 (P). [ISBN: 0-387~-13894-3] Proceedings of the Col- 
loque Jacques Deny held at Orsay, June 20-23, 1983. JAS 


Analysis, P. Minimal Surfaces and Functions of Bounded Variation. Enrico Giusti. Mono. in Math., 
V. 80. Birkhauser Boston, 1984, xii + 240 pp, $39.95. [ISBN: 3~7643-3153-4] “Account of the theory 
of parametric and non-parametric minimal surfaces of codimension one in Euclidean spaces of arbi- 
trary dimension." Topics include: existence and regularity almost everywhere of solutions to the 
Plateau problem, minimal cones, dimension of the singular set of surfaces, Dirichlet problem for the 


minimal surface equation, Bernstein problem for entire solutions. BH 


Algebraic Geometry, P. Surfaces with Canonical Hyperplane Sections. D.H.J. Epema. CWI Tract, V. 1. 
Math Centrum, 1984, ix + 105 pp, D£1l. 15,50 (P). [ISBN: 90-6196-267-6] Let X be a projective alge- 
braic surface. X has canonical hyperplane sections if X can be embedded in some projective space in 
such a way that a general hyperplane section is a smooth canonically embedded curve. Author classi- 
fies such surfaces and studies specifically those which are birational to an irrational ruled sur- 
face. BH 


Algebraic Geometry, P. Lecture Notes in Mathematics-l1101: Resolution of Surface Singularities. Vin- 


cent Cossart, Jean Giraud, Ulrich Orbanz. Springer-Verlag, 1984, vii + 132 pp, $7.50 (P). [ISBN: 
0-387-13904-4] The three authors give lectures on the work of Zariski, Hironaka and Abhyankar on the 
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resolution of the singularities of algebraic surfaces. Emphasis is on easy canonical processes. 
There is also an appendix by Hironaka on the resolution of excellent surfaces. LN 


Algebraic Geometry, P. Lecture Notes in Mathematics~1092: Complete Intersections. Ed: S. Greco, R. 
Strano. Springer-Verlag, 1984, vii + 299 pp, $16 (P). [ISBN: 0-387-13884-6] Lectures given at the 
First 1983 Session of the Centro Internationale Matematico Estivo (CIME) held at Acireale (Catania), 


Italy, June 13-21, 1983. JAS 


Geometry, S(15~17), P. Elementargeometrie: Das Fundament der Analytischen Geometrie. Paul Lorenzen. 
Bibliographisches Inst, 1984, 238 pp, (P). [ISBN: 3-411-00400-2] 
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angulability. Andrei Verona. Springer-Verlag, 1984, ix + 160 pp, $9.50 (P). [ISBN: 0-387-13898-6] 
A proof that any proper, topologically stable smooth map from one smooth manifold without boundary 
to another is triangulable. JAS 


Topology, 1T(15-16: 1, 2), L. General Topology. Jacques Dixmier. Undergrad. Texts in Math. 
Springer-Verlag, 1984, x + 140 pp, $18. [ISBN: 0-387~-90972-9] Originally written for students in the 
third university year in France, the present book is a translation of the original together with a 
few minor improvements and the addition of a section on normal spaces. It is informal but dense by 
American standards; many proofs are left to the student. Filter base notions are used for defini- 
tions of limits and continuity. The exercises for all chapters are ensconced in an unconscionably 
short exercise section at the end of the book. Includes a chapter on normed spaces culminating in 
Riesz’s theorem, and a chapter on infinite sums culminating with certain summable families in Hil- 
bert spaces. PH 


Operations Research, T(15), S, L. Simulation of Waiting-Line Systems. Susan L. Solomon. Prentice- 
Hall, 1983, xi + 452 pp, $32. [ISBN: 0-13-810044-6] Intended as an introductory text for simulation. 
Applications deal with waiting-line systems, but techniques apply to other simulations as well. The 
text includes three chapters discussing use of the simulation package GPSS. MT 


Optimization, S(18), P. Variational and Quasivariational Inequalities: Applications to Free Boundary 
Problems. Claudio Baiocchi, Anténio Capelo. Transl: Lakshmi Jayakar. Wiley, 1984, ix + 452 pp, 


$57.95. [ISBN: 0-471-90201-2] Studies variational and quasivariational inequalities of elliptic 
type, including the mathematical theory behind them and their application to free-boundary problems. 
Accessible to anyone with basic understanding of Lebesgue integration theory, measure theory and 
elementary properties of Hilbert and Banach spaces. No problems. Useful as a reference. Extensive 
bibliography. BH 


Optimization, T(15-16), S, L. Introduction to Non-Linear Optimization. L.E. Scales. Springer- 
Verlag, 1985, xi + 243 pp, $19.80 (P). [ISBN: 0-387-91252-5] A survey of non-linear optimization 
methods emphasizing those that are generally applicable, widely available, and supported by theory. 
Unfortunately, no exercises accompany the text. AO 


Optimization, T*(14-16: 1), L*. Combinatorial Optimization for Undergraduates. L.R. Foulds. Under- 
grad. Texts in Math. Springer-Verlag, 1984, xii + 227 pp, $36. [ISBN: 0-387-90977-X] An introduc- 
tion to combinatorial optimization techniques and applications. Topics include linear programming, 
integer programming, dynamic programming, and network algorithms. AO 


Optimization, P. Mathematical Programming. Ed: Richard W. Cottle, Milton L. Kelmanson, Bernard 
Korte. Elsevier Sci, 1984, viii + 357 pp, $59.50. [ISBN: 0-444-86821-6] Proceedings of the Interna- 
tional Conference on Mathematical Programming held in Rio de Janeiro in April 1981. Contains 
twenty-one papers. AO 


Optimization, T(17-18: 1), P. Network Flows and Monotropic Optimization. R.T. Rockafellar. Wiley, 
1984, xiii + 616 pp, $44.95. [ISBN: 0-471-88078-7] Presents an introduction to the theory of network 
programming problems and generalizations of these results to a larger class of monotropic program- 
ming problems (problems in which a preseparable convex function is minimized subject to linear con- 
straints). AO 


Optimization, P. Variational Convergence for Functions and Operators. H. Attouch. Appl. Math. Ser. 
Pitman, 1984, 423 pp, $29.95 (P). [ISBN: 0-273-08583~2] "Focuses on minimization problems and 
develops a convergence theory for sequences of functions called epi-convergence, which may be 
regarded as the “weakest” notion which allows approach to the limit in these problems." Extensive 
bibliography. BH 


Optimization, 1T(18), S, P. Nonlinear Functiona] Analysis and its Applications III: Variational 
Methods and Optimization. Eberhard Zeidler. Transl: Leo F. Boron. Springer-Verlag, 1985, xxii + 
662 pp, $59. [ISBN: 0-387-90915-X] Translation of the revised and extended German version. Topics 
include existence and uniqueness principles, extremal problems with and without side conditions, 
saddle points and duality, and variational inequalities. Useful reference with very extensive 
bibliography. Numerous problems of varying degrees of difficulty with solutions or references to 
solutions. BH 


Dynamical Systems, P. Dynamical Systems and Microphysics: Control Theory and Mechanics. Ed: Austin 
Blaquiére, George Leitmann. Academic Pr, 1984, x + 403 pp, $29.50. [ISBN: 0-12-104365-7] A collec- 
tion of papers that explore some topics of common interest in contemporary mechanics and mathemati- 
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cal system theory. AO 


Dynamical Systems, P. Fluids and Plasmas: Geometry and Dynamics. Ed: Jerrold E. Marsden. Contemp. 
Math., V. 28. AMS, 1984, xvi + 448 pp, $35 (P). [ISBN: 0-8218-5028-8] Proceedings of the summer 
research conference held at the University of Colorado, Boulder, in July 1983. The papers discuss 
geometric and dynamical systems aspects of fluid and plasma dynamics as well as analytical and 
numerical methods. AO 


Control Theory, P. Decision and Control in Uncertain Resource Systems. Marc Mangel. Math. in Sci. 
& Eng., V. 172. Academic Pr, 1985, xiii + 255 pp, $39.50. [ISBN: 0~12-468720-2] A graduate-level 
textbook, as well as a resource book, for deciding optimal policies on what to do, and how to do, 
exploration and exploitation of natural resources (exhaustible and renewable). The first two 
chapters on mathematics aim to keep the book as self-contained as possible, and includes a discus- 
sion of optimal stochastic control, one of the major tools used in the specific resource applica- 


tions which follow. A final chapter is on numerical methods. LCL 


Control Theory, P. Lecture Notes in Mathematics-1107: Nonlinear Analysis and Optimization. Ed: C. 


Vinti. Springer-Verlag, 1984, v + 214 pp, $11 (P). [ISBN: 0-387-13903-6] Proceedings of the inter- 
national conference in honor of Lamberto Cesari held in Bologna, Italy, May 3-7, 1982. JAS 


Systems Theory, P. System Theoretic Description of Physical Systems. A.J. van der Schaft. CWI 
Tract, No. 3. Math Centrum, 1984, xv + 256 pp, Dfl. 36,90 (P). [ISBN: 90-6196-269-2] Introduces 
and describes general system-theoretic models, including finite dimensional linear and nonlinear 
systems, and applies them to Hamiltonian systems, symmetries and conservation laws, gradient systems 
and optimal control. Well motivated. BH 


Probability, P. Advances in Probability Theory: Limit Theorems & Related Problems. Ed: A.A. 
Borovkov. Optimization Software (Worldwide Distr: Springer-Verlag), 1984, xiv + 377 pp, $48. [ISBN: 
0-387-90945-1] Collection of articles on limit theorems grouped into three parts: 1) Invariance 
principle of Donsker and Prokhorov, 2) Limit theorems for random processes and their applications, 
3) Miscellaneous. English translation from Russian. BH 


Probability, T(18: 1), S. Introduction to Probability Theory. Kiyosi It@. Cambridge U Pr, 1984, x 
+ 213 pp, $32.50. [ISBN: 0-521-26418-9] A measure-theoretic probability theory text. Topics include 
probability spaces, random variables and vectors, independence, sigma-algebras, characteristic func- 
tions, conditional probability measures, and limit theorems. MT 


Probability, P. Comparison Methods for Queues and Other Stochastic Models. Dietrich Stoyan. Wiley, 
1983, xiii + 217 pp, $39.95. [ISBN: 0-471-10122-2] Presents unified methods for approximating sto- 
chastic models by other, simpler models, and for obtaining bounds on stochastic models. The bounds 
and approximation methods depend on monotonicity and continuity properties of stochastic models. MT 


Probability, T*(16: 1, 2), S, P, L. Introduction to Probability Models, Third Edition. Sheldon M. 
Ross. Academic Pr, 1985, xiv + 502 pp, $28. [ISBN: 0-12-598463-4] Revision of the author’s 1980 
Second Edition (TR, January 1982). The chapter on statistical estimation has been removed and a 
chapter on Brownian motion and stationary processes and an extensive chapter on simulation have been 
added. In addition, new material has been added to all but the first chapter. RSK 


Probability, T(15: 1), L. Introductory Probability Theory. Janos Galambos. Probability: Pure & 
Appl. Dekker, 1984, vii + 200 pp, $25. [ISBN: 0-8247-7179-6] A text for a one~semester introduction 
to probability theory. Assumes a basic knowledge of calculus. Covers the usual topics, including 
conditional probability, independence, random variables, expectation, and the Central Limit Theorem. 
Separates more involved mathematics from concepts and keeps the mathematical level relatively low. 
MT 


Probability, S(17), P. General Irreducible Markov Chains and Non-negative Operators. Esa Nummelin. 
Tracts in Math., V. 83. Cambridge U Pr, 1984, xi + 156 pp, $37.50. [ISBN: 0-521-25005-6] Deals with 
theory of general irreducible Markov chains and its connection with the Perron-Frobenius theory of 
non-negative operators. Applications include queueing theory, storage theory, autoregressive 


processes and renewal theory. Extensive bibliography. BH 


arn ee NS a EN OE NE Cs SNE  Seaetaieaiaer 


1982. R.M. Dudley, H. Kunita, F. Ledrappier. Springer-Verlag, 1984, x + 396 pp, $20 (P). [ISBN: 
0-387-13897-8] Three sets of lectures: "A Course on Empirical Processes," by R.M. Dudley; "Stochas- 
tic Differential Equations and Stochastic Flow of Diffeomorphisms," by H. Kunita; and "Quelques 
Proprietes des Exposants Caracteristiques," by F. Ledrappier. JAS 


Probability, T(18: 1). Martingales and Stochastic Integrals. P.E. Kopp. Cambridge U Pr, 1984, xi + 
202 pp, $29.95. [ISBN: 0-521-24758-6] Theoretical presentation of three main topics, discrete time 
martingales, continuous-time martingales, and stochastic integrals, preceded by introductory 
material on measure theoretic probability and functional analysis. Also includes an appendix on a 
non-commutative extension of stochastic integration. RSK 


Probability, 1T(18: 1), P. Convergence of Stochastic Processes. David Pollard. Ser. in Stat. 
Springer-Verlag, 1984, xiv + 215 pp, $28. [ISBN: 0-387-90990-7] This book deals with empirical 
processes and their convergence properties, discussing convergence under different metrics and on 
several spaces. MT 
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Statistics, $(17), P. An Introduction to Latent Variable Models. B.S. Everitt. Mono. on Stat. & 
Appl. Prob. Chapman & Hall, 1984, viii + 107 pp, $20. [ISBN: 0-412-25310-0] This book introduces 
models involving latent variables, concepts that cannot be observed directly but which have effects 


on measurable variables. Latent variables are useful mostly in studies in social sciences. MT 


Statistica, P*. Time Series in the Frequency Domain. Ed: D.R. Brillinger, P.R. Krishnaiah. Hand- 
book of Stat., V. 3. Elsevier Sci, 1983, xiv + 485 pp, $88.50. [ISBN: 0-444-86726-0] Third volume 
in this series of reference books on statistical methodology and applications (Volume 1, Analysis of 
Variance, TR, June-July 1981; Volume 2, Classification, Pattern Recognition and Reduction of Dimen- 
sionality, TR, December 1983). Contains 20 chapters providing an up-to-date picture of the spectrum 
approach to time series analysis. RSK 


Statistics, T(14~16: 1, 2), S. Probability and Statistics for Engineers and Scientists, Third Edi- 
tion. Ronald E. Walpole, Raymond H. Myers. Macmillan, 1985, xiii + 639 pp. [ISBN: 0-02-424170-9] 
This new edition contains new examples using real data, several rewritten sections, a discussion of 
stem and leaf diagrams, new material on residues in regression, Latin square designs, Bartlett’s 
test for equality of several variances, and p values. Presupposes calculus. (First Edition, TR, 
August~September 1972; Second Edition, TR, December 1978.) FLW 


Statistics, T(16: 1), S. Elements of Simulation. Byron J.T. Morgan. Chapman & Hall, 1984, xiii + 
351 pp, $18.95 (P). [ISBN: 0+412-24590-6] Beginning with a review of basic probability theory 
results, this book continues with discussion of generating specific random variables, including uni- 
form, normal, exponential, binomial and Poisson random variables. It also includes testing of the 
randomness of random numbers, variance reduction, and model construction. MT 


Statistics, T(16: 1), $. Advanced Methods of Data Exploration and Modelling. B.S. Everitt, G. Dunn. 
Heinemann Educ Books, 1983, xix + 253 pp, $20. [ISBN: 0-435~82294-2] Intended as an intermediate 
text for those interested in analysis of social or behavioral data. Assumes a first course in 
statistics and some familiarity with matrix algebra. Includes plotting techniques, multivariate 
techniques, discussion of the general linear model and latent variable models. MT 


Statistice, T77(13: 1), S. Statistical Programs in BASIC. Ronald D. Schwartz, David T. Basso. Res- 
ton, 1985, xiv + 208 pp, (P). [ISBN: 0-8359-7106-6] Claimed to be a text, this book is a collection 
of BASIC programs that do calculations for topics which would appear in an introductory statistics 
course. The format of the book (for each technique) is rigid: statement of problem, algorithm, 
BASIC program, example problems. MT 


Statistics, S8(18), P. The Forecasting Accuracy of Major Time Series Methods. S. Makridakis. Wiley, 
1984, viii + 301 pp, $39.95. [ISBN: 0~471-90327-2] Assesses accuracy of forecasting methods based on 
an empirical study. The study was a forecasting competition in which participants were asked to 
forecast up to eighteen time horizons ahead. The book concludes that there are no methods that are 
clearly best and that accuracy depends on the type of data and the forecasting situation considered. 
MT 


Statistics, P. Statistical Methods for Cancer Studies. Ed: Richard G. Cornell. Statistics, V. 5l. 
Dekker, 1984, x + 479 pp, $59.50. [ISBN: 0-8247~-7169-9] The specialized nature of studies of cancer 
has led to development of unique statistical methods. This is a collection of articles describing 


some of these methods. MT 


Computer Literacy, $(13-16), P. Microcomputers and their Commercial Applications. D.E. Avison. 
Comp. Sci. Texts. Computer Science Pr, 1983, viii + 94 pp, $9.95 (P). [ISBN: 0-632-01172-6] A sub~ 
stantial, though non-theoretical, overview of the issues involved in "What”s it good for?" and “How 
do I decide?" Although it is a bit specialized for a computer literacy book, it has much more sub- 
stance than most of the computer literacy genre. However, it is written as a reference volume 
rather than as a text. JAS 


Computer Literacy. First Book on UNIX for Executives. Yukari Shirota, Tosiyasu L. Kunii. 
Springer-Verlag, 1984, xi + 154 pp, $16 (P). [ISBN: 0-387-70003-X] A strange mixture of hype and 
cartoons, sure to insult the intended audience. This overview of UNIX may whet a junior high 


reader’s appetite, but the bill of fare is junk food of no lasting value. LAS 


Computer Programming, T(14-16), S, L. Lisp Programming. I. Danicic. Comp. Sci. Texts. Computer 
Science Pr, 1983, ix + 96 pp, $11.95 (P). [ISBN: 0-632-01181-5] A serious study of the syntax of 
LISP focusing on LISP 1.5. This is not a tutorial, but rather a presentation and explanation of the 
structure and functions of LISP. JAS 


Computer Programming, $(14-15), L. CG: An Advanced Introduction. Narain Gehani. Computer Science 
Pr, 1985, xv + 332 pp, $29.95. LISBN: 0-88175-053-0] An excellent introduction to the C programming 
language for experienced programmers. Compared to more traditional introductory texts, this book 
places greater emphasis on advanced topics such as data abstraction, exception handling, and con- 
current programming. AQ 


Computer Programming, 1T(13: 1). Prelude to Programming: Problem Solving and Algorithms. William 
Mitchell. Reston, 1984, ix + 173 pp, $22.95. [ISBN: 0-8359-5614-8] Problem solving strategies and 
programming methodologies for would~be programmers. An introductory survey of useful techniques: 
graphical representations of algorithms (flowcharts and Nassi-Shneiderman diagrams), algorithm 
verification techniques, and efficiency measures. AO 
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Computer Programming, S(13-14). Learning to Program in €C. Thomas Plum. Prentice-Hall, 1983, vi + 
349 pp, $32.95. [ISBN: 0-13-527854-6] An introductory textbook on the C programming language for 
novice programmers. Emphasizes techniques that lead to readable, portable code. Not a complete 
guide to the language. AO 


Computer Programming, $(13-14). The C Puzzle Book. Alan R. Feuer. Prentice-Hall, 1982, viii + 171 
pp, $18.95. [ISBN: 0-13-109934-5] A collection of exercises designed to test the reader’s mastery of 
C language semantics. Useful as a supplement to any textbook on the language. AO 


Computer Programming, T(14~15), S. IBM-PC 8088 MACRO Assembler Programming. Dan Rollins. Macmil- 
lan, 1985, xxiii + 435 pp, (P). [ISBN: 0-02-403210-7] The 8088 processor is the microprocessor 
inside the IBM-PC. For this reason, it is one of the most popular microprocessors in use today. 
The text explains the design and architecture of the 8088, as well as how to program in 8088 assen- 
bly code. Before reading this, an individual should already know how to program in one or more 


high-level languages. MS 


Data Structures, T(16-17: 1-3), L. Data Structures and Algorithms. Kurt Mehlhorn. EATCS Mono. on 
Theoretical Comp. Sci. Springer-Verlag, 1984, $17.50 each. 1: Sorting and Searching, xiv + 336 pp 
[ISBN: 0-387-13302-X]; 2: Graph Algorithms and NP-Completeness, xii + 260 pp [ISBN: 0-387-13641-X]; 
3: Multi-dimensional Searching and Computational Geometry, xii + 284 pp. [ISBN: 0-387-13642-8] 
State-of-the-art algorithms are presented and analyzed. Volumes 2 and 3 assume familiarity with 
some material from Volume 1, but are independent of each other. AO 


Software Systems, S(15-18), P. IK!Solver for Engineers. Victor E. Wright. Reston, 1984, xii + 419 
pp, (BP). [ISBN: 0-8359~7711-0] TK!Solver is a program designed to let a user communicate with a com- 
puter, without programming, in terms familiar to an engineer. This book is a user’s guide for 
engineers who want to use a computer as a tool, who may not know much about computers, and who would 
appreciate a thorough guide to the non-trivial tool (program) TK!Solver. JAS 


Software Systems, S**(13~16), L. Unix for Users. Chris Miller, Roger Boyle. Comp. Sci. Texts. 
Computer Science Pr, 1984, x + 210 pp, $12.95 (P). [ISBN: 0-632-01182-3] An introduction to UNIX 
Version 7, intended to be sufficiently generic to be of use with most common versions of UNIX and 
even UNIX look-alikes. Well written; starts at the beginning and covers a lot of ground. The 
emphasis is on explaining the nature of UNIX and its standard utilities; details are to be found in 


the UNIX Programmer”’s Manual. JAS 


Computer Science, S(16), P. Frontiers in Computer Graphics. Ed: Tosiyasu L. Kunii. Springer- 
Verlag, 1985, xi + 443 pp, $45. [ISBN: 0-387-70004-8] The 30 papers in this text were presented at 
the Computer Graphics Conference in Tokyo, Japan, April 1984, The text is divided into 8 sections 
with 2-5 papers in each section: geometric modelling, graphics languages, visualization techniques, 
human factors issue, interactive design, CAD/CAM, display hardware, and standardization issues. In 
addition to the papers, there are over 250 full-color photographs. MS 


Computer Science, P. Algorithm Design for Computer System Design. Ed: G. Ausiello, M. Lucertini, P. 
Serafini. Springer-Verlag, 1984, 236 pp, $18.30 (P). [ISBN: 0-387-81816-2] A collection of nine 
papers from the July 1983 summer school sponsored by the International Center for Mechanical Sci- 
ences. These papers discuss “Combinatorial Problems in Computer System Design" and the "Optimal 
Design of Parallel Computing Systems." AO 


Computer Science, T(16-18: 1), P, L*. Introduction to Logic Programming. Christopher John Hogger. 
APIC Stud. in Data Proc., No. 21. Academic Pr, 1984, xiii + 278 pp, $46. [ISBN: 0~-12-352090-8] A 
comprehensive introduction to the use of symbolic logic as a programming language. In addition to 
providing an introduction to logic programming, it also discusses the derivation of logic programs 
from logic specifications and implementation of the logic programming language PROLOG. AO 


Computer Science, $(14), L*®. Modula~2 for Pascal Programmers. Richard Gleaves. Springer-Verlag, 
1984, x + 145 pp, $16.95 (P). [ISBN: 0-387-96051-1] A brief introduction to the programming language 
Modula-2 for persons already familiar with Pascal. Although features unique to Modula-2 are 
included, the most useful part of the book is the detailed description of the differences from Pas- 
cal. The book does not provide a complete introduction to Modula-2. AO 


Third Edition. Kathleen Jensen, Niklaus Wirth. Springer-Verlag, 1985, xvi + 266 pp, $14 (P). 
[ISBN: 0~-387-96048~-1] The original text has been modified to reflect the ISO Pascal standard. In 
addition, Wirth’s EBNF syntactic notation is now used and several of the example programs in the 
user’s manual have been improved. An appendix summarizes the changes made in the report (Second 
Edition, TR, December 1976). AO 


Computer Science, P. Lecture Notes in Computer Science-182: STACS 85. Ed: K. Mehlhorn. Springer- 
Verlag, 1985, vii + 374 pp, $16 (P). [ISBN: 0-387-13912-5] The 37 papers in this text were presented 
at the Second Annual Symposium on Theoretical Computer Science held in Saarbrucken, Germany, January 
1985. These papers address a wide range of research issues in theoretical computer science, includ- 
ing computability, formal language theory, coding theory, tree automata, and formal semantics. The 
papers are all at a very advanced level of presentation. MS 


Computer Science, T(15-17: 1). A First Course in Formal Language Theory. V.J. Rayward-Smith. Comp. 
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Sci. Texts. Computer Science Pr, 1983, xiii + 123 pp, $14.95 (P). [ISBN: 0-632-01176-9] A short and 
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to-the-point presentation of regular languages, context-free languages, automata, and parsing. 
Index (short), chapter exercises, no bibliography. Enjoyable to read and mostly language 
independent--uses, late in the book, a Pascal-like representation of a few algorithms and only a few 
problems ask for a program in Pascal. JAS 


Computer Science, S(16-17), P, L*. The ADA Programming Language: A Tutorial. Sabina H. Saib, Robert 
EK. Fritz. IEEE Computer Society, 1983, ix + 538 pp, $36 (P). [ISBN: 0-8186-0456-5] An anthology 
containing most of the major early papers on the Ada programming language together with several 
tutorial papers that describe particular aspects of the language. AO 


Computer Science, T(16-18: 1), L. The Theory of Relational Databases. David Maier. Computer Sci- 
ence Pr, 1983, xv + 637 pp, $28.95. [ISBN: 0-914894-42-0] A textbook for a second course on data- 
bases. Designed to bring the reader almost to the frontier of current research on the subject. AO 


Applications, $(17-18), P. Mathematische Grundlagen der Kartographie: 2., iiberabeitete und 
erweiterte Auflage. Josef Hoschek. Bibliographisches Inst, 1984, 210 pp, (P). [ISBN: 3-411-01661- 
2] Treats the most important types of map projection, and the mathematics behind them. JD-B 


Applications (Artificial Intelligence), T(16-17: 1), S, P, L. Building Expert Systems. Ed: Freder- 
ick Hayes-Roth, Donald A. Waterman, Douglas B. Lenat. Addison-Wesley, 1983, xvi + 444 pp, $34.95. 
[ISBN: 0-201-10686-8] A collection of ten articles covering the design, construction, and evaluation 
of expert systems. A good, not too technical, introduction to the subject. AO 


Applications (Artificial Intelligence), P. Robotics and Artificial Intelligence. Ed: Michael Brady, 
Lester A. Gerhardt, Harold F. Davidson. NATO ASI Ser. F, V. ll. Springer-Verlag, 1984, xvii + 693 
pp, $62.50. [ISBN: 0-387-12888-3] A collection of invited papers mostly from the NATO Advanced Study 
Institute held in June 1983 at Castelvecchio Pascoli, Italy. JAS 


Applications (Economics), P. Lectures on Schumpeterian Economics. Ed: Christian Seidl. Springer- 
Verlag, 1984, ix + 219 pp, $21.50. [ISBN: 0-387-13290-2] Thirteen lectures on Schumpeter and his 
economic theories; nine given during Schumpeter centenary celebration at the University of Graz, 
Austria in 1983. Contributors include six Austrian economists and Peter Hammond of Stanford. KS 


Information Analysis of Simultaneous Equation Models Using Integration by Monte Carlo. Luc Bauwens. 
Springer-Verlag, 1984, vi + 114 pp, $8.50 (P). [ISBN: 0-387-13384-4] A report on the use of impor- 
tance sampling (a Monte Carlo integration technique) for Bayesian analysis of simultaneous equation 
models. AO 


Applications (Engineering), P. Annual Review of Fluid Mechanics, Volume 17, 1985. Ed: Milton Van 


Dyke, J.V. Wehausen, John L. Lumley. Annual Reviews, 1985, 630 pp, $28. [ISBN: 0-8243-0717-8] 


Applications (Engineering), T(14-15: 1), L. Theory and Applications of Linear Differential and 
Difference Equations: A Systems Approach in Engineering. R.M. Johnson. Ser. in Math. & its Applic. 
Halsted Pr, 1984, viii + 183 pp, $29.95. [ISBN: 0-470-20106-1] Uses Laplace and z-transforms to 
study (via transfer functions, block diagrams, frequency domain analysis) the similarities between 
linear differential and difference equations. Examples from many fields of engineering. A final 
chapter studies digital filters. DFA 


Applications (Engineering), S(17), P. Topics in Boundary Element Research, Volume 1: Basic Princi- 
ples and Applications. Ed: C.A. Brebbia. Springer-Verlag, 1984, xiii + 256 pp, $49.50. [ISBN: 0- 
387-13097-7] Each of the ten chapters in this work describes a new application of boundary element 
methods. The chapters are self-contained and authored by different scientists who are actively 
involved in boundary element research. CEC 


Applications (Engineering), T(16-17: 4), P. Mass Transfer: Fundamentals and Applications. Anthony 
L. Hines, Robert N. Maddox. Prentice-Hall, 1985, xiii + 542 pp, $37.95. [ISBN: 0-13-559609-2] Aimed 
at chemical engineering students. Good selection of problems in each chapter. First half on 
microscopic diffusional processes and prediction and use of transport coefficients. Second half on 
macroscopic separation processes, including absorption, distillation, and extraction. Useful refer- 
ence with numerous graphs and tables and references at end of each chapter. BH 


Applications (Engineering), P. Lecture Notes in Engineering-7: Boundary Integral Equation, Analyses 
of Singular, Potential, and Biharmmonic Problems. Derek B. Ingham, Mark A. Kelmanson. Springer- 
Verlag, 1984, iv + 173 pp, $12.50 (P). [ISBN: 0-387-13646-0] Extensions of the boundary integral 
equation method to certain cases involving highly non-linear boundary conditions and governing 


elliptic equations. JAS 


Applications (Physics), P. Mathematical Physics Reviews, Volume 4 (1984). Ed: S.P. Novikov. 
Transl: Morton Hamermesh. Harwood Academic, 1984, ix + 280 pp, $99.50. [ISBN: 3-7186-0146-x] 
English translations of four Russian papers in mathematical physics. Topics include the theory of 
"typical" singularities of functions and mappings, applications of Lie algebra methods to the clas- 
sification of solutions of the Yang-Baxter equation, Hamiltonian formalism in the hydrodynamics of 
ideal fluids and the classification of integrable one-dimensional systems of certain types, which 
generalize the Korteweg-de Vries and Burgers equations. BH 
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Applications (Physics), P. An Approach to the Foundations of Quantum Mechanics, Using the Concept of 
A Filter as Primitive. Hans Kummer. Papers in Pure & Appl Math., No. 70. Queen’s U, 1984, 160 pp, 
(P). Investigates use of the concept of an operation (or filter) as a primitive concept in an 
axiomatic theory. AO 


Applications (Physics), $(18), P. Vertex Operators in Mathematics and Physics. Ed: J. Lepowsky, S. 
Mandelstam, I.M. Singer. Math. Sci. Res. Inst., V. 3. Springer-Verlag, 1985, xiv + 482 pp, $29.80. 
[ISBN: 0-387-96121-6] These proceedings of a 1983 research conference explore the connections 
between the affine Kac~Moody Lie algebras and the dual-string theory of particle physics through the 
use of formal differential operators called vertex operators. A poor attempt is made to provide an 
introduction suitable for the nonexpert reader. MU 


Applications (Physics), P. Lecture Notes in Physics-2l1: Resonances--Models and Phenomena. Ed: S. 
Albeverio, L.S. Ferreira, L. Streit. Springer-Verlag, 1984, vi + 359 pp, $18 (P). [ISBN: 0-387- 
13880-3] Proceedings of a symposium held at the University of Bielefeld in July 1984. Papers 
present recent research on the modeling of resonance phenomena. AO 


Applications (Physics), S(18), P. A Concept of Mathematical Physics: Models for Space-Time. Tamas 
Matolcsi. Akademiai Kiado, 1984, 236 pp, $26. [ISBN: 963-05-3245-X] Part one describes mathematical 
models of space-time, nonrelativistic and relativistic (both special and general) using an axiomatic 
approach. Part two presents the mathematical tools required for part one: tensors, pseudo-Euclidean 
spaces, affine spaces, smooth manifolds, and Lie groups. BH 


Applications (Physics), S(16-17), L. Mathematical Analysis of Physical Problems. Philip R. Wallace. 
Dover, 1984, xix + 616 pp, $11.95 (P). [ISBN: 0-486-64676-9] Unabridged and corrected republication 
of the work originally published by Holt, Rinehart and Winston in 1972 (TR, November 1973). For the 
advanced undergraduate or beginning graduate student. From the vibrating string to the quantum 
mechanics of the many-body problem. Well-organized, useful as a reference, at a price that is hard 
to beat. Problems, but no solutions. JK 


Applications (Physics), P. Quantum Theory of Particles and Fields: Birthday volume dedicated to Jan 
Lopuszafiski. Ed: Bernard Jancewicz, Jerzy Lukierski. World Scientific (US Dist: Heyden & Son), 
1983, xxvi + 262 pp, $33. [ISBN: 9971-950-77~4] Collection of papers on the quantum theory of par- 
ticles and fields. BH 


Applications (Physics), P. Stochastic Methods and Computer Techniques in Quantum Dynamics. Ed: H. 
Mitter, L. Pittner. Springer-Verlag, 1984, vii + 452 pp, DM 122. [ISBN: 0-387-81835-9] Proceedings 
of the 23rd Internationale Universitatswochen fur Kernphysik 1984 der Karl-Franzens-Universitat Graz 
held at Schladming (Steiermark, Austria), February 20 to March J], 1984. JAS 


Reviewers 


DFA: David F. Appleyard, Carleton; RB: Richard Brown, Carleton; JNC: Judith N. Cederberg, St. Olaf; 
BC: Barry Cipra, St. Olaf; CEC: Clifton E. Corzatt, St. Olaf; RD: Roger Day, St. Olaf; JD-B: John 
Dyer-Bennet, Carleton; SG: Steven Galovich, Carleton; BH: Bruce Hanson, St. Olaf; PH: Paul Humke, 
St. Qlaf; KK: Kenneth Kaminsky, St. Olaf; RSK: Richard S. Kleber, St. Olaf; JK: Joseph Konhauser, 
Macalester; LCL: Loren C. Larson, St. Olaf; AM: Alan Magnuson, St. Olaf; RM: Richard Molnar, 
Macalester; RWN: Richard W. Nau, Carleton; LN: Linda Ness, Carleton; AO: Arnold Ostebee, St. Olaf; 
MS: Michael Schneider, Macalester; JS: John Schue, Macalester; SS: Seymour Schuster, Carleton; JAS: 
J. Arthur Seebach, Jr., St. Olaf; KS: Kay Smith, St. Olaf; LAS: Lynn Arthur Steen, St. Olaf; MT: 
Michael Tveite, St. Olaf; MU: Milton Ulmer, Carleton; TAV: Theodore A. Vessey, St. Olaf; MW: Martha 
Wallace, St. Olaf; FLW: Frank L. Wolf, Carleton; PZ: Paul Zorn, St. Olaf. 


Section Reports 
An asterisk (*) by the title of a paper indicates that copies of the paper are available from 
the author. Papers presented under special sponsorship as part of joint meetings are so noted in 
parentheses. 


Pacific Northwest Section 


The annual meeting of the Pacific Northwest Section was held at Williamette University, Salen, 
Oregon on June 20-22, 1985. Approximately 70 persons attended the meeting. 


Invited Lectures: 
"The Undergraduate Curriculum," by Donald W. Bushaw, Washington State University. 


"Pictures and Stories from George Polya’s Photo Album," by Gerald L. Alexanderson, University of 
Santa Clara. 


C108 Center Section December 


"Graphical Data Analysis in More Than Two Dimensions," by Wesley L. Nicholson, Battelle- 
Northwest. 


Short. Presentations: 


"On Certain Generalized Circular Transforms," by Stanley D. Luke, Seattle Pacific University. 

"Multiplicative Magic Squares," by Ellis R. Von Eschen, Suffolk Community College. 

"Chains in Finite Boolean Lattices," by Roger B. Nelson and Harvey Schmidt, Jr., Lewis and Clark 
College. 

"Finite Calculus," by Rudolf Festa, Anchorage, Alaska. 


Mini~Courses: 


"Teaching Problem Solving," by Alan H. Schoenfeld, University of California, Berkeley. 
"Geometry for College Teachers," by Branko Grunbaum, University of Washington, 


Panel Discussion: 


"Teaching Discrete Mathematics,” by Willy Brandal, University of Idaho; Bryan Dorner, Pacific 
Lutheran University; Steve Prothero, Williamette University; Thomas Reimer, Lane Community Col- 
lege; Charles R.B. Wright, University of Oregon. 


Intermountain Section 


The annual spring meeting of the Intermountain Section was held April 19-20, 1985 at Weber 
State College, Ogden, Utah. There were 81 persons in attendance. 


Invited Addresses: 


"The MAA Precollegiate Mathematics Contest," by Stephen B. Maurer, Swarthmore College. 
“What is Discrete Algorithmic Mathematics?" by Stephen B. Maurer, Swarthmore College. 


Short Presentations: 


"The Family Nim," by Paul Yearout, Brigham Young University. 

"Mathematical Problems Associated with NUSAT," by Paul Talaga, Weber State College. 

"Mathematics Education in Russia," by Jim Cotts, Southern Utah State College. 

"Permutations Without Invariant Elements," by Charles Cunkle, Emeritus, Slippery Rock. 

"Pivot Point Graphing of Some Common Families of Functions," by John Thaeler, Weber State College. 

"Bases for the Space of Invariants of a Class of Transformations," by L. Kirk Tolman, Brigham 
Young University. 

"Romberg Integration for Unequal Spacing," by Donald R. Snow, Brigham Young University. 

"How To Shoot Craps If You Must," by S.N. Ethier, University of Utah. 

"Some Observations of Pairwise Votes," by Lawrence C. Ford, Idaho State University. 

"Cryptography: An Applied Avenue to Topics in Pure Mathematics," by Lee Badger, Weber State Col- 
lege. 

"The Bisection Method is Almost Never Linear," by Frank Richards, Weber State College. 

"Fitting a Rational Function to a Finite Set of Points," by Jay Huber, Ricks College. 


Panel Discussions: 


"Certification in Mathematics," by Patricia Henry, Weber State College; Donald H. Tucker, Univer- 
sity of Utah; Steven H. Heath, Southern Utah State College. 

"Discrete Mathematics," by Lee Badger, Weber State College; Robert Girse, Idaho State University; 
LeRoy Beasley, Utah State University; John Higgins, Brigham Young University; Keith Reed, 
University of Utah. 


Student Papers: 


"A Model of a Problem in Genetics: Mutants Under Stress," by Kevin Black, Brigham Young Univer- 
sity. 

"A Model for Growth in Endothermie Animals," by Tim Thurston, Weber State College. 

"A Model for the Growth of the Spruce Bud Worm,” by Brian Heath, Weber State College. 

"A Model for the Car Following Problem," by Pete Jahsman, Weber State College. 

"Satellite Attitude Approximation Using Light Sensor Data," by Lance M. Moss, Weber State Col- 
lege. 

"Simplifying JK Flip-Flops with the Boolean Algebra of Order Four," by Donald John Nicholson, 
University of Utah. 

"Modeling Reserves for Group Health Plans," by Robert L. Bartholomew, Brigham Young University. 

"Homomorphisms of Graphs and their Related Contractions," by Rick Gillman, Idaho State University. 

"Applications of the Word Problem to Public Key Cryptography," by Jeff Salt, Weber State College. 
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AAAS. AAAS Report X: Research & Develop- 
ment, FY 1986. C65. 

Abbott, Robert D.; See Lunneborg, Clifford 
E. 

Abdel~Hameed, Mohamed S.; Ginlar, Erhan}; 
Quinn, Joseph (Eds.). Reliability 
Theory and Models: Stochastic Failure 
Models, Optimal Maintenance Policies, 
Life Testing, and Structures. C43. 

Abel, Peter. Programming Assembler 
Language, Second Edition. C8. 

Abelson, Harold; Sussman, Gerald Jay; Suss- 
man, Julie. Structure and Interpreta- 
tion of Computer Programs, C18. 

Accardi, L.; Frigerio, A.; Gorini, V. 
(Eds.) Lecture Notes in Mathematics- 
1055. C36. 

Aczél, J. (Ed.) Functional Equations: His- 
tory, Applications and Theory. C5. 

Ageloff, Roy; Mojena, Richard. Applied 
Structured BASIC. C92. 

Agresti, Alan. Analysis of Ordinal 
Categorical Data. C42. 

Aickin, Mikel. Linear Statistical Analysis 
of Discrete Data. C7. 

Akers, Lynn R.; See Ewen, Dale. 

Albers, Donald J.; Alexanderson, G.L. 
(Eds.) Mathematical People: Profiles and 
Interviews. C45. 

Albert, C.3; Molino, P. Pseudogroupes de 
Lie transitifs, I: Structures princi- 
pales. C78. 

Albeverio, S.; Ferreira, L.S.; Streit, L. 
(Eds.) Lecture Notes in Physics~211. 
C107. 

Albu, Toma; N&stHsescu, Constantin. Rela- 
tive Finiteness in Module Theory. C15. 

Alexanderson, G.L.; See Albers, Donald J. 

Amar, E.; Gay, R.; Van Nguyen, Thanh 


(Eds.). Lecture Notes in Mathematics- 
1094, C99. 
AMS. American Mathematical Society Trans- 


lations, Series 2, Volume 121. C2, 

Anderson, Virgil L.3 See McLean, Robert A. 

Anton, Howard; See Rorres, Chris. 

renee » Elementary Linear Algebra, Fourth 
Edition. C54. 

Apostol, Constantin. Approximation of Hil- 
bert Space Operators, Volume II. C5. 
ApSimon, Hugh. Mathematical Byways in Ayl- 

ing, Beeling, and Ceiling. C37. 

Arbarello, E. Geometry of Algebraic 
Curves, Volume I. C79. 

Arganbright, Deane. Mathematical Applica- 
tions of Electronic Spreadsheets. C25. 

Arnold, V.I.; Guesin-Zade, &.M.; Varchenko, 
A.N. Singularities of Differentiable 
Maps, V. I: The Classification of Criti- 
cal Points, Caustics and Wave Fronts. 
C101. 

Aroian, Leo A.; See Oksoy, Dolun. 

Arveson, William. Ten Lectures on Operator 
Algebras. C39. 

Arya, Jagdish C.; Lardner, Robin W. 
Mathematical Analysis for Business and 
Economics, Second Edition. C99. 

Ashby, F. Gregory; See Townsend, James T. 

Attouch, H. Variational Convergence for 
Functions and Operators. C102. 

Aubin, Jean-Pierre; Cellina, Arrigo. Dif- 
ferential Inclusions: Set-Valued Maps 
and Viability Theory. C43. 

~----; Ekeland, Ivar. Applied Nonlinear 
Analysis. Cié. 

-----. Explicit Methods of Optimization. 
C31. 

Aulbach, Bernd. Lecture Notes in 
Mathematics-1058. C29. 

Ausiello, G.; Lucertini, M.; Serafini, P. 
(Eds.) Algorithm Design for Computer 
System Design. C105. 

Avison, D.E. Microcomputers and their Com- 
mercial Applications. C104. 

Avula, Xavier J.R. (Ed.) Mathematical 
Modelling in Science and Technology. 
c2l. 

Axelrad, E.L.3; Emmerling, F.A. (Eds.) Flex- 
ible Shells: Theory and Applications. 
C44. 

Axelsson, 0.; Barker, V.A. Finite Element 
Solution of Boundary Value Problems: 
Theory and Computation. C100. 

Azéma, J.; Yor, M. (Eds.) Lecture Notes in 
Mathematics-1059. C32. 

Babbage, Henry Prevost. Babbage’s Calcu- 
lating Engines: A Collection of Papers 
by Henry Prevost Babbage. C26, 
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Badescu, L.3; Popescu, D. (Eds.) Lecture 
Notes in Mathematics-1056. C67. 

Baglivo, Jenny A.; Graver, Jack E. 
Incidence and Symmetry in Design and 
Architecture. Cll. 

Baiocchi, Claudio; Capelo, Anténio. Varia- 
tional and Quasivariational Inequali- 
ties: Applications to Free Boundary 
Problems. C102. 

Bak, A. (Ed.) Lecture Notes in 
Mathematice-1046. C27. 

Balakrishnan, A.V. Kalman Filtering 
Theory. Cll. 

Bamberg, Gunter; Spremann, Klaus (Eds.). 
Lecture Notes in Economics and Mathemat- 
ical Systems-227, C35. 

Banichuk, N.V. Problems and Methods of 
Optimal Structural Design. C12. 

Barbu, V. Optimal Control of Variational 
Inequalities. C20. 

Barker, V.A.3; See Axelsson, 0. 

Barker, William H.; Ward, James E. The 
Calculus Companion to Accompany Cal- 
culus, Second Edition, by Howard Anton. 
C78. 

Barnett, Raymond A. Functions and Graphs: 
A Precalculus Course. C97. 

Barnett, S$. Matrices in Control Theory, 
Revised Edition. C47. 

Barr, Michael; Wells, Charles. 
Triples and Theories. C98. 

Barth, W.; Peters, C.; Van de Ven, A. 
pact Complex Surfaces. C66. 

Bass, Hyman; See Morgan, John W. 

Basso, David T.; See Schwartz, Ronald D. 

Bateson, Robert M.; Raygor, Robin D.; Bitz, 
Gregory. BASIC Programming for the 
Apple Computer. C92, 

Baumgartner, James E.; Martin, Donald A.;3 
Shelah, Saharon (Eds.). Axiomatic Set 
Theory. C26. 

Bauwens, Luc. Lecture Notes in Economics 
and Mathematical Systems~232. C106. 

Beauchamp, Kenneth G. (Ed.) Information 
Technology and the Computer Network. 
C34. 

cro » Applications of Walsh and Related 
Functions: With an Introduction to 
Sequency Theory. (C92. 

Beaumont, David; Summerfield, Anne; Wright, 
Julie. Working with RT-11. C10. 

Beauzany, B.; Lapresté, J.-T. Modéles 
&talés des espaces de Banach. C9l, 

Becker, J.; Eisele, I. (Eds.) Lecture Notes 
in Physics: WOPPLOT 83: Parallel Pro- 
cessing: Logic, Organization, and Tech- 
nology. C70. 

Becker, Richard A.; Chambers, John M. S: 
An Interactive Environment for Data 
Analysis and Graphics. C17. 

Beekman, John A.; See Keyfitz, Nathan. 

Beineke, Lowell W.; Wilson, Robin J. (Eds.) 
Selected Topics in Graph Theory 2. C26. 

Bekif, H. Lecture Notes in Computer 
Science-177. C55. 

Bell, James F. Mechanics of Solids, Volume 
I. C21. 

Bellman, Richard. 
Medicine. C35. 

Benson, David. Lecture Notes in 
Mathematics-1081. C66. 

Bensoussan, Alain; Lions, Jacques~Louis. 
Impulse Control and Quasi-Variational 
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LEMMA. Nondistinct solutions of S = Circ(1,1,1,0,...,0)X result (i) if two consecutive elements 
in S are equal; or (ii) if S contains five consecutive elements of the form abcba. 


Proof. The equation x, + X41 + X42 = X,41 + X,42 + X,43 implies x, = x,,3, confirming 
(i). To confirm (ii), suppose, without loss of generality, that 


Xp tX_p +X = A,X, +X, +X, =D, XG+t xX, + XG = b, XS + XG + XZ =A. 
These equations imply x, — x, =a— b= x ,— X4,80 X, = Xp. 


The key question regarding the flim-flam game of Example (4) is: What can a player’s secret 
number possibly be? Obviously it can’t be, say, 14 because everyone has a 12 on their clockface 
and, next to it, at least 1 and 2. The bidding should begin at 16. The bids 17,18,19,20,... pass 
uneventfully. 

Let o denote any secret number. To obtain a lower bound on o we compute 5, the average 
value of the triple-sums s;: 


Thus, o > 20 > 5s. In actuality, o can never equal 20. Twelve integers, the largest of them 20, 
average out to 19.5 only if at least 6 of them equal 20. If exactly 6 of the s, equal 20, then the 
others must all be 19. In that case, either two consecutive s, are the same, or else the 20’s and 19’s 
alternate. Both possibilities are violations, in view of the above Lemma. If more than 6 of the s, 
equal 20, we have the same violation because (the pigeonhole principle!) at least two consecutive 
entries in S must be 20. 

Thus, o > 21. Meanwhile, my accomplice is ready with a circular permutation for which o 
attains its lower bound. It is 1, 8,10, 3,5, 9, 4, 6, 11, 2, 7,12. 

The only remaining question: What is the probability that someone in the audience has also 
stumbled onto a circular permutation of 1,2,...,12 for which o = 21? Frankly, I don’t know the 
probability, but I believe that it is very small. After all, my accomplice and I have never lost a 
game. 


Acknowledgment. I wish to thank Professors P. J. Davis and O. Shisha for helpful discussions. 
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an anomaly or are there more such? Specifically, let f, denote the function whose value at the 
positive integer n is 


f,(n) = 1 + 2% +--+ +n*. 
We ask for a list of all identities of the form 


r AY 
(1) I fis = LT fas) 
Z jc 


where the exponents are positive integers and the subscripts are distinct nonnegative integers. We 
shall show the only such identity is {7 = f, and powers of this equation. 
It is well known that f, is a polynomial of degree k + 1. In fact, 
1 ‘SA(k+1 , 
fi(a-Y=7aG » ( j | Boson! 


j=l 


where the { B,} are the Bernoulli numbers. A tidy way of defining these ubiquitous numbers is via 
the identity 


x _ 7 
e 1 2 # 


so that B, is the value of the ith derivative of x/(e* — 1) at x = 0. See [5], pp. 230-246, for 


proofs and a complete discussion. 
Suppose that (1) holds. Without loss of generality we may assume that 


h(1) < h(2) < --- <h(r) and k(1) <k(2)< --- <k(s); 


moreover, that h(r) < k(s). Let us first examine the case in which k(s), the largest of the 
subscripts, is at most 3. Then the identity in (1) may involve only the polynomials 


fo(n) =n, f,(n) =5n(n +0), 


f,(n) = an(n +1)(2n+1), f,(n) = Zn(n + 1)’. 


Because f,(n) has the irreducible factor 2n + 1 which appears in none of the other three 
polynomials, f,(7) is not involved in the identity. By examining the exponent on the factor n + 1, 
we see that the only possible identity is f°“ = f¥ for some positive integer a. 

Now suppose that k(s) > 3. By evaluating the polynomial identity in (1) at n = 2 we obtain 
the integer equation 


r S 
(2) [1] [1 + 27©]°@ = T] [a + 24]. 
i=l j=l 


From a theorem of Bang [2], there is a prime divisor p of 1 + 2%‘) such that 2 belongs to the 
exponent 2k(s) in the integers modulo p. Because h(i) < k(s) for each i, we see that p cannot 
be a divisor of the left-hand side in (2). Thus there can be no identity of the form in (1) with 
k(s) > 3. This completes the proof of our theorem. 


The result of Bang é0 which we refer deals with “primitive” prime factors of expressions of the 
form a” — 1 (we were concerned above with 2°*“) — 1). This result was later generalized by 
Zsigmondy [9] to expressions of the form a” — b”: 


ZSIGMONDY’S THEOREM. If a, b, and n are integers with a > b > 0, gcd(a,b) = 1, andn > 2, 
then there is a prime divisor p of a" — b” such that p is not a divisor of a“ — b* for any integer k 
with 1 < k <n, except for the caseea = 2, b=1, n= 6. 
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Both Bang’s theorem and Zsigmondy’s theorem have been rediscovered many times in the last 
century. A partial list of references is given in [3], p. 361. It should also be noted that Zsigmondy’s 
theorem has itself been generalized to algebraic number fields—a special case of this situation 
implies that Fibonacci numbers have primitive prime factors. A recent reference on generaliza- 
tions of Zsigmondy’s theorem is Stewart [7], from which earlier references may be tracked down. 

Using Zsigmondy’s theorem, we can handle the following generalization of (1). If c,d > 1, 
k,n > 0 are integers, let f(c,d,k) be the function whose value at n is 


n—-1 
(3) f(e,d,kyn) = Y (c+ di)”. 
i=0 
Thus f(1,1,k) = f,. For a fixed pair c,d we can ask if there are any multiplicative identities 
r S 
\\ ali \\ bC/) 
(4) TT f(e.d.a()y = Tea kK), 
i=l j= 


where the {h(i)} and {k(/)} are distinct nonnegative integers. We leave it to the reader to verify 
the pleasant exercise that the only solutions of (4) are powers of the equations 


f(1,1,1)° = f(1,1,3), 


and 


f(1,2,0)° = f(1,2,1). 


Note that this result is a grand generalization of MONTHLY Problem E 2951 [8], which asked 
for a catalog of all equations 


f(1,2,h)* =f(1,2,k)’, 


and also the work of Edmonds [4] and Allison [1], who considered identities of the form 


f(i,1,h)* = f(1,1,k)”. 
This latter equation was also considered in MONTHLY Problem E 2136 [6]. 


We take this opportunity to thank Paul Bateman, Bruce Berndt, H. W. Gould, and William J. 
LeVeque for their assistance in locating references concerning the equation in (1). In light of the 
many rediscoveries of the Bang-Zsigmondy result, it would be ironic if our results here had been 
anticipated. In such an event, which is perhaps not unlikely, the responsibility is of course solely 
ours. 
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Who is Louis de Branges? (See p. 742.) 
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742 PROBLEMS AND SOLUTIONS 


Consequently H(x) must be negative when x is sufficiently large, and this is a contradiction. 


Hence L = § and } is not taken on by b. 


Also solved by Nicholas Passell, Douglas B. Tyler, J. Vukmirovi¢ (Yugoslavia), and the proposer. 


On Areas of Triangles 


6466 [1984, 441]. Proposed by I. J. Schoenberg, University of Wisconsin-Madison, from a 
problem seminar of A. Ostrowski (1924). 


Let F be a compact plane set, not contained in a line. Let 
I = sup{ area of triangle ABC: {A,B,C} Cc F}, 
J = inf{ area of triangle ABC: F C closed triangle ABC}. 
Show that 47] > J and that equality can occur. 


Composite Solution. Since F is compact there are points A, B,C in F such that J = area of 
triangle ABC. Let T be the closed triangle formed by the line / through A parallel to BC, the line 
m through B parallel to AC and the line » through C parallel to AB. Then T has area 4/. 
Further F c T, for if X is a point off / (say) on the side away from BC, then triangle XBC has 
area greater than J. Thus J < 4/. 

Equality occurs when F is a circle or when F comprises the vertices of a square. 

Solved by Miroslav D. A&ié (Yugoslavia), Richard Goldstein, John M. Ingram, Jeong-Han Kim, Sr. (Korea), N. 
J. Lord (England), A. McD. Mercer (Canada), O. P. Lossers (The Netherlands), Victor Pambuccian (Romania), Pier 
Ivan Pastro (Italy), Richard E. Pfiefer, John H. Riley, Jr., C. R. Selvaraj, University of South Alabama Problem 
Group, J. Vukmirovi¢ (Yugoslavia), and the proposer. Victor Pambuccian showed by an elaboration of the above 
argument that “compact” can be replaced by “bounded”. 


A Counting Problem 


6468 [1984, 441]. Proposed by Calvin T. Long, Washington State University. 


If a and n are integers with n > 1, prove that L,,,6(d)a”/“ = 0 (mod n). Note that if p isa 
prime, this reduces to Fermat’s Little Theorem, a? = a (mod p). 


Solution. Thirty-two solutions were submitted including one from the proposer. Several solvers 
pointed out that P. A. MacMahon [Proc. London Math. Soc., 23 (1891-2), 305-313] proved that 


* > $(d)a"/4 
n d|n 


is the number of circular permutations of n elements of a kinds, with repetitions allowed. This 
solves the problem. It was also pointed out that the problem is essentially the same as problem 
E 2242, two solutions of which appear in this MONTHLY [1971, 545-546]. 


ANSWER TO PHOTO ON PAGE 732 


The photo is of Ludwig Bieberbach (1886-1982), the proposer of a famous conjecture bearing 
his name. The question is in continuation of a similar one that appeared in this MONTHLY, 
February 1985, page 93. 
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THE TEACHING OF MATHEMATICS 
EDITED BY MARY R. WARDROP AND ROBERT F. WARDROP 


Material for this department should be sent to Professor Robert F. Wardrop, Department of Mathematics, Central 
Michigan University, Mount Pleasant, MI 48859. 


DISCRETE AND CONTINUOUS COMPOUNDING 
Morris MORDUCHOW 


Department of Mechanical and Aerospace Engineering, Polytechnic Institute of New York, Brooklyn, 
NY 11201 


In [1] R. C. Thompson shows that if g is the true growth rate in an inflationary economy, then 


(1) g=i-r 
if the interest rate i and the inflation rate r are compounded continuously, whereas 
1—r 
2 — 
(2) eT 14+r 


if the compounding is periodic. (In [2] Equation (2) is also presented, but in different terms.) 
There appears to be a contrast between the true growth rate in an inflationary economy, 
depending on whether the compounding is discrete or continuous. On the other hand, one expects 
that (2) approximates (1) if the compounding is performed over sufficiently small periods. The 
purpose of this note is to show how to modify (2) to take into account the frequency of 
compounding. 

Consider a basic time-unit, say one year, and suppose that the interest rate i 1s compounded 
“(1/n)thly”, so that in one year an investment of P dollars grows to 


(3) o-P(i++). 
After T years, the growth would be 

-\ nT 
(4) Q= P(1 + 4 | 


Similarly, at an inflation rate of r compounded (1/n)thly an item costing X dollars now would 
cost 


(5) y=x(1+7)" 


after T years. The value of (Q/Y) at any time T would denote the number of items that the 
investment could purchase at that time, and hence may be regarded as a measure of the 
purchasing power. The true growth rate g may accordingly be defined by the growth of (Q/Y), as 
follows: 
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Q_ oa g) " 
(6) yay 1+ a 
Substituting for Q and Y from (4) and (5) into (6), and solving for g, one arrives at 
1—r 
(7) g=—_... 


1+(7] 
n 
Equation (7) gives the required modification of (2) to take into account the frequency of 
compounding. Note that (7) becomes (2) for n = 1, and approaches (1) as n > oo. The discussion 


of the “inflation balancing principle” of [1] (which remains valid for all n) can similarly be 
modified to take into account the frequency of the compounding. 


References 


1. R. C. Thompson, The true growth rate and the inflation balancing principle, this MONTHLY, 3 (1983) 
207-210. 

2. Irving Fisher, Appreciation and interest, Publications of the American Economic Association, XI (August 
1896) 331-442. 
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ELEMENTARY PROBLEMS 


Solutions of these Elementary Problems should be mailed in duplicate to Professor D. H. Mugler, Department 
of Mathematics, University of Santa Clara, Santa Clara, CA 95053, by April 30, 1986. Please place the solver’s 
name and mailing address on each (double-spaced) sheet. Include a self-addressed card or label (for acknowledge- 
ment). 


E 3117. Proposed by William C. Waterhouse, Pennsylvania State University. 


The 1983 Putnam Competition asked entrants to show (in essence) that there exists a positive 
real number u such that [u”] is odd for (positive) odd n and even for even n. The published proof 
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[this MONTHLY, 91 (1984) 493] does not yield an explicit example. Find an explicit example of 
such a positive real number uw. 
E 3118. Proposed by Gérard Letac, Université Paul Sabatier, France. 


Consider 3 positive independent random variables U, X, Y such that U is uniform on (0,1) and 
max(UX, UY) is distributed like X. 


(1) If the distribution of Y is w(dy) = (1 + y)*1o, +00)() ay, show that X has distribution 

(2) If X and Y have the same distribution, show that there exists c > 0 such that cX has 
distribution p. 

E 3119. Proposed by Paul K. Stockmeyer, College of William and Mary, and Editors. 


(a) Prove that for every integer n > 3 there exist sets S, C [0,1] such that S, contains no 
arithmetic progression of length n and S, U {x} contains such a progression for every x € [0,1] 
— §,. 

(b) For each n > 3 exhibit such a set S, by explicit construction. 


E 3120. Proposed by Kurt C. Foster and Lee A. Rubel, University of Illinois. 


Let f and g be two twice-continuously-differentiable functions on the interval [0,1]. Define 


” , 2] —3/2 
K(x)(=F"(9[1 +(7709))] 7) 
to be the curvature of the graph of y = f(x) at the point (x, f(x)), with K,(x) similarly defined. 
Suppose f(0) = g(0) = 0 and f’(0) = g’(0) = 0, and that K,(x) > K,(x) for all x © [0,1]. Must 
g(x) > f(x) for all x € [0,1]? 


E 3121. Proposed by Calvin T. Long, Washington State University. 
Given any thirteen distinct real numbers, show that there exist at least two which satisfy the 
inequality 
eae a 2-3 | 
1+ xy 24+ 73 


E 3122. Proposed by Miha’ly Bencze, Brasov, Romania. 
Prove that 
* Uk + 2k +--+ +n* 
y, SS 


k=1 k-n* 


|e 


(1 +(-*"(7)) =(n+ yy 


SOLUTIONS OF ELEMENTARY PROBLEMS 
Irrational Series 


E 2923 [1982, 63]. Proposed by P. Erdés, Hungarian Academy of Sciences, and Claudia Spiro, 
University of Illinois. 


Let 1 < a, < a, < --- bean infinite sequence of integers. Prove that 
ioe) Jan 


n=1 


a,,! 
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is irrational. 


Solution by Ulrich Abel, Giessen, West Germany. We obtain the following generalization: 


a 


sn= roo 


n=1 n° 


is irrational for every rational r > 0. For the proof we assume S(r) = b/c for r=s/t with 
integers b,c, s,¢. Then we choose a prime number 


(1) p > max(s; 1; c; e”) 


so that c < a, < p for some n. Let N be the smallest integer with a,,,, > p. Then p is a divisor 
of 


an+1! 
2 at 
(2) a! 


and a, > c. Therefore 


ay!S(r) = an X (4,8) or + R 


is an integer. By Taylor’s formula, there exists a number h,0 < h < 1, with 


16) 
(ayy!) ri <R< » (n!) Pr" = (ayy i!) rave. 


n=a4n+] 
Hence R = (ay ,,!)/ r4"+1 e*, with 0 < x <r, and 


ay! an+1 
t%a,!R =—* (=) © expan 


Ayyii\t 


is an integer. Thus, by (1) and (2), p divides e*. But this is a contradiction to 0 < e* < e’ < p. 


Also solved by R. Breusch, E. Butler, F. Dodd, G. Ehrlich, S. M. Gagola, Jr., M. Golomb, M. F. Kruelle, L. M. 
Levine, O. P. Lossers (The Netherlands), J. P. Robertson, J. M. Stark, K. L. Stellmacher, University of South 
Alabama Problem Group, and the proposers. 


Fractional Powers of Binomial Coefficients 


E 2933 [1982, 212]. Proposed by Doug Hensley, Texas A & M University. 
Show that forl <k<n 


n l/n n l/n n l/n 
te) = (ea) Hata) - 


Solution by V. D. Mascioni (student), Swiss Federal Institute of Technology, Zurich, Switzer- 
land. We will show that the inequality 


nN 1/n n l/n n l/n n l/n 
(i) (ea) = leta) -G) 


is true for all k such that] <k <n. 
At first, we simplify both terms dividing or multiplying by the common factors and we get 


eim) -l@=pocern) *laeen) -liarw) 


738 PROBLEMS AND SOLUTIONS [December 
Or 
1 1 


1 Vint 1 1\” 
loam) lea Graey Uk) large Gc) 


Now, f(k) =k" —(n—k+1)'”" is a strictly decreasing function of k if1<k<n.In 
fact, 


f'(k) _ = (Koorrnn +(n —k+ 1) rd") <0 


if0<k<n+1.If k > n/2 we obtain also 


1 1 
(n _ ky" f( ) ki/n 
and the inequality is verified for k > n/2. Using the relation (" ) =(," ) and the fact that 
k > n/2 implies n — k < n/2, we immediately see that the inequality is true for all k such that 
1<k <n. Defining ") = 0, our result remains true for k = 0. 


f(k) > f(k +), 


k} 


Also solved by R. Breusch, S. Heller, O. P. Lossers (The Netherlands), A. K. Lyzzaik (Saudi Arabia), L. E. 
Mattics, and the proposer. 


A Generalization of a Putnam Definite Integral 
E 2960 [1982, 498]. Proposed by Malcolm J. Sherman, State University of New York at Albany. 


Evaluate 


P—_*—.. where a > 0. 
o (14+ x*)(1 + x?) 


Solution by Paul Smith, University of Victoria, Canada. Setting u = 1/x we obtain 


f ft 
1/t(1 + x*)(1 + x*) (1 + u?)(1 + u®) 
Thus 
—— tim (3 ax )-4. 
0 (L+x7)(1+x%) rol 241 + x? 4 
REMARKS 


(1) The restriction a > 0 is clearly unnecessary. 
(2) The published solution [1] to Putnam problem A3 (1980) shows that i me 
0 


for all a. The substitution of x = tan yields an alternate solution. 


du 
1+ tan*u 


_ 7 
4 
Reference 


1. L. F. Klosinski, G. L. Alexanderson, and A. P. Hillman, The William Lowell Putnam mathematical 
competition, this MONTHLY, 88 (1981) 605-612. 


Comment: As pointed out by M. S. Klamkin (Canada), the problem appears in G. Pélya’s Induction and Analogy 
in Mathematics, pp. 202, 275 with a generalization. 


Also solved by 63 other readers and the proposer. 
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The Limit Is Either 0 or a!/¢ 
E 2961 [1982, 498]. Proposed by Ivan Niven, University of Oregon. 


Find all triples of positive constants a,b,c such that 
lim (e+ 6} 
n-0oo Cc 
exists (as a finite limit), and determine its value. 
Solution by the proposer. The limit exists if and only if c>b+1. If c=b+1 the limit is 
a‘/*. If c > b + 1 the limit is zero. First assume c = b + 1. Set aside the obvious case a = 1, and 
write 


(Va + b)/c=1+(Va —1)/c=1+k,/n, 
by defining k, as n(Va — 1)/c. We note that 


(+) (1+ k,/n)" =[(1 + k,/n)"”™| 


Also we find that limk, = (log a)/c by applying L’Hopital’s rule to the quotient 
(al’/* — 1)/(c/x). Hence 


limk,/n=0 and lim(1+k,/n)"“" =e. 
Taking limits in (*) we get 


k, 


lim| (Ya + b) /c| "= lim(1 + k,/n)” = e€@89/¢ = Ql¥°, 
If c > b + 1, then, since limVa = |, there is a positive number h such that (Va + b)/c<1—-h 


for all n sufficiently large. Also, lim(1 — h)” = 0. If c < b + 1, there is a positive number h such 


that (Va + b)/c >1+h for all n sufficiently large. And, of course, (1 + h)” tends to infinity 
with n. 


Also solved by 43 other readers, who all used essentially the same approach. 


Lattice Partitionings 


E 2994 [1983, 287]. Proposed by John E. Wetzel, University of Illinois. 


Let Z* be the set of lattice points in R%, the set of points all of whose coordinates are integers. 
For any subset A of Z% write D(A) for the distance set of A, the set of distances XY for X,Y 
in A. 

(a) Show that Z@ can be partitioned into two subsets A and B so that neither 9(A) nor D(B) 
equals D(Z*). 

(b) Show that if A U B = Z4 but A M B # @, then either D(A) or D(B) must equal D(Z*). 


Solution by Michael Josephy, University of Costa Rica, San José, Costa Rica. 
(a) Take 


A= { x = (x1,..., xg) Ee Z4:)' x, even}, 
B= { x = (x,,...,Xg) E74: ) x; odd}. 


Then 1 belongs to neither 9(A) nor D(B). 
(b) If A 1 B # @, we can take (without loss of generality) 0 € A M B. Suppose m = x; 
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+ ++» +x2, but /m € Q(B). Then x = (x1,.--5Xy) EA. 
Let us prove (A) = B(Z*). If Vn € B(Z“), then n = y? +--+ +y?. Let y = (y,.--5 Vy): 
If y€A orx+y €A, then 
Vn =|ly -— 0 = [K(x + y) - 2 
is in B(A). Otherwise, y € B and x + y € B, whence 
vm =|(x+y) —yll€ Q(B), 
a contradiction. 


Also solved by C. Hurd, A. Kovaéec (Austria), G. S. Lessells (Ireland), O. P. Lossers (The Netherlands), M. D. 
Meyerson, N. Miku (The Netherlands), D. Moews, Y. Peres and N. Kremerman (Israel), D. A. Rawsthorne, D. 
Richman, A. Rosenfeld, D. K. Skilton, and N. C. Wormald (Australia), H. G. Williams, and the proposer. 


ADVANCED PROBLEMS 


Solutions of these Advanced Problems should be mailed in duplicate to Professor D. H. Mugler, Department 
of Mathematics, University of Santa Clara, Santa Clara, CA 95053, by April 30, 1986. The solver’s full 
post-office address should be on each sheet. 


6505. Proposed by Harry Gonshor, Rutgers University. 


Prove that for any function f: R — R there is a function g which differs from f on a set of 
measure 0 and has the Darboux (i.e., intermediate value) property but is not continuous. 


6506. Proposed by Ronald Evans, University of California, San Diego. 
Let p > 3 be prime and write f(n) for the Legendre symbol (n/p). Prove that 


Xs + 2)(1 — f(4m + 5)) =1. 


SOLUTIONS OF ADVANCED PROBLEMS 
An Inequality for Rational Functions 


6462 [1984, 371]. Proposed by J. LeVeque and Lloyd N. Trefethen, Courant Institute of 
Mathematical Sciences, New York University. 


Let S be the unit circle (or any other circle) in the complex plane. Let r be a rational function 
of type (n,n) with no poles on S. 

(a) Show that ||r’||, < 4an||r||,, where || - ||, and ||-||,, are the L, and L, norms on S. Note 
that ||r’||, is the arclength of the image of S under r. 

(b)* Show that ||r’||, < 27n|lr\l... 


Solution of part (a) by James C. Smith, University of South Alabama, Mobile, Alabama. Take 
S to be the unit circle and let k = max{n, m}. We will show that, if r is a rational function of 
type (n,m) with no poles on S, then 


Ir’lh < 27k +n + m)|IIrIL.. 


When m = n the bound 2n(z + 2)||r||,, is better than 47n||r||,,. 
We have r = P/Q where P and Q are polynomials of degree n and m, respectively. Let 


f(0) = |r(e’®)| and g(@) = arg(r(e’’)). Then f7(0) = r(e’’)r(e’’), where r(e’’) is a rational 
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id i(n—m)0 


function in e " of type (n,m). It is easy to check that e 4 f? can be expressed as 4/B, 


where A and B are polynomials in e’” and the degree of A does not exceed 2(n + m). Thus 
dr iap 4 pe 


for at most 2(n + m) values of @ in [0,277). It follows that, for some a, [a, a + 27) is the union 
of at most 2(m + m) disjoint half-open intervals on which f(@) is monotone and nonnegative, and 
hence that 


i'n = f PF Un + m)I If lle 


Note that r can be written as the product of linear fractional transformations say T,, 7,,..., T;,. 
Since each 7; maps S one to one and onto a circle, we have 
27 d 
[gga] < 20 


Now, g(@) = Zarg(7;) and so ||g’||, < 27k. 
In conclusion, since r(e) = f(@)e8,, we have 


r’ = ie'Sfg’ + e'8f". 
Hence 


I'l <Ifg’lh + IAIh < 27k filo + 2(2 + m)IIf llc = 2n( 7 + 2) I7leo 


The proposers also solved part (a). Part (b)* remains open. 


A Least Upper Bound 


6464 [1984, 440]. Proposed by Roger Cooke, University of Vermont. 


What is the least upper bound of real numbers b such that there exists a continuous real-valued 
function f(x) satisfying 


xf(x) — f (FC) + (1?) dt — bin(x) > 00 as x > 00? 
1 
Solution by The University of South Alabama Problem Group, Mobile, Alabama. Set h(x) = 


xf(x) + (4). Then we must find the least upper bound L of real numbers 5 satisfying 


4 
If h(x) = 0 and b < 4, then the limit is indeed infinite, so L > 4. On the other hand we will 


h(x) — (OO a -(b~ G)im(x) + as x > 00. 


2 
show that h(x) — i May dt takes on non-positive values for x arbitrarily large. 
1 


Suppose, to the contrary, that there exists a continuous function h(x) and an x, > 1 such that 
for x > Xo, 


h(x) - eo dt > 0. 


2 
x H’ 1 
Set H(x) = I A(t)” dt, then for x > x), h(x)? = xH’(x) so (x —. Integrating both 
1 of H( x x 
sides of this inequality from x) to x, we have 


H(x)) — H(x)~* > In(x) — In(x9). 
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Consequently H(x) must be negative when x is sufficiently large, and this is a contradiction. 


Hence L = } and } is not taken on by Db. 


Also solved by Nicholas Passell, Douglas B. Tyler, J. Vukmirovi¢ (Yugoslavia), and the proposer. 


On Areas of Triangles 


6466 [1984, 441]. Proposed by I. J. Schoenberg, University of Wisconsin-Madison, from a 
problem seminar of A. Ostrowski (1924). 


Let F be a compact plane set, not contained in a line. Let 
I = sup{ area of triangle ABC: {A,B,C} Cc F}, 
J = inf{ area of triangle ABC: F C closed triangle ABC}. 
Show that 4/7 > J and that equality can occur. 


Composite Solution. Since F is compact there are points A, B,C in F such that J = area of 
triangle ABC. Let T be the closed triangle formed by the line / through A parallel to BC, the line 
m through B parallel to AC and the line n through C parallel to AB. Then 7 has area 41. 
Further F c T, for if X is a point off / (say) on the side away from BC, then triangle XBC has 
area greater than J. Thus J < 4/. 

Equality occurs when F is a circle or when F comprises the vertices of a square. 

Solved by Miroslav D. ASi¢é (Yugoslavia), Richard Goldstein, John M. Ingram, Jeong-Han Kim, Sr. (Korea), N. 
J. Lord (England), A. McD. Mercer (Canada), O. P. Lossers (The Netherlands), Victor Pambuccian (Romania), Pier 
Ivan Pastro (Italy), Richard E. Pfiefer, John H. Riley, Jr., C. R. Selvaraj, University of South Alabama Problem 


Group, J. Vukmirovi¢ (Yugoslavia), and the proposer. Victor Pambuccian showed by an elaboration of the above 
argument that “compact” can be replaced by “bounded”’. 


A Counting Problem 


6468 [1984, 441]. Proposed by Calvin T. Long, Washington State University. 


If a and n are integers with n > 1, prove that L,,,6(d)a”"/* = 0 (mod n). Note that if p is a 
prime, this reduces to Fermat’s Little Theorem, a”? = a (mod p). 


Solution. Thirty-two solutions were submitted including one from the proposer. Several solvers 
pointed out that P. A. MacMahon [Proc. London Math. Soc., 23 (1891-2), 305-313] proved that 


* 3 6(d)a"/4 
n d|n 


is the number of circular permutations of n elements of a kinds, with repetitions allowed. This 
solves the problem. It was also pointed out that the problem is essentially the same as problem 
E 2242, two solutions of which appear in this MONTHLY [1971, 545-546]. 


ANSWER TO PHOTO ON PAGE 732 


The photo is of Ludwig Bieberbach (1886-1982), the proposer of a famous conjecture bearing 
his name. The question is in continuation of a similar one that appeared in this MONTHLY, 
February 1985, page 93. 


REVIEWS 


EDITED BY ALLAN L. EDMONDS AND JOHN H. EWING 
COLLABORATING EDITOR FOR FILMS: SEYMOUR SCHUSTER 


Basic Algebra. By Nathan Jacobson. Volume I (1974) and II (1980). W. H. Freeman, San 
Francisco. Algebra (second edition). By Saunders Mac Lane and Garrett Birkhoff. Macmillan 
Publishing Company, New York, 1979. 


KENNETH BOGART 
Department of Mathematics, Dartmouth College, Hanover, NH 03755 


It is ironic that two of the most modern texts in abstract algebra, two of the best examples of 
the power of abstraction, have neither the word “modern” nor the word “abstract” in their titles. 
This modesty signals the beginning of a new style in abstract algebra texts. More than a decade 
ago, this reviewer lamented the difficulty of teaching to undergraduates from the first edition of 
Algebra, a book which was the first to depart significantly from the Van der Waerden tradition. 
The second edition maintains this departure but as a result of reorganization is much more 
accessible to undergraduates. The first volume of Jacobson’s Basic Algebra (BAI) appears to be 
much more in the Van der Waerden tradition, but has been written so that the nontraditional 
beginning of Volume II (BAIT) is no surprise. 

What are the ingredients that mark the departure from tradition? The most striking change is 
the early use of universal mapping properties in characterizing algebraic objects. Informally, a 
universal characterization of an object describes a property of the object and then asserts that 
anything else with the property is related by a morphism to the original object. Typically, the 
property can be expressed in terms of morphisms as well. Thus, universality is an idea of category 
theory. (In much the same way that group theory abstracts the study of the automorphisms of a 
single mathematical object, category theory abstracts the study of homomorphisms among 
mathematical objects.) As we learn in Algebra, virtually all the familiar algebraic constructions— 
rings of quotients, homomorphic images, direct products, even the natural number 1—are 
universal constructions. Chapter 4 of Algebra explains universality in terms of functors to the 
category of sets. When they are introduced in Chapter 4, categories and functors appear just as 
natural a part of algebra (and Algebra) as groups and homomorphisms. 

Jacobson uses universal characterizations to good effect throughout BAI without stopping to 
describe how they are all manifestations of the same idea. In Chapter 1 of BAII, he gives us a 
discussion of category theory that expresses universality in terms of functors between arbitrary 
categories. The uninitiated reader might not recognize the extent to which universality pervaded 
BAI from the discussion in BAII, but the reader astute enough to ask the “obvious” questions will 
have the reward of being able to answer them. Later chapters of BAII—especially those on 
modules and homological algebra—make excellent use of the categorical underpinnings from 
Chapter 1. 

A second departure from tradition of these texts is the introduction of ideas from universal 
algebra. In universal algebra, we study algebraic systems (sets with operations defined on them) in 
general rather than concentrating on a particular system. We study concepts such as homomor- 
phisms, product constructions, subalgebras, etc., that can apply to all these systems. Further, we 
study families of algebras satisfying certain axioms; if these happen to be the axioms of a familiar 
system such as a group, then we are studying properties that distinguish that system from others. 
In Algebra the principal ideas from universal algebra are the almost traditional lattice theoretic 
ideas. Lattices are used primarily as an interesting algebraic system rather than being applied to 
the study of other algebraic systems. In BAII, however, Chapter II is almost a course in universal 
algebra, covering the basic isomorphism theorems, direct and inverse limit constructions, free 
algebras and varieties of algebras. These topics do not play a heavy role in the remainder of the 
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volume; this is more a testimony on the lack of influence of universal algebra on recent 
developments in “core” algebra than it is a criticism of the text. What is unfortunate is that 
neither text adopts from universal algebra the appealing idea of the equivalence of morphisms and 
congruence relations—a congruence relation for an operation like group composition preserves 
that operation in the same way that congruence modulo n preserves addition and multiplication. 
This idea ties together all the basic homomorphism theorems of groups, rings, vector spaces, etc., 
as manifestations of a common idea, and, for example, makes it natural that only normal 
subgroups appear in the homomorphism theory of groups while all subspaces appear in the 
homomorphism theory of vector spaces (and no sublattice of a lattice plays the role of the kernel 
of a homomorphism). The fact that virtually all our familiar algebraic systems are special cases of 
monoids is clear in both books; this point of view is certainly in harmony with universal algebra. 

In both texts, ideas involving linearity are discussed for modules over rings rather than vector 
spaces over fields. This aids our real understanding of these ideas by showing what properties of 
the rings are actually used at each stage of development. Even better, ideas from different areas of 
the traditional organization appear as corollaries of the same idea. For example, in Algebra 
determinants and tensor products live together and appear later with exterior algebras (an even 
more natural marriage than determinants and tensors). In BAI, determinants appear with exterior 
algebras. In both, the theories of finite abelian groups and the rational canonical form cohabitate. 

Algebra follows one tradition from the Birkhoff and Mac Lane “Survey Series”, namely linear 
algebra’s role is more predominant than in other texts. Perhaps 40% of the book consists of topics 
we would normally associate with linear and multilinear algebra. This book is well suited to give 
all the advanced training in algebra an undergraduate needs. In fact, as one who teaches graduate 
courses in combinatorics using topics from widely separated branches of algebra, I would be 
delighted if my graduate students had mastered the concepts in Algebra. 

In its two volumes, Basic Algebra comprises an entire education, advanced undergraduate and 
graduate, in algebra, an education deep enough to serve as a springboard to research in most 
traditional areas of algebra. It is unlikely that many graduate students, even those specializing in 
algebra, have learned such a breadth of ideas in such depth. To my knowledge, only Bourbaki has 
attempted as deep a survey of algebra as appears in BAII. (Of course in their encyclopedic way, 
the Bourbaki group covers even more.) Though linear and multilinear algebra is woven throughout 
BAI and BAI], it does not play as prominent a role in Basic Algebra as in Algebra. With that 
exception, BAII takes us on a tour of “core” algebra including commutative algebra, the structure 
theory of rings and algebras, representation theory for groups, and field theory in addition to its 
category theory, universal algebra and homological algebra. (Homological algebra, which arose 
from work on chains and cochains in algebraic topology, is based on the study of sequences of 
modules connected by homorphisms. Homological algebra has provided proofs of results of 
algebraic interest, some appearing in BAII, as well as results of topological interest. In the guise 
of exact sequences, it has had an influence on Algebra as well.) 

Of course, it is unfair to expect a textbook to contain a complete compendium of results; 
however, I admit to disappointment that after developing the machinery needed to prove the 
unique factorization theorem for regular local rings, BAII does not include it even as a series of 
exercises. To a lesser extent, I felt a similar unsatisfied feeling when the results on modular and 
distributive lattices were not used in the appropriate places in ring theory (for example, a 
Dedekind domain is a domain whose lattice of ideals is distributive, and the modularity of the 
lattice of submodules of a module has not been exploited). When defining a coalgebra, Professor 
Jacobson comments that “It is nice to have a pretty definition, but it is even nicer to have some 
pretty examples.” His examples are undeniably pretty, but in neither one do we learn what makes 
the coalgebra structure of interest. Frequently, the examples leave something to our imagination in 
this way. Since there is a tradeoff between the amount covered and the details included in a book, 
we should not regard the rather telegraphic style of examples in BAII as a defect. However, even 
the experienced mathematician can profit from discussions of BAII with a more knowledgeable 
colleague. 
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What role can these books play in a college mathematics curriculum? Both Algebra and BAI 
are appropriate for a one or two semester course in abstract algebra. If this were an undergraduate 
course, I’d want to restrict enrollment to “honors” students, successful mathematics students who 
are motivated by a genuine interest in mathematics. I would also be much more comfortable if the 
students had had a linear algebra course—a highly geometric version of the sophomore level 
course offered by many institutions would be totally appropriate, and would make the student’s 
relationship with Algebra especially rewarding. With a strict selection of topics (omitting entire 
chapters), Basic Algebra I seems ideally suited for a two semester or three quarter course which 
would cover all the standard topics (including Galois theory) in the groups-rings-fields-vector 
spaces tradition. However, separate study in linear algebra would be essential to fill out the 
traditional program. Algebra would shine in a three semester sequence. Again, the sequence 
would cover all the traditional topics (Galois theory is in the second edition of Algebra and was 
not in the first) but would also provide an advanced linear and multilinear algebra background 
and touch gently on categories. As undergraduate texts, both lack a sufficient variety of 
straightforward exercises to give students practice with routine techniques. Both also expect the 
reader to think about the examples rather than merely digest them. At times the attention to detail 
in Basic Algebra obscures the idea behind a proof. Thus an instructor using one of these books 
will still have some work to do. 

For graduate students, one might use portions of BAI in a one semester course or use Algebra 
in a two semester course. Either of these courses could be profitably followed by a course from 
BAII. How long would the course last? As long as possible! Realistically, there are at least four 
semesters worth of work in BAII. This volume is Professor Jacobson’s gift to those of us who use 
algebra; I believe that as many mathematicians as possible should be exposed to as much of it as 
possible. 

The authors of two of the most successful traditional series in abstract algebra have given us 
books with a fresh and much more modern perspective in algebra. We know now how algebra 
should be taught in the future. 


(The reviewer would like to thank Ernst Snapper for valuable conversations about this review.) 


The Analysis of Linear Partial Differential Operators, Vols. I & II. By Lars Hoérmander. 
Grundlehren der mathematischen Wissenschaften 256 & 257, Springer-Verlag, Berlin-Heidel- 
berg, 1983. ix + 391 pp. 


MICHAEL E. TAYLOR 
Department of Mathematics, SUNY at Stony Brook, Stony Brook, NY 11794 


History. Calculus was perfected by Newton to formulate and solve differential equations 
arising in mechanics. Work on continua, modeling things such as moving fluids and vibrating 
solids, quickly led to the formulation of partial differential equations, arid the subject of PDE has 
enjoyed a long and productive history. A number of important developments in analysis have 
been motivated by, and particularly effective in, the development of linear PDE, the subject of the 
volumes by Professor Hérmander. 

What makes linear equations special is that the superposition principle applies: 


(1) Pu=f, Pvu=g=>P(u+av)=ftag. 


It was noticed early that numerous constant coefficient linear PDE that arose naturally had 
special solutions in terms of trigonometric functions, or exponentials, seen to be equivalent in view 
of Euler’s formula 


(2) e’* = cosx + isin x. 
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Thus it was tempting to exploit the superposition principle by representing general functions as 
superpositions of trigonometric functions. This idea was proposed by Daniel Bernoulli, as a tool 
for solving the wave equation 


(3) (0°/at? — d*/dx?)u =0 


arising in the study of vibrating strings (in the linear approximation). The idea was not accepted at 
that time, but it began to gain reluctant acceptance after being reintroduced by Fourier as a tool 
in his investigation of the heat equation 


(4) (d/dt — 0°/dx*)u=0. 
Today, the Fourier transform, defined for integrable functions on R” by 
(5) F f(é) =f(E) = (27) "7 [f(xer*™* dx, 
is a cornerstone of analysis. The Fourier inversion formula returns f(x) from f: 
(6) fx) = (2m)? f F(a) ed 
The Fourier transform intertwines differentiation and multiplication: 
(7) F (d/dx,) = if F, Fx, = i( d/dt,) F, 


and hence it is often effective in reducing constant coefficient PDE to algebraic problems, or to 
ODE with parameters. Note in particular that 


(8) A=@/dxi +--+ +0°/dx? = FA = —|é\’F; 


A is called the Laplace operator. The wave and heat equations in n space variables are, 
respectively, 


(9) (a*/ar? — A)u=0, 
and 
(10) (d/at— A)u=0. 


Natural exploitation of the Fourier transform led people to the limits of understanding the 
integral as it was understood in the eighteenth and nineteenth centuries, and the production of 
natural classes of functions (or other objects) on which the Fourier transform operates was 
achieved only in this century. A milestone was Lebesgue’s solution of the problem of producing 
the right notion of integral of functions. Out of this came the Banach spaces L’?(R”), 1 < p < 00, 
and particularly the Hilbert space L?(R”), which played a major part in the theory of functional 
analysis developing around the turn of the century. One aspect of the Fourier inversion formula is 
that F is a unitary operator on L?(R"). 

The development of theories of bounded operators on Banach and Hilbert spaces afforded 
only indirect attacks on partial differential equations, through the use of integral equations. 
One extension of this was the study of unbounded operators on Hilbert space, pursued by 
von Neumann, and later by K. O. Friedrichs and many others, which has been very influential. 
But the major advance was surely the creation of the theory of distributions. 


Distributions. Distributions such as the Dirac delta function 6(x), defined by 


(11) [f(*)8(x) dx = f(0), 


had arisen as a convenient language for describing certain “ideal” solutions to PDE. It was 
realized that, in some sense, 5(x) was the derivative of the Heaviside function H(x), defined to 
be 1 for x > 0, 0 for x < 0. L. Schwartz showed in a very elegant fashion that the methods of 
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functional analysis could be applied to the study of such “generalized functions,’ to produce 
topological vector spaces (particularly ’( R”), &’( RR”), and Y’(R”) as described below) contain- 
ing all functions in L'(R”) (compactly supported in the case of &’(R”)), and also closed under 
the action of differentiation. The Schwartz kernel theorem provided the basis of treating general 
operators as being generalized integral operators. 

Also, the neatest presentation of Fourier analysis on R” was seen to exploit the Fourier 
transform on the Schwartz space “(R”), defined by 


(12) ue A(R") @ x®D*%u € L?(R") forall a, B. 

Here x? = xf --- x8», and D* = DM --- De, with D =id/dx,. One has ¥: S(R") > 
S(R"), and the Fourier inversion formula implies ¥ is an isomorphism. By duality there is an 
isomorphism 

(13) F: S'(R") > F(R"); 

the space ¥’(R”) of tempered distributions is the space of continuous linear functionals on 
F(R"), 1.e., its dual space. There are natural inclusions 


(14) P(R") Cc 12(R") CPR"), 
induced by 
(15) (u,v) =f u(x)o(») dx. 


The space 9’( R”) is dual to C5°( R”), the space of smooth, compactly supported functions, and 
the space &’( R”) is dual to C°(R”). 


Distributions and PDE . With the machinery of distributions, it is convenient to formulate and 
exploit the notion of fundamental solution to a linear PDE: 
(16) PE, = 8(x—y). 
If P has constant coefficients, one can take E,(x) = E(x — y). In general, E will be a 


distribution. One has a similar notion of fundamental solution of an initial value problem. For 
example, the fundamental solution to the heat equation would satisfy (10) for t > 0 and also 


(17) lim E(t, x) = 8(x). 


Let us indicate how Fourier analysis produces this fundamental solution. If E(t, £) denotes the 
pa:tial Fourier transform with respect to x, thg PDE (10) leads to 


(18) (d/dt) E(t,&) = —|€PE(t, &), 
and (17) yields 

(19) E(0,£) = (27). 
Hence 

(20) E(t,£) =(27) "ee. 


Calculating E(t,x) from (20) is equivalent to computing the Fourier transform of a Gaussian 
function, a basic computation in the subject; one obtains 


(21) E(t,x) = (4nt) "eo PP /4", 
Similarly, the Poisson kernel P(y, x), satisfying 
(22) (a/dy* + A)P(y,x)=0, fory>0,x € R’, 


lim P( y,x) = 6(x), 
ylO 
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is characterized by its partial Fourier transform with respect to x: 
(23) P(y,€) = (29) e8, 


The computation of the inverse Fourier transform of this is easy if n = 1, but not so simple if 
n > 1. One way to obtain a formula for P(y, x) is to exploit the “subordination identity”: 


(24) e él _ sy? [Per Meni -9/? dt. 
0 
Application to (23), together with (21), gives 


—(n+1)72 
(25) P(y,x) =, y(y? + |xP) OO 


In order to prove (24), one might note it suffices to obtain an independent proof of its implication 
(25), for any one special case. As the direct verification that (23) implies (25) for n =1 is 
Straightforward, one has a proof of (24). The formula (25) for P(y,x) can be analytically 
continued in y for Rey > 0, and one can pass to the limit of purely imaginary y, to obtain 
formulas for the fundamental solution to the wave equation: 


(26) (a?/atr?-A)W=0, W(0,x)=0, (d/dt)W(0,x) =8(x), 
which satisfies 
(27) W(t,é) = |é|"+ sin z]é]. 


If we note that the inverse Fourier transform of (27)~"/7|&|" !e~”'§! is obtained by integrating (25) 
from y to oo, so is equal to 


(28) cn(y? + |x| 
for n > 2, it follows that 


2 D/2 


(29) W(t,x) =lime’ Im (|x|? —(t- ie) 
elO 


In particular, for n = 3, one obtains the formula 
(30) W(t,x) = (4at) *8(|x| — t). 


Integral formulas equivalent to these formulas for fundamental solutions are classical, of course, 
but the calculations (18)-(30) might give a feel for the utility and power of distribution theory and 
Fourier analysis. 

A precise study of constant coefficient equations often enables one to get a hold on variable 
coefficient PDE, treating P(x, D) = Lig <m4q(x)D* as a perturbation of a constant coefficient 
operator obtained by freezing the coefficients. For example, if P(x, D) is elliptic, which means 


(31) Pi(x,é)= Dd a,(x)é&* #0 for € # 0, 
|a|=m 

then fundamental solutions for the constant coefficient elliptic operators can be pasted together to 
give the leading term in an inverse of P(x, D). An iterative process will improve this approxima- 
tion. For hyperbolic equations, such as the wave equation (9), in the variable coefficient case, 
more powerful methods are required. The volumes under review concentrate on the study of 
constant coefficient operators and some results on variable coefficient operators which one can 
deduce by such perturbations. More powerful methods, involving particularly pseudodifferential 
operators and Fourier integral operators, will be developed in following volumes by the author 
and applied to large classes of variable coefficient PDE. 


The Book. The first of Hérmander’s volumes is devoted primarily to the theory of distribu- 
tions. It starts on a very basic level, with a detailed study of calculus of C” functions of several 
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variables, and proceeds with masterly organization to develop the theory to an advanced level. 
Specific fundamental solutions such as mentioned above are constructed. Actually, the author 
develops the material quite far before introducing the Fourier transform. Homogeneous distribu- 
tions are introduced, and the behavior of A on R” under dilations leads us to look for a 
homogeneous distribution as a fundamental solution (at least for n > 3); rotational symmetry 
settles what it has to be, up to a scalar multiple. Change of variables enables one to treat any 
second order constant coefficient elliptic operator La,,D,D,, with La,,& §, positive definite. 
Analytic continuation is used to pass to fundamental solutions of second order hyperbolic 
operators. When the Fourier transform is introduced, some of these formulas are rederived, 
though by that point the emphasis is more on general classes of constant coefficient PDE, where 
analytical identities are not so available. One has for example the construction of a fundamental 
solution for an arbitrary constant coefficient operator P(D). Use of Fourier analysis leads to 
studying P(é)~', for € in the complex domain, and one needs a detailed understanding of the 
behavior of polynomials near their zero sets. 

In studying solutions to a PDE Pu =f, one thing that is important is a description of the 
singularities of the solution. In particular, one wants to describe the singularities of the fundamen- 
tal solution. For the wave equation in n space variables, formula (29) shows the fundamental 
solution is singular on the cone |x| = |t|. A refinement of the notion of singular support has been 
introduced, the wave front set. Given a distribution u € 9’(R"), WFu is a subset of the 
cotangent bundle T*R”, projecting onto the singular support of u in R”, telling not only where u 
is singular, i.e., where one cannot introduce a cut-off ~ so that ~u € C>°, but also where the 
Fourier transform of Wu fails to tend rapidly to zero. In the case of a distribution with a simple 
singularity along a smooth surface, WFu typically consists of all nonzero cotangent vectors 
normal to the surface. It is shown that for elliptic P, even with variable coefficients, WF( Pu) = 
WFu. This contains the elliptic regularity theorem. Volume I also considers notions of wave front 
set associated to other categories than C”, for example the analytic wave front set, and ends with 
an introduction to hyperfunctions, an analogue of distributions, consisting of locally finite sums of 
analytic functionals. The theory of hyperfunctions has been developed quite far by M. Sato and 
collaborators, making heavy use of homological algebra. The introduction given here simplifies 
some of the foundations of that subject. 

With the basic theory of distributions and Fourier analysis thoroughly set down, Volume II 
proceeds deeper into the study of linear PDE, particularly with constant coefficients. Detailed 
analysis is made of the regularity of certain fundamental solutions of general constant coefficient 
operators P(D), including a classification of when P(D) is hypoelliptic, 1.e., satisfies 


(32) sing supp Pu = singsupp u, 


or even WF( Pu) = WFu. A special class of hypoelliptic operators is the class of elliptic operators; 
the heat operator is also hypoelliptic. The wave front set of a fundamental solution E of P(D) is 
studied in great detail. Also questions of global solvability of an equation P(D)u =f ona 
domain Q ¢ R” are studied. Topics on variable coefficients include operators with “constant 
strength,” and also a chapter on scattering theory, for short range potentials. As mentioned, 
subsequent volumes will go more deeply into the study of variable coefficient linear PDE. 

These volumes are to some degree a greatly extended rewrite of Hérmander’s Linear Partial 
Differential Operators, published by Springer-Verlag in 1964. They cover an enormous amount of 
analysis. In addition to topics mentioned above, there are self-contained treatments of topics 
ranging from the Gauss-Green formula and Cauchy’s integral formula, to a complete though brief 
discussion of manifolds, differential forms, and Hamiltonian vector fields, to the Malgrange 
preparation theorem and the Tarski-Seidenberg theorem. Some of these topics will be more 
completely appreciated when they are used in subsequent volumes. The two volumes which are 
out, and their companions which will follow, will not likely serve as the texts for one’s first brush 
with PDE, but the serious analyst will find here an elegant presentation of a vast amount of 
material on linear PDE, by a consummate master of the subject. 
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Random Walks and 
Electric Networks 


by J. Laurie Snell and Peter Doyle 


xiii + 159 pages. Hardbound 
MAA Member: $18.50 List: $24.00 


In this newest addition to the Carus Mathematical Monographs, the authors 
examine the relationship between elementary electric network theory and ran- 
dom walks, at a level which can be appreciated by the able college student. We 
are indebted to them for presenting this interplay between probability theory 
and physics in so readable and concise a fashion. 

Central to the book is Polya’s beautiful theorem that a random walker on an 
infinite street network in d-dimensional space is bound to return to the starting 
point when d=2, but has a positive probability of escaping to infinity without 
returning to the starting point when d= 3. The authors interpret this theorem as 
a statement about electric networks, and then prove the theorem using tech- 
niques from classical electrical theory. The techniques referred to go back to 
Lord Rayleigh who introduced them in connection with an investigation of 
musical instruments. 

In Part I the authors restrict themselves to the study of random walks on finite 
networks, establishing the connection between the electrical concepts of cur- 
rent and voltage and corresponding descriptive quantities of random walks re- 
garded as finite state Markov chains. Part II deals with the idea of random walks 
on infinite networks. 


Table of Contents 


Part I: Random Walks on Finite Networks 
Random Walks in One Dimension 
Random Walks in Two Dimensions 
Random Walks on More General Networks 
Rayleigh’s Monotonicity Law 

Part Il: Random Walks on Infinite Networks 
Poyla’s Recurrence Problem 
Rayleigh’s Short-Cut Method 
The Classical Proof of Polya’s Theorem 
Random Walks on More General Infinite Networks 
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CARUS MATHEMATICAL 
ONOGRAPHS — 


Students and teachers of mathematics as well as nonspecialists and 
workers in other fields, will all appreciate the expository excellence of 
these monographs. A wide range of topics in pure and applied mathe- 
matics are covered, each one written by a renowned expositor in the 
field. A glance at the list of titles of the books and the list of authors will 
convince you of the quality of this resoected series of monographs. 
Eighteen titles are currently available: 


List Price MAA Member 


No. 1. Calculus of Variations, by G. A. Bliss $19.00 $14.00 

No. 2. Analytic Functions of a Complex Variable, 19.00 14.00 
by D. R. Curtiss 

No. 4. Projective Geometry, by J. W. Young 19.00 14.00 

No. 6. ‘Fourier Series and Orthogonal Polynomials, 19.00 14.00 
by Dunham Jackson 

No. 8. Rings and Ideals, by N. H. McCoy 19.00 14.00 

No. 9. The Theory of Algebraic Numbers (Second 19.00 14.00 
edition) by Harry Pollard, and H. G. 
Diamond 

No. 11. Irrational Numbers, by Ivan Niven 19.00 14.00 

No. 12. Statistical Independence in Probability, Analysis 19.00 14.00 
and Number Theory, by Mark Kac 

No. 13. A Primer of Real Functions (Third edition), 19.00 14.00 
by R. P. Boas, Jr. 

No. 14. Combinatorial Mathematics, by H. J. Ryser 19.00 14.00 

No. 15. Noncommutative Rings, by I. N. Herstein 19.00 14.00 

No. 16. Dedekind Sums, Hans Rademacher and Emil 19.00 14.00 
Grosswald 

No. 17. The Schwarz Function and its Applications, 19.00 14.00 
by P. J. Davis 

No. 18. Celestial Mechanics, by Harry Pollard 19.00 14.00 

No. 19. Field Theory and its Classical Problems, 24.00 18.50 
by Charles Hadlock 

No. 20. The Generalized Rieman Integral, by 21.00 15.20 
R. M. McLeod ¢ 

No. 21. From Error-Correcting Codes through Sphere 24.00 18.50 
Packings to Simple Groups, by Thomas M. 
Thompson. 

No. 22. Random Walks and Electric Networks, by 24.00 18.50 


J. Laurie Snell and Peter Doyle. 
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The Role of Mathematics in Science 
by Leon Bowden and M.M. Schiffer 

220 pp. Paperbound 

List: $16.00 MAA Member: $12.50 


This book is based on a series of lectures given over a period 
of years to high school mathematics teachers. The lectures 
focused on topics in mechanics, questions of population 
growth, probability and other uses of exponential functions, 
optics and application of matrices to relativity theory. 

The aim of the book is to illustrate the power and elegance 
of mathematical reasoning in science with some examples 
ranging from the work of Archimedes to that of Einstein. The 
book starts with problems of the lever, the mirror and the 
growth of populations and ends up with problems of space 
travel and atomic energy. 

This excellent book will appeal to the advanced high 
school student, to the undergraduate and to their teachers. A 
sampling of the Table of Contents should inspire you to order 
your copy now. 


Table of Contents 


Chapter 1. The Beginnings of Mechanics 

Chapter 2. Growth Functions 

Chapter 3. The Role of Mathematics in Optics 

Chapter 4. Mathematics with Matrices - Transformations 

Chapter 5. What is the Time? Einstein’s Transformation 
Problem 

Chapter 6. Relativistic Addition of Velocities 
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Studies in 
Computer Science 


Edited by Seymour Pollack 
408 pp. Hardbound List: $33.50 MAA Member: $25.00 


STUDIES IN COMPUTER SCIENCE continues the tradition of excel- 
lence of the respected series MAA Studies in Mathematics. Written by 
computer scientists for mathematicians, the book presents a readable 
and balanced discussion of the role of mathematics in computer sci- 
ence, and the contributions of computing to mathematics. 

Opening with an historical overview of the development of com- 
puter science, the narrative continues with three articles on the nature 
of computer programs and the programming process, a discussion of 
computational complexity, and a concluding group of articles on the 
impact of computer science on artifical intelligence, numerical analy- 
sis, statistics, and simulation. 

In the introduction the editor expresses his hope that the book will 
provide ‘an interesting look at an explosive field in the process of be- 
coming.’” What an understatement! The process is well advanced; the 
impact of computer science on mathematics and its teaching is already 
immense. And this book is not just interesting, it is exciting and impor- 
tant as background reading for anyone interested in the interactions 
between mathematics and computer science. Teachers will find that it 
enriches their teaching and students will find it rewarding supplemen- 
tary reading. Order your copy today! 
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Real Functions 


by Ralph P. Boas, Jr., 


Carus Mathematical Monograph, #13 
Third Edition 


Xi + 232 pages. Hardbound. 
List! $49.00 MAA Member: $414.00 


“A gold mine of interesting, uncommon insight and examples . . . an orderly 
composition of 24 partially independent elegant snapshots from the theory of 
sets and real functions.” Lynn A. Steen, commenting on the second edition of 
“A Primer of Real Functions” in The American Mathematical Monthly, 
1974. 

The Third Edition includes the most significant revisions to date of this 
classic volume. Terminology has been modernized, proofs improved, and sec- 
tions have been completely rewritten. Much new material has been added. The 
Primer contains the basic material on functions, limits, and continuity that stu- 
dents ought to know before starting a course in real or complex analysis. It is a 
good place for a student (or anyone else) to see techniques of real analysis at 
work in uncomplicated but interesting situations. 

The author says, ‘My principal objective was to describe some fascinating 
phenomena, most of which are not deep or difficult but rarely or never appear 
in textbooks or in standard courses.” Boas has achieved that objective. The re- 
sult is a book that should be on every teacher’s book shelf, in every college li- 
brary, and a part of every serious mathematics student’s experience. 


Order From: 

The Mathematical Association of America 
1529 Eighteenth Street, N.W. 

Washington, D.C. 20036 


December 1985 


Dear MAA Member, 


The MAA Board of Governors passed a motion to use a method of approval voting for the next 
election of the President and First Vice President in 1986. Their proposal still has to be approved 
by the members in January. 

APPROVAL VOTING is a significant concept in the area of applied mathematics. The method 
allows voters to vote for, or approve of, as many candidates as he wishes in a multicandidate 
race. The book, written by Steven Brams and Peter Fishburn, is a comprehensive text on the 
subject which puts the idea on solid scientific grounds. 

As an MAA member, it would be beneficial to read the book before the January meeting. To help 
you become more informed on this new system of voting, we are pleased to offer APPROVAL 
VOTING to MAA members at 25% discount. Simply mail this letter to: Birkhauser Boston, 38O 
Green Street, Cambridge, MA 02139, or call (617)576-6638. We will be happy to fulfill your 
order. 


Yours sincerely, 


ZV 


Klaus Peters 
Publisher, Birkhauser Boston 


APPROVAL VOTING / Softcover /224pp. ff 
LIST PRICE: $14.95 - SPECIAL OFFER: $10.75 ‘@ 
Valid until January 31, 1986 : 5 
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International Business Machines Corporation Scott Foresman & Company, Inc. 
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Carus Mathematical Monograph #21 


From Error Correcting Codes 
through Sphere Packings to 
Simple Groups, 

by Thomas M. Thompson 

224 pp. Hardbound 

List: $24.00 MAA Member: $18.50 


Two of the most fascinating problems to challenge mathemati- 
cians in recent years concern the construction of data transmis- 
sion codes that can correct errors introduced by static and the 
search for efficient ways to pack ping-pong balls into a box. Can 
one design the best error-correcting codes? Can one find the most 
efficient sphere packing? 

Therein lies a fascinating story which is told with great skill 
and clarity in this important addition to the Carus Mathematical 
Monograph series. The author has packed (sic) into 175 pages all 
of the basic mathematical ideas of this saga woven into a gripping 
historical account of the journey from error-correcting codes to 
sphere packings to simple groups. 

Table of Contents 
Chapter 1. The Origin of Error-Correcting Codes 
Chapter 2. From Coding to Sphere Packing 
Chapter 3. From Sphere Packing to New Simple Groups 
Appendix 1. Densest Known Sphere Packings 
Appendix 2. Further Properties of the (12,24) Golay Code 
and the Related Steiner System S (5,8,24) 
Appendix 3. A Calculation of the Number of Spheres with 
Centers in A, adjacent to one, two, three, and 
four adjacent spheres with centers in A». 
Appendix 4. The Mathieu Group M,, and the order of M>, 
Appendix 5. The Proof of Lemma 3.3 
Appendix 6. The Sporadic Simple Groups 


IAA STUDIES IN 
MATHEMATICS | 


Studies in Numerical Analysis 


MAA Studies in Mathematics #24 


Gene H. Golub, Editor 
415 pp. Hardbound. 
List: $42.00 MAA Member: $31.00 


This volume is a collection of papers describing the wide range of research ac- 
tivity in numerical analysis. The articles describe solutions to a variety of prob- 
lems using many different kinds of computational tools. Some of the compu- 
tations require nothing more than a hand held calculator: others require the 
most modern computer. While the papers do not cover all of the problems that 
arise in numerical analysis, they do offer an enticing and informative sample. 
Numerical analysis has a long tradition within mathematics and science, 
beginning with the work of the early astronomers who needed numerical pro- 
cedures to help them solve complex problems. The subject has grown and de- 
veloped many branches, but it has not become compartmentalized. Solving 
problems using numerical techniques often requires an understanding of sever- 
al of the branches. This fact is reflected in the papers in this collection. 
Computational devices have expanded tremendously over the years, and 
the papers in this volume present the different techniques needed for and made 
possible by several of these computational devices. 
Table of Contents 
The Perfidious Polynomial, James H. Wilkinson 
Newton’s Method, Jorge J. Moré and D. C. Sorensen 
Research Directions in Sparse Matrix Computations, Jain S. Duff 
Questions of Numerical Conditions Related to Polynomials, Walter Gautschi 


A Generalized Conjugate Gradient Method for the Numerical Solution of Elliptic 
Partial Differential Equations, Paul Concus, Gene H. Golub and Dianne P. 
O’Leary 


Solving Differential Equations on a Hand Held Programmable Calculator. J. Barkley 
Rosser 


Finite Difference Solution of Boundary Value Problems in Ordinary Differential 
Equations, V. Pereyra 


Multigrid Methods for Partial Differential Equations, Dennis C. Jespersen 
Fast Poisson Solvers, Paul N. Swarztrauber 


Poisson’s Equation in a Hypercube: Discrete Fourier Methods, Eigenfunction Ex- 
pansions, Padé Approximation to Eigenvalues, Peter Henrici 
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